MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----_=_NextPart_001_01C07F27.F9EEC200"
In-Reply-To:  <200101061950.OAA03845@pluto.math.albany.edu>
Content-class: urn:content-classes:message
Subject:      Re: GELLMU progress
Date: Mon, 15 Jan 2001 11:20:28 +0100
Message-ID:  <v03110700b6887a6f7b7e@[195.100.226.143]>
From: "Hans Aberg" <haberg@MATEMATIK.SU.SE>
Sender: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
To: "Multiple recipients of list LATEX-L" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Reply-To: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Status: R

This is a multi-part message in MIME format.

------_=_NextPart_001_01C07F27.F9EEC200
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

At 14:50 -0500 1-01-06, William F. Hammond wrote:
>Although I've written in C, I've never gotten into C++.  Are there
>good regular expression libraries for C++?

I once wrote a regular expression -> NFA translator. If one wants to
translate into a DFA, one problem is that some regular words produces
exponential size DFA's. One way around this is to translate DFA =
transitions
dynamically, in which case both size and time can be made fast (space as
NFA, time as DFA).

I have never seen any such libraries, but they must exist, as there are
programs like grep and the like, which uses regular expression and must =
use
some kind of FA (finite automata) for string identification.

In your case, I think you want to attach rules to the parsing, so parser
generateors seem to be better. In addition, you probably have a limited
amount of programmer time at your disposal. So this speaks for relying =
on
existing parser generators. Once one knows what is needed, one can =
choose
other alternatives as a means of optimization.

If one does not want to use C/C++, there is a parser generator in =
Haskell
http://haskell.org/, called "Happy" I think. And for Java, there is =
ANTLR
http://www.antlr.org/.

  Hans Aberg

------_=_NextPart_001_01C07F27.F9EEC200
Content-Type: text/html;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Diso-8859-1">
<META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version =
6.5.7654.12">
<TITLE>     Re: GELLMU progress</TITLE>
</HEAD>
<BODY>
<!-- Converted from text/plain format -->

<P><FONT SIZE=3D2>At 14:50 -0500 1-01-06, William F. Hammond =
wrote:</FONT>

<BR><FONT SIZE=3D2>&gt;Although I've written in C, I've never gotten =
into C++.&nbsp; Are there</FONT>

<BR><FONT SIZE=3D2>&gt;good regular expression libraries for C++?</FONT>
</P>

<P><FONT SIZE=3D2>I once wrote a regular expression -&gt; NFA =
translator. If one wants to</FONT>

<BR><FONT SIZE=3D2>translate into a DFA, one problem is that some =
regular words produces</FONT>

<BR><FONT SIZE=3D2>exponential size DFA's. One way around this is to =
translate DFA transitions</FONT>

<BR><FONT SIZE=3D2>dynamically, in which case both size and time can be =
made fast (space as</FONT>

<BR><FONT SIZE=3D2>NFA, time as DFA).</FONT>
</P>

<P><FONT SIZE=3D2>I have never seen any such libraries, but they must =
exist, as there are</FONT>

<BR><FONT SIZE=3D2>programs like grep and the like, which uses regular =
expression and must use</FONT>

<BR><FONT SIZE=3D2>some kind of FA (finite automata) for string =
identification.</FONT>
</P>

<P><FONT SIZE=3D2>In your case, I think you want to attach rules to the =
parsing, so parser</FONT>

<BR><FONT SIZE=3D2>generateors seem to be better. In addition, you =
probably have a limited</FONT>

<BR><FONT SIZE=3D2>amount of programmer time at your disposal. So this =
speaks for relying on</FONT>

<BR><FONT SIZE=3D2>existing parser generators. Once one knows what is =
needed, one can choose</FONT>

<BR><FONT SIZE=3D2>other alternatives as a means of optimization.</FONT>
</P>

<P><FONT SIZE=3D2>If one does not want to use C/C++, there is a parser =
generator in Haskell</FONT>

<BR><FONT SIZE=3D2><A =
HREF=3D"http://haskell.org/">http://haskell.org/</A>, called =
&quot;Happy&quot; I think. And for Java, there is ANTLR</FONT>

<BR><FONT SIZE=3D2><A =
HREF=3D"http://www.antlr.org/">http://www.antlr.org/</A>.</FONT>
</P>

<P><FONT SIZE=3D2>&nbsp; Hans Aberg</FONT>
</P>

</BODY>
</HTML>
------_=_NextPart_001_01C07F27.F9EEC200--