From: "Hans Aberg"
Sender: "Mailing list for the LaTeX3 project"
To: "Multiple recipients of list LATEX-L"
Reply-To: "Mailing list for the LaTeX3 project"
Subject: Re: GELLMU progress
Date: Sun, 7 Jan 2001 15:57:14 +0100
In-Reply-To: <200101061950.OAA03845@pluto.math.albany.edu>

At 14:50 -0500 1-01-06, William F. Hammond wrote:
>> If you are in the need of various translations, have you tried using Flex
>> (lexical analyzer generator) and Bison (parser generator, or
>> compiler-compiler), see
>
>Are you saying that it's easier to code translations from XML using
>lex and yacc descendants rather than using standard XML tools such as
>sgmlspl, jade, or xt?  I find that hard to believe.  (Of course, the
>situation before 1996 was different.)

I do not know exactly what you want to achieve: I get the impression that
you have a language of your own of some sort and want to be able to
translate it into different formats. If your language is just a dialect of
XML, and there are XML parser generators available similar to Bison, then
use one of those.

The translation I needed was as follows: from my own language, I want to
output C++ code. This proved very difficult, because local code generates
information (such as include files, declarations, and definitions) that
should be output in different places and files in the C++ output.

Therefore, instead of parsing immediately into a new language, I
invented an intermediate "formatting" language: given a set of macro
definitions, normally provided by a formatting file (thus supplying the
specific data of the output language, in my case C++), and a set of
iterated lookup tables (in internal binary format) produced by the parsing,
it knows how to piece together suitable output files.

The idea is to make the actual parsing as independent as possible of any
output language, only producing the lookup tables. Then, by merely switching
the formatting file with the macro definitions, one can generate output for
different languages.

>> -- I use them together with C++, which is convenient as the latter has
>> standard string classes.
>
>Although I've written in C, I've never gotten into C++.  Are there
>good regular expression libraries for C++?

If you need full regular expressions and a full LR(1) parser within your
language, then the simplest approach is to let your language output Flex .l
and Bison .y files; then compile these files using Flex and Bison, and
finally compile the resulting output with a C++ compiler. This is sort of a
standard technique: for example, the Haskell compiler GHC produces
.c files in this way.

Also note that Flex and Bison are themselves compilers, and one can
use Flex and Bison to write new versions of themselves. -- Actually, they
do. :-)

-- I only use C++ because it is convenient to produce an internal binary
representation, which later can be used to produce the C++ output format.
The iterated lookup tables I use are just
  map<string, variable>
(meaning that one can index a finite set of variables by string keys), where
"variable" is a class with suitable lookup information.
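Such an iterated table can be sketched in a few lines. The names here (`variable`, `table_t`, `append_entry`) are illustrative assumptions, not the actual Synergy code:

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Sketch: a variable holds a leaf string plus a sequence of nested
// scopes, each scope again mapping string keys to variables, so
// lookups can be iterated to any depth.
struct variable {
    std::string value;
    std::vector<std::map<std::string, variable>> scopes;
};

using table_t = std::map<std::string, variable>;

// Append a fresh scope under `key` and set `field` = `v` inside it,
// mirroring the [push_back] indexing used in the examples below.
inline void append_entry(table_t& t, const std::string& key,
                         const std::string& field, const std::string& v) {
    t[key].scopes.push_back({});
    t[key].scopes.back()[field].value = v;
}
```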

Let's take a simple example: in the output of my application, I need to
build a sequence of classes, which can have a sequence of methods, with
definitions that should be output in various places. The main point is that
one has a sequence of lookup localities, as in most modern computer
languages.

In my formatting file, I may have something like the stuff below. Here,
  <#header|...|header#>
or, equivalently,
  <#header|...|#>
encloses a macro definition, and <|header_name|> is an invocation of the
variable "header_name", and so on.

<#header|
#ifndef Synergy_<|header_name|>_header
#define Synergy_<|header_name|>_header

#if !__cplusplus
#error Header file "<|header_name|>" only for C++.
#endif

#include <stdexcept>

#include "data"
#include "construct"

<|header_preamble|>

namespace Synergy {

<|class.declaration|>

} // namespace Synergy

#endif // <|header_name|>_
|header#>

<#class.declaration|
extern Synergy::data global_<|class_name:cpp|>;
class <|class_name:cpp|> : public virtual construct {
public:
  static const char* category;
  static object_method_base* lookup_method(const std::string&);
  static Synergy::data global;
  class object;
  typedef <|class_name:cpp|> constructor;
  virtual root* clone() const { return new Synergy::<|class_name:cpp|>(*this); }
  virtual bool cloneable() { return <|object_cloneable|>; }
  <|object_copy_to_clone_method|>
  virtual Synergy::data method_method(Synergy::data&);
  virtual Synergy::data method_object(Synergy::data& x) { return new object(x); }
  virtual Synergy::data method_object_method(Synergy::data& x);
  <|constructor_method.declaration|>
  <|constructor_cpp.declare|>

  class object : <|object_base|><|object_cpp_base|> {
  public:
    static const char* category;
    static object_method_base* lookup_method(const std::string&);
    <|object_constructor.declaration|>
    virtual root* clone() const { <|clone_method_definition|> }
    <|copy_method|>
    <|object_data|>
    virtual Synergy::data method_constructor(Synergy::data&) { return Synergy::global_<|class_name:cpp|>; }
    virtual Synergy::data method_method(Synergy::data&);
    <|method.declaration|>
    <|object_cpp.declare|>
  };
};
|class.declaration#>

<#method.declaration|
virtual Synergy::data method_<|method_name:cpp|>(Synergy::data&)<|method_is_abstract|>;
|#>
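The <|...|> substitution these templates rely on can be sketched minimally as string splicing against a flat table. The parsing here is deliberately simplistic and the function name `expand` is hypothetical; the real formatter also handles nesting and iteration:

```cpp
#include <cassert>
#include <map>
#include <string>

// Minimal sketch of expanding <|name|> invocations in a template
// against a flat lookup table (no iteration, no nested scopes).
inline std::string expand(const std::string& tmpl,
                          const std::map<std::string, std::string>& vars) {
    std::string out;
    std::size_t pos = 0;
    while (pos < tmpl.size()) {
        std::size_t open = tmpl.find("<|", pos);
        if (open == std::string::npos) { out += tmpl.substr(pos); break; }
        std::size_t close = tmpl.find("|>", open + 2);
        if (close == std::string::npos) { out += tmpl.substr(pos); break; }
        out += tmpl.substr(pos, open - pos);
        auto it = vars.find(tmpl.substr(open + 2, close - open - 2));
        if (it != vars.end()) out += it->second;  // unknown names expand to nothing
        pos = close + 2;
    }
    return out;
}
```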


In my approach each variable can actually have a sequence of lookups
attached to it, so it becomes easy to sequence a series of classes with the
same template.

Suppose that we want to format a class named `foo' with an object method
named `bar' (among other data). Then the C++ code for that (the way I
implemented it) would look something like
    // Create a new class named "foo":
  (*table)["class"][push_back]["class_name"] = "foo";
    // Create a method named "bar" belonging to the last created class ("foo"):
  (*table)["class"][last]["method"][push_back]["method_name"] = "bar";

The formatter then uses this lookup table with the same kind of iterated
localities as in, say, TeX, or any other modern computer language: when one
prints out the "header" macro and it encounters the "class.declaration"
variable, it iterates through all classes using the "class.declaration"
macro definition. Then, within the "class.declaration" definition, when it
encounters "method.declaration", it iterates through all methods
_in_that_class_. If a name is not found locally, it iterates towards the
base to find a more global name.
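That last step, falling back towards the base when a name is missing locally, is ordinary chained scoping; a minimal sketch under assumed names (`locality`, `find`):

```cpp
#include <cassert>
#include <map>
#include <string>

// Sketch: each locality points at its enclosing locality; a name
// not found locally is searched for in the parent, like TeX groups.
struct locality {
    std::map<std::string, std::string> names;
    const locality* parent = nullptr;

    const std::string* find(const std::string& key) const {
        auto it = names.find(key);
        if (it != names.end()) return &it->second;
        return parent ? parent->find(key) : nullptr;
    }
};
```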

>> One approach is to parse objects into something like the DOM (Document
>> Object Model, http://www.w3.org/), and then onto that hook a program that
>> can translate into several different formats.
>
>Of course, sgmlspl, jade, xt, and other standard sgml/xml tools
>provide good frameworks for translating into as many different formats
>as one likes by writing, respectively, Perl, DSSSL, and XSLT.
>(Possibly also it would be viable to use David Carlisle's xmltex
>followed by Eitan Gurari's tex4ht in which case one writes TeX.)

So actually, I do not parse into a language, but into a binary model, which
has essentially the same general capacities (a local lookup system) as any
language. Then I use another program to format that into a suitable
language.

>  I wonder how some
>of these things would survive a double translation
>
>      gellmu/article ---(hypothetical)---> TEI ----> LaTeX .

So what I use is something like your "hypothetical" label here, except
that it is not a language that I use but a binary model, a sequence of
iterated lookup tables.

>2.  The default "article" document type for _regular_ GELLMU provides
>three character names for each of the 33 non-alphanumeric but
>printable ASCII characters.

As it is a binary model, such parsing concerns are irrelevant.

For example, I wanted to write classes with _arbitrary_ binary string
names, which does not work in C++, which only allows alphanumeric
names and underscore, with some restrictions. But it is easy to mangle
(encode) arbitrary binary string names, which I did by an addition to the
formatter; then it is also irrelevant what kind of parsing I use in my
original language to produce arbitrary binary string names.
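One way such a mangling can work, sketched here as an assumption rather than the encoding the formatter actually uses: keep alphanumerics, escape every other byte (including `_` itself) as a hex pair, which makes the mapping injective:

```cpp
#include <cassert>
#include <cctype>
#include <cstdio>
#include <string>

// Hypothetical mangling of an arbitrary binary string into a valid
// C++ identifier body; a real scheme would also guard against a
// leading digit, e.g. by prefixing "Synergy_" as the templates do.
inline std::string mangle(const std::string& name) {
    std::string out;
    for (unsigned char c : name) {
        if (std::isalnum(c)) {
            out += static_cast<char>(c);
        } else {
            char buf[4];
            std::snprintf(buf, sizeof buf, "_%02X", c);  // '_' itself becomes _5F
            out += buf;
        }
    }
    return out;
}
```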

If one plays this game long enough, one ends up developing a better and
better intermediate binary model. For example, suppose I want to write a
floating-point number. Right now, it would suffice to use, say, the C++
syntax, and parse such numbers as strings which are output verbatim in the
C++ files. But suppose I want to produce output for some language with a
different floating-point syntax than C++. Then it would be natural to
represent the floating-point numbers in some internal binary model, and add
to the formatter the capacity to write them out in different formats.
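As a sketch of that last idea (the target set and the Fortran-style `d0` suffix are merely illustrative): store the number once internally and let a per-language formatter pick the surface syntax:

```cpp
#include <cassert>
#include <sstream>
#include <string>

enum class target { cpp, fortran };

// Illustrative only: one internal double, two output syntaxes.
inline std::string format_float(double x, target t) {
    std::ostringstream os;
    os << x;
    std::string s = os.str();
    if (s.find('.') == std::string::npos && s.find('e') == std::string::npos)
        s += ".0";                               // make it read as a float literal
    if (t == target::fortran) {
        std::size_t e = s.find('e');
        if (e != std::string::npos) s[e] = 'd';  // 1e+20 -> 1d+20
        else s += "d0";                          // 1.5 -> 1.5d0
    }
    return s;
}
```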

Of course, my needs are specialized in OOPL -> OOPL language translations,
and DPL ("document PL") translations may have other needs.

  Hans Aberg
