From: "Hans Aberg"
Sender: "Mailing list for the LaTeX3 project"
To: "Multiple recipients of list LATEX-L"
Reply-To: "Mailing list for the LaTeX3 project"
Subject: Re: \InputTranslation
Date: Mon, 11 Jun 2001 14:09:26 +0100

At 00:10 +0200 2001/06/11, Lars Hellström wrote:
>In this case, I suspect
>the labels should be thought of as being nestable with separate markers for
>beginning and end, so that each token list that is formed gets delimited by
>matching begin and end labels that record the current context of the token
>list they were extracted from.
...
> then it doesn't matter if it is inserted into a French context table of
>contents. Upon being written to an external file, the labels should be
>converted to suitable markup.

This is also what I am saying: if one makes sure to nest those localization
contexts consistently, the logical part of it should be fairly
straightforward.

>An interesting question is whether these labels should be explicit tokens
>or be hidden from the user (i.e., argument grabbing and things like
>\futurelet look past them). Making them explicit tokens would probably
>break tons of code.

This is the hard part: how the localization contexts should be defined in
the input so as to be convenient for the user.
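As a sketch of the nesting idea above (in Python rather than TeX; the token representation and the `<begin ...>`/`<end ...>` markup are invented for illustration), a token list carrying explicit, properly nested begin/end context labels can be serialized to suitable markup with a simple stack check:

```python
# Illustrative sketch: token lists carry explicit begin/end context
# labels, nested like environments; writing to an external file
# converts the labels to markup.

def serialize(tokens):
    """Convert a labelled token list to markup text.

    ("begin", ctx) / ("end", ctx) labels become <begin ctx> / <end ctx>;
    plain strings pass through unchanged. A stack verifies that the
    labels nest properly."""
    out, stack = [], []
    for tok in tokens:
        if isinstance(tok, tuple) and tok[0] == "begin":
            stack.append(tok[1])
            out.append(f"<begin {tok[1]}>")
        elif isinstance(tok, tuple) and tok[0] == "end":
            assert stack and stack[-1] == tok[1], "labels must nest"
            stack.pop()
            out.append(f"<end {tok[1]}>")
        else:
            out.append(tok)
    assert not stack, "unbalanced context labels"
    return " ".join(out)

# A French fragment extracted into an English table of contents keeps
# its own context labels, so it stays correct wherever it is inserted:
entry = [("begin", "english"), "Contents",
         ("begin", "french"), "Résumé", ("end", "french"),
         ("end", "english")]
print(serialize(entry))
```

Because each extracted token list records its own context, the outer context at the point of insertion does not matter, which is exactly the property argued for above.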
>As for what the labels should be to the user, I think a scheme of making
>them integers is pretty useless (how they are implemented is of course
>another matter).

Localization numbers would be as accessible to the user as input encoding
numbers; that is, normally the user would not use them at all. Standardized
localization numbers also require that the localization specs can be made
so specific that one would normally not bother to override (parts of) them.

> A better idea would be to make them some kind of property
>lists, i.e., containers for diverse forms of information that are indexed
>by some kind of names.

The normal way is to make a lookup table, that is, a type of environment.
One then looks up a name recursively up through the tables towards the top,
taking the first occurrence, just as in any computer language with nested
scoping contexts.

However, as a matter of implementation, one would not want to carry that
much information around. Further, nothing says that different localizations
will have the same set of variables. So there must be a way to first
identify which localization to use, and from that proceed to look up
variables.

> Creating new label values from old by copying the
>values and then changing some would be useful when defining dialects.

The picture I have in mind is that all (standard) localization specs should
be loaded (as needed) in parallel; it is then easy to define a customized
localization spec by picking variables from already defined ones.
Alternatively, one defines entirely new localized variables.

Note that there is a difference in behavior: if I define my own localized
dictionary by copying, it will not change when any other localization
dictionary is updated. But if I define my localized dictionary on top of an
already defined dictionary, then the dictionary I use will change whenever
the underlying dictionary is updated.
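The two ways of defining a dialect discussed above can be made concrete with a small Python sketch (the dictionary contents are invented examples, not actual localization specs): copying gives a snapshot, while a lookup chain gives an environment-style recursive lookup that sees later updates to the underlying dictionary.

```python
from collections import ChainMap

# Illustrative sketch: each localization is a dictionary of named
# variables; a dialect can extend a base localization in two ways.

french = {"contentsname": "Table des matières",
          "chaptername": "Chapitre"}

# (a) Copying: a snapshot that does NOT follow later updates to `french`.
my_copy = dict(french)

# (b) On top of ("lookup chain"): unresolved names fall through,
# environment-style, to the underlying dictionary, so updates there
# are seen immediately; own bindings shadow the base.
my_chain = ChainMap({"chaptername": "Chap."}, french)

french["contentsname"] = "Sommaire"   # the base localization is updated

print(my_copy["contentsname"])   # snapshot: unchanged
print(my_chain["contentsname"])  # chain: sees the update
print(my_chain["chaptername"])   # own binding shadows the base
```

The chain also illustrates the recursive lookup "up through the tables towards the top": `ChainMap` searches its maps in order and returns the first occurrence of a name.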
These are different types of behavior, and I think one must accommodate
both.

>The main problem I see with context labels is that of when they should be
>attached,

This is the difficult one.

> I can think of at least three different models:
>
>1. Labels must be present in the input (e.g. encoded using control
>characters).
...
>2. Do as today, i.e., context switches are initiated when commands are
>executed.

Perhaps a hybrid: localization labels are not part of Unicode, but it may
be possible to define such formats using a suitable extension of the Omega
translator, and LaTeX may decide to use such formats for writing and
reading .aux files and the like.

On the other hand, one wants to have convenient context switches within
the LaTeX language itself.

Whether one uses long formats such as <begin french> ... <end french> or
shorter, character-based formats, is probably just a question of
optimization.

  Hans Aberg
