MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----_=_NextPart_001_01C0DD3F.9F336E80"
In-Reply-To:  <200105150843.f4F8hiI25065@smtp.wanadoo.es>
Content-class: urn:content-classes:message
Subject:      Re: Multilingual Encodings Summary 2.2
Date: Tue, 15 May 2001 14:04:31 +0100
Message-ID:  <l03130303b726ce5cc6ff@[130.239.20.144]>
From: =?iso-8859-1?Q?Lars_Hellstr=F6m?= <Lars.Hellstrom@MATH.UMU.SE>
Sender: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
To: "Multiple recipients of list LATEX-L" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Reply-To: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Status: R

This is a multi-part message in MIME format.

------_=_NextPart_001_01C0DD3F.9F336E80
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

At 11.40 +0200 2001-05-15, Javier Bezos wrote:
>>>The same should apply to, say, Greek. If I write "barbaros" [well,
>>>imagine it written in Greek] using the same beta, sometimes I would =
like to
>>>see the first one using a different glyph from the second one (a =
medial
>>>beta, not used currently).
>>
>> And then I suggest that you do this by selecting a (top level) font =
in
>> which the beta has the medial form, not by using a special =
\medialbeta
>> command or by requesting that the LICR should incorporate something
>> equivalent to this.
>
>Thus, we need 2 virtual fonts for every font encoding. Don't forget
>that iota can be rendered below a letter or "in-line" (2),
>iota and upsilon can be rendered inverted (2), and there is
>the lunate sigma (2). Since these options are independent, are you
>suggesting the creation of 16 (!) vf files and tfm files for every
>font (and encoding)? (And regarding that, Greek is easy compared
>with scripts like Devanagari or Arabic.)

If you have four characters with two glyph variants each and you write 16
documents in which you realize each of the possible selections of glyph
variants (one selection per document) then you deserve to have to use
different fonts for each document. However:

At 11.24 +0200 2001-05-15, Robin Fairbairns wrote:
>isn't this sort of thing done by ligaturing a "boundary character", so
>
>  "bnd" "beta" -> "initial beta"
>  ...
>  "sigma" "bnd" -> "terminal sigma"
>
>and so on?
>
>(i've never really understood boundary characters so i may have this
>wrong).

(Looks right to me, Robin.) If that is the kind of variants you are =
talking
about then of course they should all be in the same font, but the point =
is
that all these replacements of glyphs should be handled by the font, not =
by
the user or by LaTeX. In particular there shouldn't be an internal
character representation for the variants in LaTeX, because the variants
are semantically equivalent and therefore identical as characters.

The idea that OCPs could be used to select between variant glyphs in the
font is not without merit, though, _provided_ it does not mess up the =
LICR.
I'm also sceptical of the idea that the language should select a certain
variant form of a character, as that can become a severe drag on
font design. If anything, it should be the font (or its FD file) which
declares that "I provide the following variants of this character",
together with the code for choosing one or the other (this code could =
well
consist of pushing an OCP). Languages could by all means request a =
certain
variant form, but the font shouldn't always have to provide it!

In math fonts something like that could be used to handle the choice =
between
\epsilon and \varepsilon. As I understand it, these are semantically
equivalent---i.e., people will think you've done something wrong if you =
try
to use them both in the same formula to mean different things (but maybe
Barbara has some counterexample)---and so shouldn't have different =
internal
representations in LaTeX (as they do today).

Lars Hellstr=F6m

------_=_NextPart_001_01C0DD3F.9F336E80--