MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----_=_NextPart_001_01C0901D.9B47D000"
User-Agent: Mutt/1.2i
Content-class: urn:content-classes:message
Subject:      Re: default inputenc/fontenc tight to language
Date: Tue, 6 Feb 2001 10:17:03 +0100
Message-ID:  <20010206101702.A5774@clipper.ens.fr>
From: "Eric Brunet" <ebrunet@CLIPPER.ENS.FR>
Sender: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
To: "Multiple recipients of list LATEX-L" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Reply-To: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Status: R

This is a multi-part message in MIME format.

------_=_NextPart_001_01C0901D.9B47D000
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

Sorry for replying late. Hey, in the internet age, a 4-day delay is a
long time...

Frank Mittelbach wrote:
> what i mean is that most people write their document in a single input
> encoding and do not switch that encoding (or even can switch) just
> because they switch from one language to another.
Sure. Mainly because people usually don't switch from one language to
another, or, if they do, it is usually between languages with compatible
encodings. But I imagine (maybe wrongly) that you need to switch
encodings when writing an English-Russian document.

> Anyway, for font encodings a default setting different from the system
> default (if necessary) does make sense and current babel already tries
> to do that, though as Denis report shows not always successfully.

I am happy to hear that. Now, about input encodings...

> furthermore, because of the argument that the input encoding doesn't
> really change "wenn ich jetzt in Deutsch schreibe" (both are latin1 as
> far as this mail is concerned) so for english and german and french it
> should probably be the same. ansinew because a lot of people use PCs? or
> latin1 because Linux is going to take over the world? or should it
> change in a year or two when the latter happens --- with the result that
> then older documents would compile incorrectly because they assume the
> no longer correct default?

I would go for latin1 as the default, because it is a more widely
accepted standard (in the sense that it is an ISO standard) than ansinew.
But we are lucky: ansinew and latin1 are compatible, in the sense that
latin1 is a subset of ansinew (there are 24 extra characters in ansinew,
in the 130--159 range), so a source.tex composed in the ansinew encoding
would be readable on a Unix system, except for some very rare characters.
Probably the best of all worlds would be to advertise and document that
latin1 is the default encoding (for standards compliance), and thus
encourage people to use \oe or \dots or -- instead of the characters 156,
133 or 150, but silently accept all the extra ansinew characters so that
careless Windows users don't get surprised.
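
To make the portable spelling concrete, here is a minimal sketch. It
assumes the inputenc package with its latin1 option; the sample sentence
is made up for illustration only.

```latex
\documentclass{article}
\usepackage[latin1]{inputenc}
\begin{document}
% \oe stands in for character 156, \dots for 133 and -- for 150,
% so this source is pure ASCII and reads the same under latin1 and ansinew.
Un c\oe ur, pages 10--12\dots
\end{document}
```

Written this way, the file contains none of the extra ansinew characters,
so it does not matter on which system it is edited.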

What is certain is that once a default encoding is chosen, it will be hard
to change it (the only way would probably be to change \documentclass
into \documenttype :-)

> finally applying the wrong input encoding to a document not in that
> encoding results in typesetting errors but not in compilation errors.
> true, this can also happen if you explicitly specify the wrong encoding
> but this is a conscious act (or so we would hope) and not something that
> happens behind the scene

I have seen many beginners who start typing their document in French
without declaring an inputenc, and do not realize at once that accents are
missing in the document. I would not call forgetting a \usepackage
declaration a conscious act.
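
For reference, a minimal sketch of the declaration in question, assuming
a source file saved in latin1. The fontenc and babel lines and the sample
sentence are my own additions for illustration, not a proposal for the
defaults under discussion.

```latex
\documentclass{article}
\usepackage[latin1]{inputenc} % the declaration that beginners tend to forget
\usepackage[T1]{fontenc}      % assumption: T1 fonts, so accented letters are single glyphs
\usepackage[french]{babel}
\begin{document}
% This file must be saved in latin1 for the 8-bit characters below to be read correctly.
Les accents sont tapés directement : é, è, à, ç.
\end{document}
```

Without the inputenc line, the same file still compiles, but the accented
letters do not come out as typed, which is exactly the silent failure
described above.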

> which reminds me: please take the list of languages babel currently
> supports and attach to them input/font encoding defaults that would be
> suitable, i would really be interested in see such a list (and have it
> discussed)

Oh, I am certainly not able to do that. If I were to make a choice, I
would use the appropriate latinxxx encodings for each language, but I am
certainly not qualified to choose for all those languages.

Éric

------_=_NextPart_001_01C0901D.9B47D000--