From: "Robin Fairbairns"
Sender: "Mailing list for the LaTeX3 project"
Reply-To: "Mailing list for the LaTeX3 project"
Subject: Re: Unicode/UTF-8 (La)TeX
Date: Sat, 27 Nov 2004 21:23:20 +0100
In-Reply-To: Your message of Sat, 27 Nov 2004 18:58:29 +0100. <200411271858.29386.m.g.n@gmx.de>

> > It looks as though two strong Unicode encoding candidates for TeX/LaTeX are
> > UTF-8 and UTF-32. The UTF-32 encoding might be better used
> > internally for performance and programming convenience reasons. The UTF-8
> > encoding is better to work with as an extension of ASCII. So one might make
> > a TeX version that, say, uses UTF-32 internally, and UTF-8 and UTF-32
> > externally.

omega has been capable of this for ages (for some value of 32).

enctex (part of tetex-3) also allows utf-8 input.

latex, of course, has utf-8 as an optional input coding, for languages
for which there's a latex output encoding.  (actually, has had for a
long time, with a contributed package.)

> ExTeX uses UTF-32 internally!

is there anything of any substance to know about extex?  the web site
doesn't look any different from what i saw ages back when i first made
an entry for the project in my faq.
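
The trade-off in the quoted proposal (UTF-8 as an ASCII-compatible external form, UTF-32 as a fixed-width internal form) can be illustrated with a short Python sketch. This is only an illustration of the two encodings, not of how omega, enctex, latex, or ExTeX handle them; the helper names `encoding_profile`, `is_ascii_compatible`, and `nth_code_point_utf32` are hypothetical and chosen for this example.

```python
def encoding_profile(text):
    """Report how many code points and bytes `text` needs (hypothetical helper)."""
    utf8 = text.encode("utf-8")
    utf32 = text.encode("utf-32-be")  # big-endian, no byte-order mark
    return {
        "code_points": len(text),
        "utf8_bytes": len(utf8),    # 1 to 4 bytes per code point
        "utf32_bytes": len(utf32),  # always 4 bytes per code point
    }


def is_ascii_compatible(text):
    """True if `text` has the same bytes in ASCII and in UTF-8."""
    try:
        return text.encode("ascii") == text.encode("utf-8")
    except UnicodeEncodeError:
        # The text contains a non-ASCII character.
        return False


def nth_code_point_utf32(data, n):
    """Fetch the n-th code point from UTF-32BE bytes by direct offset."""
    # Fixed width means the n-th code point starts at byte 4 * n.
    return chr(int.from_bytes(data[4 * n:4 * n + 4], "big"))


if __name__ == "__main__":
    sample = "a\u00e9\u20ac"  # 'a', e-acute, euro sign
    print(encoding_profile(sample))
    print(is_ascii_compatible("\\LaTeX"))
    print(nth_code_point_utf32(sample.encode("utf-32-be"), 2))
```

Plain ASCII input keeps the same bytes under UTF-8, which is why it is convenient externally, while the constant four-byte width of UTF-32 makes indexing by code point a simple multiplication, which is the programming convenience the quoted text has in mind.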
