X-VM-v5-Data: ([nil nil nil nil nil nil nil nil nil] ["775" "Tue" "17" "June" "1997" "09:30:03" "+0100" "J%org Knappen, Mainz" "KNAPPEN@MZDMZA.ZDV.UNI-MAINZ.DE" nil "17" "Re: Multilingual TeX --- and a successor to TeX" "^Date:" nil nil "6" nil nil nil nil nil] nil) Received: from listserv.gmd.de (listserv.gmd.de [192.88.97.1]) by mail.Uni-Mainz.DE (8.8.5/8.8.4) with ESMTP id JAA00074; Tue, 17 Jun 1997 09:41:01 +0200 (MET DST) Received: from lsv1.listserv.gmd.de by listserv.gmd.de (LSMTP for OpenVMS v1.1a) with SMTP id <12.06F69C5D@listserv.gmd.de>; Tue, 17 Jun 1997 9:41:00 +0200 Received: from RELAY.URZ.UNI-HEIDELBERG.DE by RELAY.URZ.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 1.8b) with spool id 154841 for LATEX-L@RELAY.URZ.UNI-HEIDELBERG.DE; Tue, 17 Jun 1997 09:40:53 +0200 Received: from MZDMZA.ZDV.UNI-MAINZ.DE (dzdmzc.zdv.Uni-Mainz.DE [134.93.8.34]) by relay.urz.uni-heidelberg.de (8.7.6/8.7.4) with ESMTP id JAA00558 for ; Tue, 17 Jun 1997 09:40:47 +0200 (MET DST) Received: from MZDMZA.ZDV.UNI-MAINZ.DE by MZDMZA.ZDV.UNI-MAINZ.DE (PMDF V5.0-4 #22141) id <01IK6A6JMJWOH63WP9@MZDMZA.ZDV.UNI-MAINZ.DE> for LATEX-L@RELAY.URZ.UNI-HEIDELBERG.DE; Tue, 17 Jun 1997 09:30:03 +0100 X-VMS-To: IN%"LATEX-L@RELAY.URZ.UNI-HEIDELBERG.DE" MIME-version: 1.0 Content-type: TEXT/PLAIN; CHARSET=US-ASCII Content-transfer-encoding: 7BIT Message-ID: <01IK6A6JMKUIH63WP9@MZDMZA.ZDV.UNI-MAINZ.DE> Reply-To: Mailing list for the LaTeX3 project Date: Tue, 17 Jun 1997 09:30:03 +0100 From: "J%org Knappen, Mainz" Sender: Mailing list for the LaTeX3 project To: Multiple recipients of list LATEX-L Subject: Re: Multilingual TeX --- and a successor to TeX Status: R X-Status: X-Keywords: X-UID: 2070 Werner Lemberg wrote: > I think he has rather meant 16bit character support... In fact, it means 32bit support. Currently there are 16 additional planes (in ISO 10646 speak) assigned for the allocation of more characters. Unicode 2.0 handles this problem by the surrogate mechanism (which is aequivalent to UTF-16 of ISO 10646), meaning that a pair of two 16 bit characters represents one 32 bit character out of the 16 additional planes. Alternatively, it could mean UTF-8 support (32bit fixed width characters are transformed into variable length sequences of 1--6 octets). This can be handled with TeX classic on the input level. Of course it can't be handled with TeX classic on te internal or output level, since larger fonts are desparately needed. --J"org Knappen