Received: from mail.proteosys.com ([213.139.130.197]) by nummer-3.proteosys with Microsoft SMTPSVC(6.0.3790.3959); Wed, 18 Mar 2009 00:30:05 +0100 Received: by mail.proteosys.com (8.14.3/8.14.3) with ESMTP id n2HNU4Dk019541 for ; Wed, 18 Mar 2009 00:30:05 +0100 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by relay.uni-heidelberg.de (8.14.1/8.14.1) with ESMTP id n2HNK7jg006483 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 18 Mar 2009 00:20:08 +0100 Received: from listserv.uni-heidelberg.de (localhost.localdomain [127.0.0.1]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id n2HN14Y9026976; Wed, 18 Mar 2009 00:20:05 +0100 Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 15.5) with spool id 205609 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Wed, 18 Mar 2009 00:20:04 +0100 Received: from relay2.uni-heidelberg.de (relay2.uni-heidelberg.de [129.206.210.211]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id n2HNK45i029430 for ; Wed, 18 Mar 2009 00:20:04 +0100 Received: from mordell.elzevir.fr (mordell.elzevir.fr [92.243.3.74]) by relay2.uni-heidelberg.de (8.13.8/8.13.8) with ESMTP id n2HNJsa2017217 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL) for ; Wed, 18 Mar 2009 00:19:58 +0100 Received: from roth.elzevir.fr (thue.elzevir.fr [88.165.216.11]) by mordell.elzevir.fr (Postfix) with ESMTPS id 9E12A36167 for ; Wed, 18 Mar 2009 00:19:49 +0100 (CET) Received: from [127.0.0.1] (localhost [127.0.0.1]) by roth.elzevir.fr (Postfix) with ESMTP id 278A3BFD1 for ; Wed, 18 Mar 2009 00:19:48 +0100 (CET) User-Agent: Mozilla-Thunderbird 2.0.0.19 (X11/20090103) MIME-Version: 1.0 References: <553234443@web.de> <49C01E4B.4020604@elzevir.fr> <18880.10491.482905.22434@morse.mittelbach-online.de> X-Enigmail-Version: 0.95.0 OpenPGP: url=http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x50A89B42 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit Message-ID: <49C03014.7040705@elzevir.fr> Date: Wed, 18 Mar 2009 00:19:48 +0100 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: =?ISO-8859-1?Q?Manuel_P=E9gouri=E9-Gonnard?= Subject: Re: inputenc for XeTeX and LuaTeX To: LATEX-L@LISTSERV.UNI-HEIDELBERG.DE In-Reply-To: <18880.10491.482905.22434@morse.mittelbach-online.de> Precedence: list List-Help: , List-Unsubscribe: List-Subscribe: List-Owner: List-Archive: X-ProteoSys-SPAM-Score: -4 () RCVD_IN_DNSWL_MED X-Scanned-By: MIMEDefang 2.65 on 213.139.130.197 Return-Path: owner-latex-l@LISTSERV.UNI-HEIDELBERG.DE X-OriginalArrivalTime: 17 Mar 2009 23:30:05.0745 (UTC) FILETIME=[4D96AA10:01C9A758] Status: R X-Status: X-Keywords: X-UID: 5718 Frank Mittelbach a écrit : > perhaps. it might be a straight path into long-term disaster.On the other > hand the whole area is a disaster in the first place. When we started out with > inputenc in 2e I also thought that it is really good to keep the encoding with > the file (which you do by stating \usepackage[latin1]{inputenc} and the like) > and that worked for a while fairly good. But then OSes started to convert on > the fly so by cut-n-paste sometimes even on the same machine an old latin1 got > translated into something else (except for the string specifying the encoding > inside)... so ... not easy really You're right. I still think stating the encoding inside the file is the saner approach, even if it also has its drawbacks. (Btw, when posting on newsgroups/mailing-lists I always take care to use [ascii]{inputenc} in order to avoid such copy-paste problems.) > how much guessing is really needed? Are you targetting an existing 2e env > unchanged or are you intending to design an interface that is robust if used? > Or something inbetween? > I think both are interesting, but concerning the present discussion (more precisely, the packages Will mentions), I guess the target is more an existing as-unchanged-as-possible 2e env. The idea is to make the transition between 8-bit TeX engines and Unicode TeX engine as easy as possible for the user. > - possible 2e solution: steal \openout to always write > \InternallyWrittenFileHookToHandleWhatWeNeedToHandle > to the top of each such file; fix the cases where this is not appropriate > in 2e, such as filecontents env ... and wait for the packages to blow up > and fix those (probably only a few if any) > Probably an interesting approach. > ps interestingly enough, in 2e on top of anormal TeX engine that problem was > properly solved as we ensured that internally written files were always > written in LICR which is unicode in 7bit so it was always coming back > properly. That was at the cost of translating everything into LICR on input > (with active chars) but that was necessary anyway because of the different > 8bit encodings around. At some point, Élie Roux tried to reproduce this approach for lua-inputenc (converting from the input in fake utf-8, then loading the normal inputenc which works correctly and correctly translates things to LICR on output). But it breaks if a non-ascii character is however written to the file for some reason (such as VerbatimOut, eg), and I'm afraid there is nothing to do about it. Manuel.