Received: from mx0.gmx.net (mx0.gmx.net [213.165.64.100]) by h1439878.stratoserver.net (8.14.2/8.14.2/Debian-2build1) with SMTP id o1ALLs65032261 for ; Wed, 10 Feb 2010 22:21:55 +0100 Received: (qmail 19398 invoked by alias); 10 Feb 2010 21:21:49 -0000 Delivered-To: GMX delivery to rainer.schoepf@gmx.net Received: (qmail invoked by alias); 10 Feb 2010 21:21:48 -0000 Received: from relay2.uni-heidelberg.de (EHLO relay2.uni-heidelberg.de) [129.206.210.211] by mx0.gmx.net (mx113) with SMTP; 10 Feb 2010 22:21:48 +0100 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by relay2.uni-heidelberg.de (8.13.8/8.13.8) with ESMTP id o1ALJ8GB006173 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 10 Feb 2010 22:19:08 +0100 Received: from listserv.uni-heidelberg.de (localhost.localdomain [127.0.0.1]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id o1A9akoU013381; Wed, 10 Feb 2010 22:19:05 +0100 Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 16.0) with spool id 401282 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Wed, 10 Feb 2010 22:19:05 +0100 Received: from relay2.uni-heidelberg.de (relay2.uni-heidelberg.de [129.206.210.211]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id o1ALJ5gW031675 for ; Wed, 10 Feb 2010 22:19:05 +0100 Received: from smtp101.plus.mail.re1.yahoo.com (smtp101.plus.mail.re1.yahoo.com [69.147.102.64]) by relay2.uni-heidelberg.de (8.13.8/8.13.8) with SMTP id o1ALIvUd006011 for ; Wed, 10 Feb 2010 22:18:58 +0100 Received: (qmail 86123 invoked from network); 10 Feb 2010 21:18:57 -0000 Received: from p5B3188B5.dip0.t-ipconnect.de (st_philipp@91.49.136.181 with plain) by smtp101.plus.mail.re1.yahoo.com with SMTP; 10 Feb 2010 13:18:56 -0800 PST X-Yahoo-SMTP: _jlT6bOswBCTfNEaYibKorijSw14_bs- X-YMail-OSG: epOh7usVM1k8Wt3bt67MNi1U1QKqItiU5FfTrMc9Jyo46KqoQA5zS6EKxaiosO8MzmxR7Ip6fAxg0GVHjrM3xeh0stqyoprW1K3oRBgR_kQsBcE.tpvgxtwGeF2bDN4HUSdd48onL5nVD.dFsNbgzlNBZ_8eoGGy2cRBbeMRdZccVUslYSTFM0Fz2REozEt4k4DfcXYHeh.WHjauuMUgGQ8mtOHxQBtmFGLBuE8ucBqCSoVYITd09EB6Uuzb7YaljHE1T8rMpM5cHTOwfyxZwpkA8rQ7XuEnRqEG_G.8bRAecWnL61c7PvZ7NizlPitSI5AaDHOdf0m9s3w3tHXL9PMCTNXRaz1m4kLbA8WWERjcLbRObHttQOwDVkBstSYjPhZR2cefxt0kYgzL3k9OEGkSce0c6R_HWT1pKtmCY1_k1FmqlrYZ0oIu0IaSwYRBzXCHl0ysc0q7Qr701ISFWptW4O3eY6zY.W0n0hae96FaC6kG9jbbBNpnZFxuJnM- X-Yahoo-Newman-Property: ymail-3 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1077) References: <4B727378.8060704@morningstar2.co.uk>, <4B729944.5050308@residenset.net> , <4B72B36E.6010401@morningstar2.co.uk> <4B730157.5060605@morningstar2.co.uk> X-Mailer: Apple Mail (2.1077) Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by listserv.uni-heidelberg.de id o1ALJ5gW031676 Message-ID: <3A6B880C-91AA-4597-8E53-92F72E3FA70A@yahoo.de> Date: Wed, 10 Feb 2010 22:18:54 +0100 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: Philipp Stephani Subject: Re: LaTeX3 8-bit only? To: LATEX-L@listserv.uni-heidelberg.de In-Reply-To: <4B730157.5060605@morningstar2.co.uk> Precedence: list List-Help: , List-Unsubscribe: List-Subscribe: List-Owner: List-Archive: X-GMX-Antispam: 0 (Mail was not recognized as spam); Detail=5D7Q89H36p6i75npGen84eVAEFK/syJmiNoEBJhgjYKpglu1TZLLw7xMZnJMXwBFK0zrU udEInhYyaWAzwtcf5K2pCdD+gZ2/z4PnBLkwixZI+pVtXqOlCN41sOWgjaVeH7+UhPxHlGxFK/rc sw7fg==V1; X-Resent-By: Forwarder X-Resent-For: rainer.schoepf@gmx.net X-Resent-To: rainer@rainer-schoepf.de X-Scanned-By: MIMEDefang 2.63 on 85.214.41.38 Status: R X-Status: X-Keywords: X-UID: 6241 Am 10.02.2010 um 19:56 schrieb Joseph Wright: > Hello Chris, > >> Input is not the only place where character-like things appear in TeX; this is another way of saying what Lars said. Character repertoires are distinct from encodings of characters and these are different again from the encodings used in external files. >> >> So you need to know what character repertoires you are going to deal with internally in these various types of string, whether or not these are represeted by, for example, 7-bit LICRs. > > I was thinking of input encodings, where my point was (supposed to) be that something like the inputenc "utf8" approach would be an approach I hope we can avoid as there are better solutions (in the form of engines which deal with the issue). (Of course, that leaves UTF-16 issues, but I'd hope that engine developments can help out). Current implementation strategies for strings in development environments define one Unicode encoding scheme (UTF-16 in nearly all cases like Windows, Java, Python, Qt, .NET, COM, Cocoa, Carbon; a few technologies like Gnome and Emacs choose UTF-8 instead) that is used exclusively for internal processing, and define "strings" as sequences of UTF-16 or UTF-8 code units. LaTeX could do the same, depending on the engine: UTF-8 for pdfTeX, UTF-16 for XeTeX. Other possibilities (e.g. LICR or UTF-32) are probably either too complicated or not flexible enough.