Received: from mail.proteosys.com ([213.139.130.197]) by nummer-3.proteosys with Microsoft SMTPSVC(6.0.3790.3959); Tue, 17 Mar 2009 09:10:19 +0100 Received: by mail.proteosys.com (8.14.3/8.14.3) with ESMTP id n2H8AJmk028094 for ; Tue, 17 Mar 2009 09:10:19 +0100 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by relay.uni-heidelberg.de (8.14.1/8.14.1) with ESMTP id n2H85gRR002367 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Tue, 17 Mar 2009 09:05:43 +0100 Received: from listserv.uni-heidelberg.de (localhost.localdomain [127.0.0.1]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id n2GN1Bkh016780; Tue, 17 Mar 2009 09:05:33 +0100 Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 15.5) with spool id 207745 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Tue, 17 Mar 2009 09:05:33 +0100 Received: from relay.uni-heidelberg.de (relay.uni-heidelberg.de [129.206.100.212]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id n2H7tWZF025180 for ; Tue, 17 Mar 2009 08:55:32 +0100 Received: from fmmailgate05.web.de (fmmailgate05.web.de [217.72.192.243]) by relay.uni-heidelberg.de (8.14.1/8.14.1) with ESMTP id n2H7tKAx028406 for ; Tue, 17 Mar 2009 08:55:23 +0100 Received: from web.de by fmmailgate05.web.de (Postfix) with SMTP id 313425B1BEE2 for ; Tue, 17 Mar 2009 08:55:20 +0100 (CET) Received: from [134.96.233.1] by freemailng1501.web.de with HTTP; Tue, 17 Mar 2009 08:55:18 +0100 MIME-Version: 1.0 Precedence: fm-user X-Provags-Id: V01U2FsdGVkX19/mNlUchm7Y3q6RDe61RUEtYqALIDREU3L1sqi/Y5YnhGYv yCX25W0zNmhjM49jhwg8gm7ya++BL1j+Mx3Iv3uas5gmNg+iZc= Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by listserv.uni-heidelberg.de id n2H7tWZF025181 Message-ID: <553234443@web.de> Date: Tue, 17 Mar 2009 08:55:18 +0100 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: =?iso-8859-15?Q?J=F6rg_Knappen?= Organization: http://freemail.web.de/ Subject: Re: inputenc for XeTeX and LuaTeX To: LATEX-L@LISTSERV.UNI-HEIDELBERG.DE List-Help: , List-Unsubscribe: List-Subscribe: List-Owner: List-Archive: X-ProteoSys-SPAM-Score: -4 () RCVD_IN_DNSWL_MED X-Scanned-By: MIMEDefang 2.65 on 213.139.130.197 Return-Path: owner-latex-l@LISTSERV.UNI-HEIDELBERG.DE X-OriginalArrivalTime: 17 Mar 2009 08:10:19.0882 (UTC) FILETIME=[D0403CA0:01C9A6D7] Status: R X-Status: X-Keywords: X-UID: 5714 Unicode has a feature called BOM (Byte order mark) which is a single character prepended to a Unicode file. Thus, if we know that the file is in some Unicode encoding, we can detect which one it is by checking for the byte order mark. If there is no byte oreder mark, assume ASCII or some a non-Unicode 8 bit encoding. This way a Unicode aware can do (partly) an automatic detection of the encoding. The problem is that checking for the byte order mark introduces an incompatible change to TeX, since, e.g., the UTF16 byte order mark looks like