Received: from mail.proteosys.com ([213.139.130.197]) by nummer-3.proteosys with Microsoft SMTPSVC(6.0.3790.1830); Mon, 6 Mar 2006 08:54:24 +0100 Received: by mail.proteosys.com (8.12.10/8.12.2) with ESMTP id k267sKoF016041 for ; Mon, 6 Mar 2006 08:54:21 +0100 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by relay.uni-heidelberg.de (8.13.4/8.13.1) with ESMTP id k267nkDj025593 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Mon, 6 Mar 2006 08:49:46 +0100 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id k263LkXG029011; Mon, 6 Mar 2006 08:49:27 +0100 Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 14.3) with spool id 1406316 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Mon, 6 Mar 2006 08:49:27 +0100 Received: from relay2.uni-heidelberg.de (relay2.uni-heidelberg.de [129.206.210.211]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id k267nRK1021933 for ; Mon, 6 Mar 2006 08:49:27 +0100 Received: from freebsd1.webcows.se (217198149020-host.dependit.net [217.198.149.20] (may be forged)) by relay2.uni-heidelberg.de (8.12.10/8.12.10) with ESMTP id k267pZvj008289 for ; Mon, 6 Mar 2006 08:51:36 +0100 (MET) Received: from heim-037-65.raab-heim.uni-linz.ac.at ([193.171.37.65] helo=residenset.net) by freebsd1.webcows.se with esmtpa (Exim 4.60 (FreeBSD)) (envelope-from ) id 1FGASw-000NtR-8L for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Mon, 06 Mar 2006 08:49:14 +0100 Received: from (193.171.37.65) by gwavraab.uni-linz.ac.at via smtp id 36f5_ed4a527a_ace5_11da_8255_001143d314e3; Mon, 06 Mar 2006 08:50:49 +0100 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Mime-Version: 1.0 (Apple Message framework v553) X-Mailer: Apple Mail (2.553) X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - freebsd1.webcows.se X-AntiAbuse: Original Domain - listserv.uni-heidelberg.de X-AntiAbuse: Originator/Caller UID/GID - [0 0] / [26 6] X-AntiAbuse: Sender Address Domain - residenset.net X-Source: X-Source-Args: X-Source-Dir: Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by listserv.uni-heidelberg.de id k267nRK1021934 Message-ID: Date: Mon, 6 Mar 2006 08:49:32 +0100 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: =?ISO-8859-1?Q?Lars_Hellstr=F6m?= Subject: Re: LICR objects To: LATEX-L@LISTSERV.UNI-HEIDELBERG.DE In-Reply-To: <17419.19074.277514.24682@morse.mittelbach-online.de> Precedence: list X-ProteoSys-SPAM-Score: 0 () X-Scanned-By: MIMEDefang at proteosys.com Return-Path: owner-latex-l@LISTSERV.UNI-HEIDELBERG.DE X-OriginalArrivalTime: 06 Mar 2006 07:54:28.0646 (UTC) FILETIME=[31FBE460:01C640F3] Status: R X-Status: X-Keywords: X-UID: 4932 Söndagen den 5 mars 2006 kl 21.30 skrev Frank Mittelbach: > > that is not to say that the line > >>>> \DeclareUnicodeCharacter{02C6}{\textasciicircum} > > is probably wrong it should be most likely > > \DeclareUnicodeCharacter{005E}{\textasciicircum} > > and several others have similar defects. would be good if that got > checked. Is that even a legal definition? U+005E (^) is, as was mentioned earlier in this thread, syntax in LaTeX, so you can't inputenc map it to something. Or are you thinking about some attempt at supporting verbatim input? >> Example: Assuming there is a word "deaffish" and the >> author does not want a ligature ffi spanning both word parts. >> Therefore, having a good editor, he uses the Unicode sequence >> U+0066 U+FB01 to specify the correct and desired ligature. >> Using the later case of \DeclareUnicodeCharacter{FB01} >> TeX would get "ffi" and then form the wrong ligature. > > wrong example in my opinion. as Lars said: fi or ffi ligature ended up > in > unicode as legacy codes because they were in legacy 8-bit encodings. > million > other ligatures are not available as "chars" because UC like most other > standards are heavily influenced by what is right for certain > countries but > not others. using "fi" in this way is like using tables in html to > position > elements on the page, ie it works for that example but ... > > so the right thing is not to use fi at all here but would be to a > generic > method to denote subword boundaries or whatever to allow the formatter > not to > use the ligature. TeX's method would be \textcompwordmark ... but > unicode > never thought that such encoding of lgoical information is the task of > the > standard. Actually, U+200C (ZERO WIDTH NON-JOINER) seems to me a perfect match to \textcompwordmark, and I've entered it as such in my "Draft specification for the T1 encoding". More pragmatically, one may of course write "deaf\-fish" to not only escape the ligature, but also point out the proper point of hyphenation. Lars Hellström