Received: from mx0.gmx.net (mx0.gmx.net [213.165.64.100]) by h1439878.stratoserver.net (8.14.2/8.14.2/Debian-2build1) with SMTP id o1AFLquL012070 for ; Wed, 10 Feb 2010 16:21:53 +0100 Received: (qmail 19527 invoked by alias); 10 Feb 2010 15:21:47 -0000 Delivered-To: GMX delivery to rainer.schoepf@gmx.net Received: (qmail invoked by alias); 10 Feb 2010 15:21:46 -0000 Received: from relay2.uni-heidelberg.de (EHLO relay2.uni-heidelberg.de) [129.206.210.211] by mx0.gmx.net (mx070) with SMTP; 10 Feb 2010 16:21:46 +0100 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by relay2.uni-heidelberg.de (8.13.8/8.13.8) with ESMTP id o1AFIxhF024998 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 10 Feb 2010 16:18:59 +0100 Received: from listserv.uni-heidelberg.de (localhost.localdomain [127.0.0.1]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id o1A9akPc013381; Wed, 10 Feb 2010 16:18:58 +0100 Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 16.0) with spool id 398299 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Wed, 10 Feb 2010 16:18:50 +0100 Received: from relay.uni-heidelberg.de (relay.uni-heidelberg.de [129.206.100.212]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id o1AFIocr019003 for ; Wed, 10 Feb 2010 16:18:50 +0100 Received: from atlas.informatik.uni-freiburg.de (atlas.informatik.uni-freiburg.de [132.230.150.3]) by relay.uni-heidelberg.de (8.14.1/8.14.1) with ESMTP id o1AFIdio002945 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL) for ; Wed, 10 Feb 2010 16:18:42 +0100 Received: from remote239-018.home.uni-freiburg.de ([132.230.239.18] helo=irwin.vpn.uni-freiburg.de) by atlas.informatik.uni-freiburg.de with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.68) (envelope-from ) id 1NfEKs-0003FE-2W for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Wed, 10 Feb 2010 16:18:38 +0100 Received: by irwin.vpn.uni-freiburg.de (Postfix, from userid 500) id BD4E319F16; Wed, 10 Feb 2010 15:52:58 +0100 (CET) Mail-Followup-To: LATEX-L@LISTSERV.UNI-HEIDELBERG.DE References: <4B727378.8060704@morningstar2.co.uk> <20100210100943.GA3759@oberdiek.my-fqdn.de> <4B7298D6.7080206@morningstar2.co.uk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Message-ID: <20100210145258.GA18188@oberdiek.my-fqdn.de> Date: Wed, 10 Feb 2010 15:52:58 +0100 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: Heiko Oberdiek Subject: Re: String module To: LATEX-L@listserv.uni-heidelberg.de In-Reply-To: <4B7298D6.7080206@morningstar2.co.uk> Precedence: list List-Help: , List-Unsubscribe: List-Subscribe: List-Owner: List-Archive: X-GMX-Antispam: 0 (Mail was not recognized as spam); Detail=5D7Q89H36p4U4jfdfC5HDevlx1X2sAZgAaLl3DbFfW0PXxL7WgvovMFXXSEPrACWeh0yo a0c+bcWC+z01V6zPyjr6CgKY+9mNCOZV9VNsIEhFNj4pirV/OHEXmQpET/yar2QqkzhIyUYZyW6x Qb1PA==V1; X-Resent-By: Forwarder X-Resent-For: rainer.schoepf@gmx.net X-Resent-To: rainer@rainer-schoepf.de X-Scanned-By: MIMEDefang 2.63 on 85.214.41.38 Status: R X-Status: X-Keywords: X-UID: 6235 On Wed, Feb 10, 2010 at 11:30:30AM +0000, Joseph Wright wrote: > On 10/02/2010 10:09, Heiko Oberdiek wrote: > >* Encoding conversions, see package `stringenc'. > > Application: PDF (outlines and other text fields). > > At present we seem to have stayed away from encodings. My own > preference is to leave things to LaTeX2e when working as a package > and to use the "native" encoding only for the format (with UTF-8 > engines available this seems sensible to me). There are encoding issues independet from TeX input. For example, outlines can be given as strings in PDFDocEncoding or Unicode (UTF-16). In hyperref, option pdfencoding=auto, the string is first encoded as Unicode, then PDFDocEncoding is tried and used, if it fits. Also `encoding' can be understood in a general way including hex strings, strings in ASCII85, quoted strings in C/Lua/..., ... > >* Matching (replacing) using regular expressions, > > see \pdfmatch and luaTeX. > > Matching is useful for extracting information pieces or > > validating option values, ... > > Unhappily \pdfmatch has still the status "experimental" > > and the regular expression language differs from Lua's. > > I think we'll be staying away from this. XeTeX has no equivalent of > \pdfmatch, and as you say the LuaTeX version works differently from > the pdfTeX one. The authors/maintainers of XeTeX/pdfTeX/LuaTeX could agree on one version, supported by all these engines. > [At present, we only *require* e-TeX in any case, > although an engine with \(pdf)strcmp available is very useful.] If an expandable version is not an issue, then \pdfstrcmp can be implemented in virgin TeX. If the input of \pdfstrcmp consists of `other' and perhaps `space' tokens, then even an expandable version can be simulated. The main problem is AFAIK the conversion of a general token list to the string representation in an expandable way (that means without having \edef). Yours sincerely Heiko