Received: from mx0.gmx.net (mx0.gmx.net [213.165.64.100]) by h1439878.stratoserver.net (8.14.2/8.14.2/Debian-2build1) with SMTP id p9AJnRmZ021660 for ; Mon, 10 Oct 2011 21:49:28 +0200 Received: (qmail 22062 invoked by alias); 10 Oct 2011 19:49:22 -0000 Delivered-To: GMX delivery to rainer.schoepf@gmx.net Received: (qmail invoked by alias); 10 Oct 2011 19:49:22 -0000 Received: from relay.uni-heidelberg.de (EHLO relay.uni-heidelberg.de) [129.206.100.212] by mx0.gmx.net (mx081) with SMTP; 10 Oct 2011 21:49:22 +0200 Received: from listserv.uni-heidelberg.de (listserv.uni-heidelberg.de [129.206.100.94]) by relay.uni-heidelberg.de (8.14.1/8.14.1) with ESMTP id p9AJkl88006359 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Mon, 10 Oct 2011 21:46:48 +0200 Received: from listserv.uni-heidelberg.de (localhost.localdomain [127.0.0.1]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id p9AHPabx001513; Mon, 10 Oct 2011 21:46:47 +0200 Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 16.0) with spool id 1774702 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Mon, 10 Oct 2011 21:46:47 +0200 Received: from relay.uni-heidelberg.de (relay.uni-heidelberg.de [129.206.100.212]) by listserv.uni-heidelberg.de (8.13.1/8.13.1) with ESMTP id p9AJkl4f025536 for ; Mon, 10 Oct 2011 21:46:47 +0200 Received: from moutng.kundenserver.de (moutng.kundenserver.de [212.227.126.187]) by relay.uni-heidelberg.de (8.14.1/8.14.1) with ESMTP id p9AJkaDP006326 for ; Mon, 10 Oct 2011 21:46:40 +0200 Received: from mittelbach-online.de (p3EE3F845.dip.t-dialin.net [62.227.248.69]) by mrelayeu.kundenserver.de (node=mrbap2) with ESMTP (Nemesis) id 0MW8Zl-1RbFPp07M9-00X0hG; Mon, 10 Oct 2011 21:46:36 +0200 Received: by mittelbach-online.de (Postfix, from userid 783) id A3695CE01BC; Mon, 10 Oct 2011 21:46:28 +0200 (CEST) X-Spam-Checker-Version: SpamAssassin 3.2.5 (2008-06-10) on Marlowe X-Spam-Status: No, score=-1.4 required=5.0 tests=ALL_TRUSTED,AWL autolearn=unavailable version=3.2.5 Received: from [127.0.0.1] (unknown [192.168.123.104]) (Authenticated sender: frank) by mittelbach-online.de (Postfix) with ESMTPSA id 7D70BCE0170 for ; Mon, 10 Oct 2011 21:46:27 +0200 (CEST) User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:7.0.1) Gecko/20110929 Thunderbird/7.0.1 MIME-Version: 1.0 References: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Antivirus: avast! (VPS 111010-1, 10.10.2011), Outbound message X-Antivirus-Status: Clean X-Provags-ID: V02:K0:1MdsKxZ7A9+imKUDB85SWmAHMiZWbyKw61Rt77RJmQM 0lDbM/UqQ9LQ5Eola4h2d9Ws3X7IALI+sqiDZEPGXHLOvGv88B WUsW+uvgwMyrpVDpg/oBMCR4yIGP55VGYLq+nafmvylnTj0qxi Bbw+CYYIyFArPMlSPgLZJD44JdHqlh+FLyZGZojwcSWFD8tTwB Wiwg/NGWjNO7QgG10l12KPbPVSrW+dV/dD0arwcvCkxhr1FyQd GjsqFVFbJLofk7RYDUZ8Zj2zDQhKPFys4hHWAOOytVPHa38onv lpS9UqZJgWn0jGt2hq66lQlAtIqmbupduHKZg08fxn8dstaFM4 wYJivPVsxgIUtN8rfYdVlRbg/7zKy2TBhLeKMiW0R Message-ID: <4E934B91.4030205@latex-project.org> Date: Mon, 10 Oct 2011 21:46:25 +0200 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: Frank Mittelbach Subject: Re: Strings, and regular expressions To: LATEX-L@listserv.uni-heidelberg.de In-Reply-To: Precedence: list List-Help: , List-Unsubscribe: List-Subscribe: List-Owner: List-Archive: X-GMX-Antispam: 0 (eXpurgate); Detail=5D7Q89H36p7Nd1a4XKk/L3frsCBpYZwps9dFe9qBIQJUzr7DBFk9xvkoJ53VY1pehVPg9 0ieTDAioGnk+5AYVoStw5LinQZm1avjzEagrW04/B4seYBzmyybhAIWEkbvWmNDzUWAh8CqfV5kL UXIyUVSFLKTTIpY5y0OBFCPVSGBWGMSL4+9yW7jUgJYTCJC8la17/bBMKQ=V1; X-Resent-By: Forwarder X-Resent-For: rainer.schoepf@gmx.net X-Resent-To: rainer@rainer-schoepf.de Status: R X-Status: X-Keywords: X-UID: 6909 Hi Bruno, a few comments on "random" parts of your message > - Some day I may add printf-like string formatting. Is that useful? I would think so. anything with input output formatting benefits from this. It is not a primary typesetting function but ... ;-) > - Newlines. Currently, "." matches every character; in perl and PCRE > it should not match new lines. As you know, the situation with new > lines in TeX is a little bit odd, since they are converted to the > \endlinechar upon reading, and normally not tokenized, simply giving > rise to a space or a \par. Should we still decide that "." does not > match the CR nor LF characters? Or should it simply no match the > \endlinechar? in my opinion we should accept that TeX strings do not have the notion of newlines thus one can't match against them > > - I had the idea of providing # as a shorthand for .*? (arbitrary > sequence of characters, lazy), mimicking what TeX does when finding a > macro parameter. Is it useful? don't think this requires adding an additional shorthand which is non-standard. > - Same question for caseless matching, and for look-ahead/look-behind > assertions. personally (in other languages) I alway find it useful to be able to match caseless (as an option). But then this requires an understanding on what is "case". Whether that is however important enough to go some length ... dunno > Comments, feature requests, bug reports etc. are all highly welcome. > Thanks for reading so far :). I did ... read that far ;-) cheers frank