Return-Path: Delivered-To: rainer Received: from h2774747.stratoserver.net (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) by h2774747.stratoserver.net (Dovecot) with LMTP id f7xzEsQDcF5UDwAA4+3H6A for ; Mon, 16 Mar 2020 23:55:00 +0100 Return-Path: Received: from mout.gmx.net (mout.gmx.net [212.227.17.21]) by h2774747.stratoserver.net (8.15.2/8.15.2/Debian-3) with ESMTP id 02GMsx1R003922 for ; Mon, 16 Mar 2020 23:55:00 +0100 Received: from relay.uni-heidelberg.de ([129.206.100.212]) by mx-ha.gmx.net (mxgmx114 [212.227.17.5]) with ESMTP (Nemesis) id 1MT9Nv-1irofa2Ps7-00Ub9N for ; Mon, 16 Mar 2020 23:54:53 +0100 Received: from listserv.uni-heidelberg.de ([129.206.100.94]) by relay.uni-heidelberg.de with ESMTP; 16 Mar 2020 23:54:54 +0100 Received: from listserv (localhost [127.0.0.1]) by listserv.uni-heidelberg.de (Postfix) with ESMTP id 158F8127E64; Mon, 16 Mar 2020 23:54:13 +0100 (CET) Received: by LISTSERV.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 17.0) with spool id 42651098 for LATEX-L@LISTSERV.UNI-HEIDELBERG.DE; Mon, 16 Mar 2020 23:54:13 +0100 Delivered-To: LATEX-L@LISTSERV.UNI-HEIDELBERG.DE Received: from listserv (localhost [127.0.0.1]) by listserv.uni-heidelberg.de (Postfix) with ESMTP id EBA79127E64 for ; Mon, 16 Mar 2020 23:54:12 +0100 (CET) Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Message-ID: <8422310776248060.WA.kellysmith12.21gmail.com@listserv.uni-heidelberg.de> Date: Mon, 16 Mar 2020 23:54:12 +0100 Reply-To: Mailing list for the LaTeX3 project Sender: Mailing list for the LaTeX3 project From: Kelly Smith Subject: Re: Using Lua to Preprocess Unicode Data To: LATEX-L@LISTSERV.UNI-HEIDELBERG.DE Precedence: list List-Help: , List-Unsubscribe: List-Subscribe: List-Owner: List-Archive: Envelope-To: X-GMX-Antispam: 0 (Mail was not recognized as spam); Detail=V3; X-Spam-Flag: NO X-UI-Filterresults: notjunk:1;V03:K0:objro5+W1Q8=:fF08dc7TUmzmmCjDaWFbgR8zE3 Vr/xtBcd8jKn2BkTo/qUS0tOYKiubv0yR/28WOyos2Jau8G8uO7p+E/QtVtOyUbGTmknYH5eW yWLiev9p0B1dy92a4SuHp0hJpyDRxEtDKsW2wJziIAWTV78Xw+1BM5s0mR7FCGGeuWPCY3M6X n8081yt2/1SxXGK8xtDGZNeIlE/kUbOlrCZ/FzjkU5iZCinEfKnJsm6CiVsMowTnwzt1tvmRQ FyXEQ01+a21RmL4YtGii4KxTb53LP7cda9KVoAJMeEVF5k8fjsi1Ng+nxSdSA5okRFBO5eyh8 Yk/fhhy7BVNv9ibEEvcTVMxJ/4n2oqIqLEahOSQ27/DxGKBtnShA/o8IgVTKu8ZMO79v3fntw At2vajsWsf/fTVqhhJRsv50SQSTIqD1MhBrz3odUHuUpdsTPjibKomtjRGUnauR1P5CBSsqhM FfFa2TdvUWUJdEqfvJU1AEYycbDMfKlAUbEY+gni50uvUx1MkPBItmphMbls9jiPp1QJWU6Ky XbGz8xhcVD3z3A1jxtVYaIkbgbiXsQesXxNnY7nMofq7zMemr0i/BC412VHG1j2v4Nj2VDOOY 7oTZSwzfM9gS+nIlYcaGyORzU17ApgkgmQiZV/8AYWBTp/Xb6YHsVda1an7siY9E+cX0dTze/ cW2aIYuW2EDQhIPRYOGz3YCC9jiwO+fPb+nmYLTX+rQDn6jXP+ne2RoUjvehUZ7EKPtK0q3ST /+psMki61YQdRDXPZXBDRbsT48vLkSs4qKrJi2qz9HuGSkJZ0n1OedHriRVHHP2+0kxOrNtS3 +cTg5mOWkoFmDQJf84Z9U5aRVani5tq7DMCwCu4z6fEdqemRKsNhTq9OA6paQXC9pZrsLuiom pLEWK5Wa0Iodyyvvzf1XCtJjjr9r5a59xFimbcE3y6uDF4GstLQTfjJ7IeBZLD+aTDGHe5G7n dDafQEkuVw2TgBZEBvFE2XF1cEes5yinyCD2qdDviAnkdt0T9QVD53CVGRj65IDi8LMOzdDmH dD/OTocu6UOjCvcvPHq4kfpSIymjN7mm5XMfejf2EKRt1CRuswh/RBgESlUCJzCnTqAuebdeN K10ehEbgRpod1n5m8cp3MzaG9u4J5OIps9HGRJtt0SroqdDUw/z33dub7UtMI4aG6yj66rz2F jA5CDLsS3hesr8sbavfRI0LTy/q4labAVIHQEgDvhU4RR32KFHpD3G/cPw9epNeQdAK7DFJnE sP5bB6/2tSnQO2IXUSu7kSrcjl2OOOuTqzzi70WdAHgPDhkwXJQOLmk397vEwTFaKcUOFmowo ZPz9ITNGUVsSA+PhW8y6TNPKQsEbRZ2P1WDJOZqMTURYaSq/uj/W2lFlMVgppEghBg/oM/ZmN 2Z6IWoZHXQPu45/eg1NXrhmRU+XajUcZ34ulkXZTdjXfIQKrltdjMySr2jsM0LfXbFDE7rSUq bSSPE1poCXUZVEAJvjM4ge/FUmprZyJk1ytDXeh2f51iRd+er4MMKDQkjeWNK4/QtA2MlvNVj GAXHzkev3vpL9+9Y4h7zWgfWUCXM9VK+bvlcbr/ofUOU0U1srsKi5I+u7VLTEzwrsktjStCMX SkBGcLI1YvEn5jz6FiMncZ1R6shmeqlyWD9usrZkP2PWGgGUqmdJ7XppsYToAhZhEMlEyteNI 1H1SSHF6qUopq4Zokpa5gIefOTdUHa2S51uhb00hGxnOr9hOSa8wfKAs0mq87crXZPA0lAXvR 3NrLSDMX8lYd1AwSEOva7+Mom3E0JPUlhQP6o5u9UDaFOzWTqVYI9uhjnj4kbBtG3QL63R+8h WjRolGBi+FkEWHlFuNaLuq88qZWilD4qsbwooz7Ukon/PVg69LfFtlrL9MnbBiuJCW8+3Z0AN P2RibWmZap3hCjXvn5W/yo2QcXuD3TIs8gX25jD60g+s94GJLn29OpfbOMjJcE+ctc82QxyfM LQhfqhOkIAZNA8fJdzc4X4tnlS6xpUf/E1uxM+ne+Wxnt0b8CLvrWsMNGl2wOdza/FTSzf8c7 8bBDcD/uVxIWi4eDCeW9m9xfkViwDHruL6QvkaeZh3gcrqxRLJvNjulZwBLLqj+pwJZoWJVfE HMPx2iZbjSLfwJjsjXU5x4b30JjHR/1rDPEkgaSdgicyxXyy/LaNkIyDKFuCF0W2G4jB00WZf wzt7ra1+QtAnqUIT5+w/Y+QRqjG3EkQUBobFjGmz71wTxFMUc6LLU/bGZUgv9NAZql6Q3nxZx eNj6Bcn/VIvnHopVG2mgFE4bjdEbfqRGgJffEV7CbvGMkedOQuZ9X1uxllxCXYQ2Hfli0rv9E xrtvN/2wbqXMkFzYLeBJ1zK2e+RdCyPqul1cqUhgWfw32r4Xtw9ubuJF0e0k3nb2i/6aKY7H/ jd1zJMXdKVH62UUdXW7AclaO2ovJlJnm42eXDebBRCN4LMfc6+jsD8PQoPCboPu4cpuWns+s8 7xIDh+j/FVJme4+CasLeJ7yzTjvsPBo19XG8mwm0b5KZ1Y5+y0hQ5/r3Fnlywcue3uxsovavm efzeMjBYmKwsGrBcQtahn4tJrTgUJDfiMuhFgJUL/i6emneWBPomBn0e/4IZfWQTIrpfCAO44 EQAWMnhlKdijelVggmpdc6PWnvLQeLFYYMbXKlMoE1BKctpcMcG+Ej/dQU7ns9Pk3M0t3h//2 bAuaFByfaMWSkOt5AkBp9B4QATgPZsReckj0WcPA70HPDjAsUwo1Il0SZBiHwbhWRRKU= X-UI-Loop:V01:WR5QA1EYl3M=:NCYaOSJ/1LSZhmQZ69mpXAbmmc+wqcu5AU1hxKn/DCk= X-Spam-Flag: NO X-UI-Out-Filterresults: notjunk:1;V03:K0:l1ClGe3sago=:cF+9NigFfmv8ddcxvxsbmp gRFKBRLyLkQC/QnSouWNovr0IBb3juI2KJjqm46fkMxp7UNP+1yz1yc8fjNMIaLJAw+FeQOzh IPhhTZdC/WqiWivte4Tt3S/HfnYcuVm+zSMS5ufBOA4vFJBA+6ieZbVyEwrfc6wWobeL/9sqx CN1VWI6A2PENbPpJ63YYXIej6LCnFd+zdOzPk9LReINgT5E9X1Fbfukirzj2SF9j6QGs4W9J4 KO8ZZpyeULh39djnzEhqJ+izlqcL7kRyVWBZPwl7cZ6IGClBGyogANNsYIiZun6hrEnAlmcj4 loabMX3xD0CS1r5lVde+r1FdcimQ0TazfXsKbjeua0wtRmLzmtg1BvgMFzwI3n2LFig/2ZnTy FR0R29akT574N+djvkpDsPM41kIptxc5ky3o9aKQVQsPXRw13nCBMEj5eCm4J/raRXYZly9ND QUc3WDnrUW/bfqe5zfenEf8L4GQySYY= X-Scanned-By: MIMEDefang 2.78 on 81.169.212.23 Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by h2774747.stratoserver.net id 02GMsx1R003922 Status: R X-Status: X-Keywords: X-UID: 8206 >As format-building is all about saving time for 'normal' runs, I'm not >seeing there is a massive need to speed up the process. I know there is >one engine in development that doesn't use format files, so that might >be a place to consider things, but I think we'd need a strong case to >alter the approach for XeTeX/LuaTeX (pdfTeX, ...). > >Joseph Sorry, I should’ve clarified: the point of preprocessing the data wouldn’t be to speed up anything, instead, the point would be to do complex processing that would be very difficult or even impossible in LaTeX. For example, if the l3regex module were extended so that precompiled regexes could be used as parts of other regexes, then Unicode properties could be simply implemented by referring to precompiled regexes whose content was created by running filters over the Unicode character database. Another example would be processing the very complex XML files that are used in supplementary Unicode files, like the Common Locale Data Repository, which could help with localization and language-specific date and number parsing/formatting. This idea of preprocessing could be applied to any complex data set that LaTeX3 may need to work with, but I used the example of Unicode data because that’s the one that immediately came to mind.