X-VM-v5-Data: ([nil nil nil nil nil nil nil nil nil] ["6428" "Wed" "5" "June" "1996" "23:12:18" "+0100" "KNAPPEN@VKPMZD.KPH.UNI-MAINZ.DE" "KNAPPEN@VKPMZD.KPH.UNI-MAINZ.DE" nil "141" "Re: latex/2071: Latex fuer Vietnamesisch" "^Date:" nil nil "6" nil nil nil nil] nil) Received: from listserv.gmd.de (listserv.gmd.de [192.88.97.1]) by trudi.zdv.Uni-Mainz.DE (8.7.5/8.7.3) with ESMTP id XAA31979; Wed, 5 Jun 1996 23:13:43 +0200 (MET DST) Received: from listserv.gmd.de by listserv.gmd.de (LSMTP for OpenVMS v1.1a) with SMTP id <5.18E445A5@listserv.gmd.de>; Wed, 5 Jun 1996 23:13:40 +0200 Received: from URZINFO.URZ.UNI-HEIDELBERG.DE by URZINFO.URZ.UNI-HEIDELBERG.DE (LISTSERV-TCP/IP release 1.8b) with spool id 84334 for LATEX-L@URZINFO.URZ.UNI-HEIDELBERG.DE; Wed, 5 Jun 1996 23:11:01 +0200 Received: from MZDMZA.ZDV.UNI-MAINZ.DE (dzdmza.zdv.Uni-Mainz.DE [134.93.8.17]) by relay.urz.uni-heidelberg.de (8.7.5/8.7.4) with ESMTP id XAA10095 for ; Wed, 5 Jun 1996 23:10:58 +0200 (MET DST) Received: from decnet-daemon (KNAPPEN@VKPMZD) by MZDMZA.ZDV.UNI-MAINZ.DE (PMDF V5.0-4 #10401) id <01I5KF2NRXRKB61VYS@MZDMZA.ZDV.UNI-MAINZ.DE> for LATEX-L@URZINFO.URZ.UNI-HEIDELBERG.DE; Wed, 05 Jun 1996 23:12:18 +0100 X-VMS-To: MZDMZA::IN%"LATEX-L@URZINFO.URZ.UNI-HEIDELBERG.DE" MIME-version: 1.0 Content-type: TEXT/PLAIN; CHARSET=US-ASCII Content-transfer-encoding: 7BIT Message-ID: <01I5KF2NSSUAB61VYS@MZDMZA.ZDV.UNI-MAINZ.DE> Reply-To: Mailing list for the LaTeX3 project Date: Wed, 5 Jun 1996 23:12:18 +0100 From: KNAPPEN@VKPMZD.KPH.UNI-MAINZ.DE Sender: Mailing list for the LaTeX3 project To: Multiple recipients of list LATEX-L Subject: Re: latex/2071: Latex fuer Vietnamesisch Status: R X-Status: X-Keywords: X-UID: 1754 Frank, here is included a document on the vietnamese character set which was sent to the iso10646 mailing list 'bout five years ago. --J"org Knappen Appended: Document from ISO10646 Date: Fri, 12 Apr 91 17:26:00 CET From: "J. W. van Wingen" Subject: vietnamese Sender: Multi-byte Code Issues Please find the result of some research concerning Vietnamese. Best regards, Johan van Wingen ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 W VERSION 1.1 COMPLETE REPERTOIRE OF GRAPHIC CHARACTERS 1991-04-10 REQUIRED FOR THE VIETNAMESE LANGUAGE J. W. van Wingen COMMENTARY ON VIETNAMESE LETTERS The Vietnamese language has a number of extra letters, both consonants and vowels. I checked my information with Dr. Truong Van Binh, Lecturer in Vietnamese at Leiden University. This resulted also in finding more defects in DIS 10646. (There are no Vietnamese characters in Unicode at all, but the Chinese transliteration letters LATIN SMALL LETTER U WITH DIAERESIS AND BREVE etc. are included, strangely enough.) The F, J, W, Z are not needed, and will only be kept for foreign names. There is, sorted after D, the: =d LD61 LATIN SMALL LETTER D WITH STROKE =D LD62 LATIN CAPITAL LETTER D WITH STROKE The following are vowels in their own right: a LA01 LATIN SMALL LETTER A A LA02 LATIN CAPITAL LETTER A #a LA23 LATIN SMALL LETTER A WITH BREVE #A LA24 LATIN CAPITAL LETTER A WITH BREVE ^a LA15 LATIN SMALL LETTER A WITH CIRCUMFLEX ^A LA16 LATIN CAPITAL LETTER A WITH CIRCUMFLEX e LE01 LATIN SMALL LETTER E E LE02 LATIN CAPITAL LETTER E ^e LE15 LATIN SMALL LETTER E WITH CIRCUMFLEX ^E LE16 LATIN CAPITAL LETTER E WITH CIRCUMFLEX i LI01 LATIN SMALL LETTER I I LI02 LATIN CAPITAL LETTER I o LO01 LATIN SMALL LETTER O O LO02 LATIN CAPITAL LETTER O ^o LO15 LATIN SMALL LETTER O WITH CIRCUMFLEX ^O LO16 LATIN CAPITAL LETTER O WITH CIRCUMFLEX }o LO.. LATIN SMALL LETTER O WITH HOOK }O LO.. LATIN CAPITAL LETTER O WITH HOOK u LU01 LATIN SMALL LETTER U U LU02 LATIN CAPITAL LETTER U }u LU.. LATIN SMALL LETTER U WITH HOOK }U LU.. LATIN CAPITAL LETTER U WITH HOOK y LY01 LATIN SMALL LETTER Y Y LY02 LATIN CAPITAL LETTER Y All these can carry one of six tones, each except one indicated by a diacritic: high falling high rising GRAVE HUY\^EN low falling ACUTE S/#AC low broken DOT BELOW N#ANG low rising VERTICAL TILDE H?OI high broken TILDE NG~A The diacritics have a name in Vietnamese, but no English equivalent. Those given here are the names used in DIS 10646. The "vertical tilde" in fact has the form of an interrogation sign without dot, so the name chosen (by whom?) is quite inappropriate. The sorting order is as indicated, but has as yet to be checked with a lexicon. The vowels seem not to have a place of their own in the alphabet, but have one on the keyboard (N 293, see below). When applied, we get the following characters, shown here on the Y: /y LY11 LATIN SMALL LETTER Y WITH ACUTE * /Y LY12 LATIN CAPITAL LETTER Y WITH ACUTE * \y LY13 LATIN SMALL LETTER Y WITH GRAVE * \Y LY14 LATIN CAPITAL LETTER Y WITH GRAVE * ?y LY.. LATIN SMALL LETTER Y WITH VTILDE ?Y LY.. LATIN CAPITAL LETTER Y WITH VTILDE ~y LY19 LATIN SMALL LETTER Y WITH TILDE ~Y LY20 LATIN CAPITAL LETTER Y WITH TILDE * .y LY.. LATIN SMALL LETTER Y WITH DOT BELOW * .Y LY.. LATIN CAPITAL LETTER Y WITH DOT BELOW * In SC2/WG2 N 293 a code table is given for Vietnamese, but in this the letters marked with * are missing. One feels that there was lack of space, and the least used were omitted, just the same trick as we met with the LATIN CAPITAL LETTER Y WITH DIAERESIS and the CYRILLIC CAPITAL HARD SIGN. Unfortunately, DIS 10646 follows N 293. (The Vietnamese letters are scattered over Tables 5 to 8 in a most impractical way.) Because the combination of some vowels with a tone results in a letter carrying two diacritics, this is reflected in its name. But that should be constructed in a logical way, unlike arbitrarily as in DIS 10646. /#a LA LATIN SMALL LETTER A WITH BREVE AND ACUTE /#A LA LATIN CAPITAL LETTER A WITH BREVE AND ACUTE \#a LA LATIN SMALL LETTER A WITH BREVE AND GRAVE \#A LA LATIN CAPITAL LETTER A WITH BREVE AND GRAVE ?#a LA LATIN SMALL LETTER A WITH BREVE AND VTILDE ?#A LA LATIN CAPITAL LETTER A WITH BREVE AND VTILDE ~#a LA LATIN SMALLAL LETTER A WITH BREVE AND TILDE ~#A LA LATIN CAPITAL LETTER A WITH BREVE AND TILDE .#a LA LATIN SMALL LETTER A WITH BREVE AND DOT BELOW .#A LA LATIN CAPITAL LETTER A WITH BREVE AND DOT BELOW /^a LA LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACUTE /^A LA LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND ACUTE \^a LA LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRAVE \^A LA LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND GRAVE ?^a LA LATIN SMALL LETTER A WITH CIRCUMFLEX AND VTILDE ?^A LA LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND VTILDE ~^a LA LATIN SMALLAL LETTER A WITH CIRCUMFLEX AND TILDE ~^A LA LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND TILDE .^a LA LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT BELOW .^A LA LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND DOT BELOW The form of the graphic symbol is outside the scope of DIS 10646 (and SC2 as well). But because the name might be derived from the common glyph, some remarks should be made. Putting a GRAVE or ACUTE over a CIRCUMFLEX may make the letter too high. Thus these forms occur: CIRCUMFLEX & ACUTE CIRCUMFLEX & GRAVE /\ / //\ \ /\ /\\ / \/ // \ \/ \ / \\ / \ // \ / \ / \\ 1 2 3 4 It was pointed out to me that case 2 is NOT permitted. Glyphs are the subject of SC18/WG8 and they should be aware of this.