View on GitHub

Babel

The multilingual framework to localize LaTeX, LuaLaTeX, XeLaTeX

Which method for which language

Improve this page! Feel free to draft a pull request on GitHub.

Since currently there are two methods to load a language, here is a short explanation on how to do it, with info about basic fonts, so that you can setup quickly a document. There are also pages on specific locales in the menu above and here.

See also the examples here for languages like Arabic, Thai, Hebrew, Polytonic Greek, Japanese, Syriac and Malayalam, among others.

I’d like to thank Loren Davis for his help in preparing this page, including the list of additional fonts.

Index Loading the languageFonts


Loading the language

There are two ways to load a language, which use different definition files. At first this duality may seem inconvenient, but this way the transition to a more modern system where locales can be easily defined and interoperate can be done smoothly.

The first one is:

\usepackage[italian]{babel}

It will load the old good Italian language definition file. This procedure is the preferred one in most languages where an ldf exists, but there are exceptions (for example, Arabic and Hebrew). In general, you should do this for European languages written in Latin and Cyrillic scripts, as well as for Vietnamese. For Greek, it may depend on your needs, and very often both methods will be fine. If you want modern and ancient Greek at the same time, the next one is usually better.

The second method is:

\usepackage[thai, provide=*]{babel}

This will load data from an ini file. This is usually the correct method for languages written in some of the 30-something non-European scripts, currently supported by babel, like Thai, Hindi, Sanskrit, Marathi, Malayalam, Lao, Khmer, Bangla (Bengali), Gujarati, Telugu, Kannada, Sinhala, Tamil, Amharic, or Burmese. In short simple CJK texts, this is also the preferred method, but in more complex ones you should resort to some CKJ framework like luatexja, xeCJK, CTeX, kotex, and the like.

Some languages may require additional options. Examples are Arabic, Hebrew, and Syriac. So, with LuaTeX:

\usepackage[arabic, bidi=basic, provide=*]{babel}

With XeTeX:

\usepackage[arabic, bidi=bidi-r, provide=*]{babel}

Fonts

With non-Latin scripts, you need to select another font, which can be done with \babelfont.

If compatibility is not a concern, remember currently the most important systems (Windows, Mac, Linux) come with many fonts. Even if they are not installed by default, there are options in the system settings to do it.

With luatex and in versions ≥24.14, the default renderer in \babelfont is the default one in Latin, Cyrillic, Greek and scripts of the same family. In the rest of scripts, like Southeast Asian ones and Arabic, the default renderer is Harfbuzz.

TeXLive

Here are some basic fonts included in TeXLive. This section is not meant as a font reference, but as a quick guide for a language to work with a more or less standard font. I’ve found no fonts for Telugu or Kannada.

Font(s) Languages / Scripts Notes
IPAexMincho, IPAexGothic Japanese  
Baekmuk Gulim, Baekmuk Batang, Baekmuk Dotum Korean  
FreeSerif Multilingual, including Greek, Cyrillic, Coptic, Arabic, Hebrew, many (but not all) Brahmi scripts (Thai, Devanagari, Bengali, Tamil, Malayalam, etc.), Amharic, Georgian, Cherokee, Armenian. A comprehensive Times-like, although with some oddities.
Amiri Arabic  
ALM Fixed Arabic A CM-like monospaced font.
CMU Serif, CMU Sans Serif, CMU Typewriter Text Cyrillic, Greek A CM-like family.
NewComputerModern Cyrillic, Greek, Hebrew, Cherokee Another, more modern, CM-like family. It doesn’t support African orthographies.
Padauk Myanmar  
Shobhika Devanagari With an excellent coverage of Vedic accents, Latin transliteration for Sanskrit, mathematical and technical symbols, and Cyrillic.
Rachana Malayalam It was the first Unicode font to support traditional orthography. This is the version by smc.org.in. There are several other fonts floating around under the name “Rachana,” and some of them do not work. Manjari is another popular font suitable for these body text, designed on the same theoretical principles as Inconsolata.
FandolSong, FandolHei, FandolKai, FandolFang Yi, Chinese  
David CLM, Miriam CLM, Miriam Mono CLM Hebrew Free implementations of the most common Hebrew serif, sans-serif and monospace fonts. If you need cantillation marks for liturgical texts, Culmus also has a Taamey David. There are many different versions of font faces.
DejaVu Sans Greek, Cyrillic, Armenian, Hebrew, Arabic, Lao, Georgian Also Canadian Aboriginal, Ogham.
Noto Greek, Cyrillic A large family, but TeXLive includes only the fonts for these two scripts (and emojis).
Junicode Runic Also Medieval and Ancient Latin script.

Additional fonts

There are also some fine fonts out there, but you must install them because they aren’t bundled with TeXLive. This list, of course, is just a very small selection.

Font(s) Languages / Scripts Notes
Noto Multilingual This family has a very comprehensive coverage and come in many variations of weight and width to match your other fonts. You can see how to combine Noto fonts at the end of the video LaTeX – Multilingual Unicode strings.
Babelstone Han Chinese It supports both Simplified and Traditional Chinese and has an especially comprehensive selection of traditional Chinese characters.
Abyssinica SIL Abyssinica SIL  
Nirmala UI Indic scripts A Windows font with 14 scripts: Telugu, Kannada, Bangla, Odia, Sinhala, Devanagari…
Segoe UI Historic Ancient scripts A Windows font with 25 scripts: Imperial Aramaic, Brahmi, Egyptian hieroglyphs, Syriac, Akkadian Cuneiform, Runic…

You may also want to have a look at some articles on tex.textackexchange on Arabic. On Windows, Nirmala UI supports meny Indic scripts, while Segoe UI Historic provides some ancient scripts, like Syriac, Old Uyghur


Language Method Font Additional package options (lua) xe Notes
Arabic provide Renderer=Harfbuzz bidi=basic, layout=counters tabular bidi=bidi-r  
German          
Hebrew provide Renderer=Harfbuzz bidi=basic, layout=tabular bidi=bidi-r  
Hindi provide Renderer=Harfbuzz      
Portuguese          
Swedish          
Thai provide Renderer=Harfbuzz