logo

LaTeX Tagging Project

Screen Reading Examples (DRAFT)

The “Larger Example” on the WTPDF from LaTeX page shows a small typical LaTeX article set in two columns with tables and figures.

Here we show the effect of tagging on generated speech. All examples use the test release of NVDA 2025 (enabling interfacing to MathML readings), along with public versions of MathCat, Foxit and Acrobat Reader on Windows 11.

To present something of the experience to users not familiar with screen readers, we have provided video clips of readings of the full document and of specific sections. Although a video format is used they are all recordings of the screen reader, with a fixed video frame showing the part the document in each case.

Table of Contents.

The first overview of a document is often presented by a table of contents. However a screen reader may have difficulty inferring a usable reading. With an untagged document it relies on the screen reader inferring the structure, to pass to the underlying Accessibility tools.

On this document foxit produces the following reading (Acrobat is similar).

Using the tagged PDF, the structure is made explicit to the screen reader and typesetting atrtifacts such as leader dots are marked as artifacts and not read, producing a far more understandable reading.

Mathematics

Mathematics is traditionally hard to automatically present.

In the untagged PDF the screen reader is just passed the raw character data. The superscript in the first equation is not read, so losing the meaning, and the matrix equation is completely scrambled with the entries being read in the wrong order.

At the present time different PDF readers support different ways of tagging mathematics, here we present a version with math tagged with Associated files containing MathML, which is understood by foxit, and a version with the same MathML used as PDF Structure Element tagging, which is understood by Acrobat. Identical readings are generated as in either case the resulting MathML is passed via NVDA to MathCat to generate the reading.

We expect that this is a temporary situation and that PDF readers will soon support both mechanisms to supply MathML to the accessibility tools. However for now, we need to build two separate PDFs, one using Associated Files, and one with Structure Elements, to support current PDF viewers.

Associated Files and foxit

Structure Elements and Acrobat


Complete Videos

Untagged PDF, with Foxit:

Tagged PDF using Associated MathML Files, with Foxit:

Tagged PDF using MathML Structure Elements, with Acrobat:

The TeX source and LaTeX generated PDF

Available in three forms

t1, Classic LaTeX, typeset with pdftex and no tagging enabled. TeX PDF

t1-af Lualatex with Unicode Math and tagging enabled. Auto-generated MathML included as Associated files. TeX PDF

t1-se Lualatex with Unicode Math and tagging enabled. Auto-generated MathML included as PDF Structure Elements. TeX PDF