MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----_=_NextPart_001_01C096C5.68F53D00"
Lines: 42
References: <v03110700b6aee5fd0bce@[195.100.226.150]> Your message of "Mon, 12            Feb 2001 19:44:31 +0100." <v03110700b6addd26c875@[195.100.226.129]>
Content-class: urn:content-classes:message
Subject:      Re: Why markup?
Date: Wed, 14 Feb 2001 21:22:22 +0100
Message-ID:  <2.07b5.RCBL.G8RKLA@cherepan.mccme.ru>
From: "Alexander Cherepanov" <sasha@CHEREPAN.MCCME.RU>
Sender: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
To: "Multiple recipients of list LATEX-L" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Reply-To: "Mailing list for the LaTeX3 project" <LATEX-L@URZ.UNI-HEIDELBERG.DE>
Status: R

This is a multi-part message in MIME format.

------_=_NextPart_001_01C096C5.68F53D00
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

13-Feb-01 14:45 Hans Aberg wrote:
> Otherwise, I stated the general principle, the better the parsing becomes,
> the less markup will be needed (or the more sophisticated it can be).

The question is "can markup be avoided completely?"
I bet not.

> As for that natural language parsing problem, one problem is that humans,
> using their massively parallel supercomputers, can scan a sentence and try
> many different patterns. Let's try parsing the Frank Mittelbach example:
>     The a in the formula is a variable.
> You would probably use the context knowledge that it is composed of English
> and Math and scan it to recognize that the second "a", but not the first,
> is an indefinite article. Then from that, you would infer that the first "a"
> must be a math symbol, which is supported by the semantic information of
> the wording "in the formula".

As for a math environment, how will you parse (without math markup) the last
sentence in the following example:

    ($G$ acts on a manifold $M$. $G_1,G_2,A$ --- subgroups of $G$. $X$ ---
    a submanifold of $M$. $a \in A, p \in X$.)
            $$ L = \{ ga \cdot p  \mid  \mskip10mu p \in X \} $$
            $$ M = \{ g \cdot ap  \mid  \mskip15mu g \in G_1, p \in X \} $$
            $$ N = \{ gap         \mid  \mskip20mu g \in G_2 \} $$
    Editor writes: The gap in the last formula [i.e. \mskip20mu] should be
    made like in the first one and the $gap$ should be made like in the
    second one.
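
For comparison, here is a sketch of what I take the editor's request to
mean once the markup is kept. This is only my reading of the example: the
skip becomes \mskip10mu as in the formula for $L$, and $gap$ is split as
$g \cdot ap$ as in the formula for $M$.

```latex
% Assumed reading of the editor's note, applied to the last formula:
%   spacing after \mid taken from the first formula (\mskip10mu),
%   grouping of the product taken from the second formula (g \cdot ap).
$$ N = \{ g \cdot ap  \mid  \mskip10mu g \in G_2 \} $$
```

With the $...$ and \mskip markup in place the two meanings of "gap" cannot
be confused; without it, nothing in the sentence tells them apart.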

The following example is not that serious :-) (and, in fact, I'd write
$\mathfrak{so}$ instead of $so$):

    This Lie algebra is $so_n(\mathbb C)$. Let's see why it is so [or
    $so$?].
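
A minimal sketch of the version with \mathfrak, assuming LaTeX with the
amssymb package (which supplies both \mathfrak and \mathbb); the document
wrapper is there only to make the fragment compile:

```latex
\documentclass{article}
\usepackage{amssymb} % loads amsfonts, which defines \mathfrak and \mathbb
\begin{document}
This Lie algebra is $\mathfrak{so}_n(\mathbb{C})$. Let's see why it is so.
\end{document}
```

Here the font change itself is the markup that separates the algebra from
the English word.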

Somebody with better English (and maybe with experience in other fields
of math) will find better examples, I think.

Sasha

------_=_NextPart_001_01C096C5.68F53D00--