<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=us-ascii"><meta name=Generator content="Microsoft Word 12 (filtered medium)"><style><!--
/* Font Definitions */
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0in;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
span.EmailStyle17
        {mso-style-type:personal-compose;
        font-family:"Calibri","sans-serif";
        color:windowtext;}
.MsoChpDefault
        {mso-style-type:export-only;}
@page WordSection1
        {size:8.5in 11.0in;
        margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
        {page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=EN-US link=blue vlink=purple><div class=WordSection1><p class=MsoNormal>Greetings,<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>I am looking for several points of assistance with the use of the ERG with PET. Please accept my apology in advance if this is less than appropriate for the list. I trepidatiously welcome any correspondence telling me “where to go”!<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>First, we could use help in tokenization, particularly with regard to unknown words and MWE. We are, in effect, looking for the behavior of the on-line demo, albeit with PET. We have an SWT user interface that transmits to PET as a web service, so we would prefer a Java solution.<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Second, we have done a lot of work to map ERG types/sorts/predicates to an OWL ontology, but there is more to do. For example, we want the predicates of quantifiers, connectives, and prepositions rigorously classified. This is challenging for non-ERG authors for several reasons, as you are probably aware. (Even the nominal sorts can seem confusing when viewed purely semantically.) If there are existing ontologies that have good coverage or if someone who understands the critical aspects of the ERG non-terminals so as to assist with or review such an ontology, we are interested.<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>FYI, the source text is XHTML from textbooks and sites such as Wikipedia, especially with regard to cellular biology and chemistry and, to a lesser extent, physics and mathematics. This motivates us to want even better handling of affixation, compounds, and various lexical forms, such as chemicals with sub- and super-scripts, too. . In time, the content will become broadly biomedical.<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>I would be happy to share details on the project by request. And to all involved, I would like to express my appreciation for the excellent work in PET and the ERG. We hope that what results will reflect well on your efforts and yield some useful contributions.<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Thank you,<o:p></o:p></p><p class=MsoNormal>Paul<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Paul Haley<o:p></o:p></p><p class=MsoNormal>Automata, Inc.<o:p></o:p></p><p class=MsoNormal>(412) 716-6420<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p></div></body></html>