<div dir="ltr">Thanks, Woodley.<div><br></div><div>We could also use word shape and the token mapping to map generic entities, ...</div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Jan 12, 2017 at 3:55 PM, Woodley Packard <span dir="ltr">&lt;<a href="mailto:sweaglesw@sweaglesw.org" target="_blank">sweaglesw@sweaglesw.org</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word">Hi Olga and Francis,<div><br></div><div>If what you want to do is define certain generic lexemes that apply to all words, you do not need a POS tagger at all. You will need to enable token mapping, but you need not write any token mapping rules. Next, you need to create some lexical entries whose TDL status is "generic-lex-entry". Each of these gets instantiated on every token, so some caution is wise. The token feature structure is unified into the lexeme it licenses at a grammar-defined path, so the explosion can be controlled by placing constraints at that path. When POS tagging is used, the POS information lives on the token feature structures, and typical unknown-word-aware DELPH-IN grammars constrain the POS values on the tokens of generic lexical entries to select which generic lexical entry applies in which POS situation. You can also use the so-called "lexical filtering" stage to throw out some generic (or native, if you want) lexical entries.
See the ERG’s lfr.tdl for examples (Dan uses this to discard generic lexemes proposed by the tagger in situations where the grammar has native lexical coverage).</div><div><br></div><div>If you find that you need help getting some portions of this airborne, let me know.</div><div><br></div><div>Regards,</div><div>Woodley</div><div><div class="m_911005092495834604h5"><div><br><div><blockquote type="cite"><div>On Jan 10, 2017, at 9:28 PM, Francis Bond &lt;<a href="mailto:bond@ieee.org" target="_blank">bond@ieee.org</a>&gt; wrote:</div><br class="m_911005092495834604m_-7447715687815317943Apple-interchange-newline"><div><div dir="ltr">There is some discussion here (for PET):<div><a href="http://moin.delph-in.net/PetInput" target="_blank">http://moin.delph-in.net/PetIn<wbr>put</a><br></div><div><br></div><div>The chart mapping approach also works for ACE.</div><div><br></div><div>You have to use an external POS tagger (which you could fake if all unknown words get the same POS).</div><div><br></div><div>We have this working for Zhong and Jacy (and I think INDRA). I would be happy to walk you through it if you promise to enhance the documentation :-).</div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jan 10, 2017 at 4:57 PM, Olga Zamaraeva <span dir="ltr">&lt;<a href="mailto:olzama@uw.edu" target="_blank">olzama@uw.edu</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">Dear Developers,<div><br></div><div>I would like to parse some text (with ACE) using a small grammar, and I am likely to encounter stems that I do not have in the lexicon. My understanding is that it is possible to add a generic lexical entry for, e.g., "verb", and analyze some of the unknown words morphologically this way. I am looking for any documentation or advice on how this is done.
Would anyone be able to point me to anything?</div><div><br></div><div><br></div><div>Thank you,</div><div>Olga</div></div> </blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="m_911005092495834604m_-7447715687815317943gmail_signature" data-smartmail="gmail_signature">Francis Bond &lt;<a href="http://www3.ntu.edu.sg/home/fcbond/" target="_blank">http://www3.ntu.edu.sg/home/f<wbr>cbond/</a>&gt;<br>Division of Linguistics and Multilingual Studies<br>Nanyang Technological University<br></div> </div> </div></blockquote></div><br></div></div></div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="m_911005092495834604gmail_signature" data-smartmail="gmail_signature">Francis Bond &lt;<a href="http://www3.ntu.edu.sg/home/fcbond/" target="_blank">http://www3.ntu.edu.sg/home/<wbr>fcbond/</a>&gt;<br>Division of Linguistics and Multilingual Studies<br>Nanyang Technological University<br></div> </div></div>