<div dir="ltr">Hi Prof. Oepen,<div><br></div><div>I have tried using the command such as 'echo 'this is a _generic_vbd_ .' | ./bin/cheap -default-les=all -cm lingo/erg/english.grm' on the top level of logon, but have had no luck. Cheap outputs that it did not find any lexicon entries for '_generic_vbd_'.</div><div><br></div><div>One thing that I am noticing is that if I look in the _generic_vbd_ entry in the linguistic type database (<a href="http://compling.hss.ntu.edu.sg/ltdb/cgi/ERG_1214//showtype.cgi?typ=v_np%2A_pa-unk_le">link</a>) and choose any surface form listed to replace the word in the sentence, it will activate the generic_vbd rule, and not any other rule. Could there be any potential downsides to using this method?</div><div><br></div><div>Again, I really appreciate your time! Thanks!</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jun 12, 2018 at 5:35 PM, Stephan Oepen <span dir="ltr"><<a href="mailto:oe@ifi.uio.no" target="_blank">oe@ifi.uio.no</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div dir="auto">hmm, what if you disable PoS tagging, i.e. drop the ‘-tagger’ option to cheap?</div></div><div dir="auto"><br></div><div dir="auto">oe</div><div class="HOEnZb"><div class="h5"><div dir="auto"><br></div><div><br><div class="gmail_quote"><div>On Tue, 12 Jun 2018 at 23:23 Johnny Wei <<a href="mailto:jwei@umass.edu" target="_blank">jwei@umass.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div>Hi,<div><br></div><div>Thank you for your response! Is there anyway to prevent sentences like "_generic_vbd_ is a person." to be parseable where "_generic_vbd_" is being recognized as a "NN" POS generic? Would this require turning off the unknown word machinery and adding these generics as lexical entries?</div><div><br></div><div>I am interested because the grammaticality of my language model is important to me. Thanks! Please let me know!</div><div><br></div></div><div class="gmail_extra"></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jun 12, 2018 at 2:10 PM, Stephan Oepen <span><<a href="mailto:oe@ifi.uio.no" target="_blank">oe@ifi.uio.no</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">hi johnny,<br>
<br>
> [...] A sample from my language model might look like "generic_proper_ne had<br>
<span>> VBP_u_unknown a cat ." I want to see if these sequences can be parsed by the<br>
> ERG.<br>
<br>
</span>if you look at ‘gle.tdl’ in the ERG sources, it provides the<br>
definitions of the generic lexical entries that are put to use in<br>
unknown word handling. for debugging purposes (similar to what you<br>
have in mind, i think) they all have a unique orthography—which you<br>
should be able to just give to the parser; please see the attachment,<br>
for how this works on the ERG on-line interface.<br>
<br>
best wishes, oe<br>
</blockquote></div><br><br clear="all"><div><br></div></div><div class="gmail_extra">-- <br><div class="m_-8458890842797887335m_6316285119431448283gmail_signature" data-smartmail="gmail_signature"><div>Johnny Wei</div></div>
</div></blockquote></div></div>
</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr">Johnny Wei</div></div>
</div>