[pet] handling of unknown lexical items

John Stewart cane.cubo at gmail.com
Wed Oct 26 22:42:32 CEST 2011


Rebecca,

Thank you, that is helpful.  I'm not using a tagger; I'm running cheap
0.99.14svn_cm with the ERG (1004) grammar.  I see from
http://www.coli.uni-saarland.de/~thorsten/tnt/ that TnT has a more
restrictive license.  Would any off-the-shelf tagger that produces
Penn Treebank tags work?

Best,

jds

On Wed, Oct 26, 2011 at 3:12 PM, Rebecca Dridan <bec.dridan at gmail.com> wrote:
> Are you using any sort of POS tagger to annotate the input to PET? I assume
> the online demo is using the TnT tagger, which is the default.  How you feed
> those into the parser depends a bit on which version of the parser and the
> grammar you are using, but you'll definitely want POS-tagged input to get
> decent unknown word handling.
>
> Rebecca
>
> On 26/10/11 8:48 PM, John Stewart wrote:
>>
>> Hello,
>>
>> I am trying to reproduce the behaviour of the online demo using the
>> command-line PET + ERG system, but I am running into various problems.
>> One is with unknown words.  For the sentence
>>
>> (1)  ugo kissed pilar
>>
>> The online demo returns _ugo/nn_u_unknown and _pilar/nn_u_unknown ,
>> which is correct.
>>
>> Using the command-line tool as follows:
>>
>>> cheap -default-les=all -verbose=3 -mrs english.grm
>>
>> I get the surprising output:
>>
>> (1011 np_frg_c 0 0 3 [root_frag]
>>   (1007 hdn_bnp_c 0 0 3
>>     (1003 n-hdn_cpd_c 0 0 3
>>       (5 gen_generic_noun/n_-_mc-ns-g_le 0 0 1 []
>>         (1 "ugo" 0 0 1<0:1>))
>>       (1000 hdn-n_prnth_c 0 1 3
>>         (610 generic_pl_noun/n_-_c-pl-unk_le 0 1 2 []
>>           (2 "kissed" 0 1 2<1:2>))
>>         (865 generic_pl_noun_ne/n_-_c-pl-gen_le 0 2 3 []
>>           (3 "pilar" 0 2 3<2:3>))))))
>>
>> So an NP fragment.  Incidentally, I'm unsure how to read the leaf
>> types, as the format, with "/", does not seem to match the templates
>> documented at http://moin.delph-in.net/ErgLeTypes .  In any case,
>> plural nouns are an incorrect default here (I get worse results with
>> -default-les=traditional).  Are there cheap switches that will yield
>> the better output given by the online demo?
>>
>> Thanks for any suggestions.
>>
>> jds
>>
>
>
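On reading the leaf labels in the derivation quoted above: a rough way
to inspect them is to split each label at the "/".  The sketch below
assumes, and this is an assumption not confirmed anywhere in this thread,
that the part before the "/" is the identifier of the (generic) lexical
entry and the part after it is that entry's lexical type.  The names
parse and lexical_leaves are hypothetical helpers, not part of PET.

```python
import re

# Derivation copied from the cheap output quoted in this thread.
DERIVATION = '''
(1011 np_frg_c 0 0 3 [root_frag]
  (1007 hdn_bnp_c 0 0 3
    (1003 n-hdn_cpd_c 0 0 3
      (5 gen_generic_noun/n_-_mc-ns-g_le 0 0 1 []
        (1 "ugo" 0 0 1<0:1>))
      (1000 hdn-n_prnth_c 0 1 3
        (610 generic_pl_noun/n_-_c-pl-unk_le 0 1 2 []
          (2 "kissed" 0 1 2<1:2>))
        (865 generic_pl_noun_ne/n_-_c-pl-gen_le 0 2 3 []
          (3 "pilar" 0 2 3<2:3>))))))
'''

# Tokens: parentheses, double-quoted strings, or runs of other characters.
TOKEN = re.compile(r'\(|\)|"[^"]*"|[^\s()]+')


def parse(text):
    """Turn the bracketed derivation into nested Python lists."""
    stack = [[]]
    for tok in TOKEN.findall(text):
        if tok == "(":
            stack.append([])
        elif tok == ")":
            node = stack.pop()
            stack[-1].append(node)
        else:
            stack[-1].append(tok)
    return stack[0][0]


def lexical_leaves(node):
    """Yield (surface form, entry id, lexical type) for each lexical node.

    Assumption: a label of the form "a/b" means entry "a" of type "b".
    """
    label = node[1]
    children = [c for c in node if isinstance(c, list)]
    if not label.startswith('"') and "/" in label:
        entry, le_type = label.split("/", 1)
        surface = children[0][1].strip('"')
        yield surface, entry, le_type
    for child in children:
        yield from lexical_leaves(child)


if __name__ == "__main__":
    for surface, entry, le_type in lexical_leaves(parse(DERIVATION)):
        print(surface, entry, le_type)
```

Read this way, "kissed" was given a generic plural-noun entry rather than
a verb entry, which matches the NP-fragment analysis reported above.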



