[developers] [incr tsdb()] Process | Vocabulary output

Mon Aug 20 20:11:23 CEST 2012

Following up on the thread below:

Either I haven't understood your answer properly, or something else
is going on.  For my current Chintang grammar, I get results like
this:

ba-ce-ko | 1 references | [1 + 2] lexical entrie(s);

In lexicon.tdl, I have an entry with [ ORTH "ba" ], but none with
[ORTH "ba-ce-ko" ].   Could this instead mean something like
"there is one analysis available of this word, and it involves
the application of two lexical rules"?  (That would be consistent
with what I see in the parse chart, parsing just this item.)

In some other cases, it looks like a result of [ 1 + 0 ] lexical entrie(s)
means that the orthographic rules allowed the LKB to strip
affixes and find a stem, but then the tfs associated with those
lexical rules didn't unify.

Regarding dag-inflected-p, I haven't said anything yet in user-fns.lsp,
so that's whatever the default is in the LKB.  (And I'm not sure I
would know how to write something more appropriate, as we're
now using a complex fs as the value of INFLECTED.  Can
dag-inflected-p check whether the value of that feature unifies
with some other type, or can it only check types as strings?)

Thanks,
Emily

On Wed, Aug 8, 2012 at 2:53 PM, Dan Flickinger <danf at stanford.edu> wrote:
> The test being applied is the function dag-inflected-p, which can be defined in a grammar's user-fns.lsp file.  In the ERG, this is a check for the value of the feature INFLECTD.
>
>  Dan
>
> ----- Original Message -----
> From: "Emily M. Bender" <ebender at uw.edu>
> To: "Dan Flickinger" <danf at stanford.edu>
> Cc: "developers" <developers at delph-in.net>
> Sent: Wednesday, August 8, 2012 2:16:28 PM
> Subject: Re: [developers] [incr tsdb()] Process | Vocabulary output
>
> Thanks, Dan. What property is being used to indicate "fully inflected"
> in this case?
>
> Emily
>
> On Wed, Aug 8, 2012 at 1:50 PM, Dan Flickinger <danf at stanford.edu> wrote:
>> Hi Emily -
>>
>> My understanding is that the first of the two numbers in the "[n + n]" report is the number of already inflected lexical entries defined in the lexicon with the given orthography, and the second number is the number of lexical rules that can apply to a lexicon-defined entry's stem to produce the given orthography.  So [basic + derived].  Thus, in the ERG, "dog" is reported as "[0 + 2]" since there is no fully inflected lexical entry defined with that spelling, but there are two derived forms with that spelling, applying an inflectional rule to each of the noun and verb entries defined in the lexicon.
>>
>>  Dan
>>
>> ----- Original Message -----
>> From: "Emily M. Bender" <ebender at uw.edu>
>> To: "developers" <developers at delph-in.net>
>> Sent: Thursday, July 19, 2012 4:00:58 PM
>> Subject: [developers] [incr tsdb()] Process | Vocabulary output
>>
>> Dear all,
>>
>> Is there any documentation on the output of the Process | Vocabulary
>> function in [incr tsdb()]?  I'm curious in particular what the two
>> numbers in "[n + n] lexical entries" mean, but couldn't turn up
>> anything on the wiki.
>>
>> Thanks,
>> Emily
>>
>> --
>> Emily M. Bender
>> Associate Professor
>> Department of Linguistics
>> Check out CLMS on facebook! http://www.facebook.com/uwclma
>
>
>
> --
> Emily M. Bender
> Associate Professor
> Department of Linguistics
> Check out CLMS on facebook! http://www.facebook.com/uwclma

-- 
Emily M. Bender
Associate Professor
Department of Linguistics
Check out CLMS on facebook! http://www.facebook.com/uwclma