[lkb] generation ranking in LKB

Stephan Oepen oe at ifi.uio.no
Sat Jul 10 12:00:55 CEST 2010


any number will activate n-best selective unpacking, so you can choose  
how many realizations you want returned (at most).  as for the  
specific model, it's the ‘combined’ model from the EMNLP paper;  
however, just now i'm not sure the MEM file has been updated for the  
change in ERG rule identifiers.  in principle, that model should be  
retrained, i guess.  we have been a little slack about this in recent  
ERG releases ...

cheers, oe



On 10. juli 2010, at 11.44, Xuchen Yao <xuchen at coli.uni-saarland.de>  
wrote:

> Hi Stephan,
>
> Thanks for the reply! I wasn't taking advantage of this ranking  
> mechanism until your response.
>
> I've seen the big "bnc.blm" file at your prompt. May I further know  
> the actual ranking used is based on language models or MaxEnt? I  
> want to have an impression of how well it should work according to  
> the result of your original paper.
>
> BTW, The number "42" in
>
> (generate-from-mrs
>            (mrs::extract-mrs (first *parse-record*))
>            :nanalyses 42)
>
> was a bit confusing at first but after digging into the code, I saw  
> you wanted to "have some cake and eat some", so I guess I should  
> always use "42" here.;-)
>
> Xuchen
>
> On 07/10/2010 11:15 AM, Stephan Oepen wrote:
>> hi xuchen,
>>
>> i am assuming you are thinking of generation using the ERG here?  the
>> following lines in the `script' file enable realization ranking:
>>
>>   #+(and :tsdb :logon)
>>   (tsdb::read-model (lkb-pathname (parent-directory) "jhpstg.g.mem"))
>>   #+(and :tsdb :logon)
>>   (setf *unpacking-scoring-hook* #'tsdb::mem-score-configuration)
>>
>> the reason this is only supported in the LOGON tree is its dependency
>> on the CMU LM toolkit and the actual language model, which were  
>> deemed
>> too space-consuming to be included in the general LKB SVN.
>>
>> so, assuming you are actually in the LOGON tree, you should be able  
>> to
>> use realization ranking.  it will be turned on by default when  
>> running
>> in [incr tsdb()], i.e. batch generating.  for interactive use, i  
>> think
>> you would have to pass the optional :nanalyses argument.  e.g.
>>
>>   LKB(22): (do-parse-tty "kim gave the big angry dog a bone.")
>>   720
>>   375
>>   -1
>>   30937
>>   10
>>   LKB(23): (generate-from-mrs
>>             (mrs::extract-mrs (first *parse-record*))
>>             :nanalyses 42)
>>   ("Kim gave the angry big dog a bone."
>>    "Kim gave the big angry dog a bone."
>>    "Kim gave a bone to the angry big dog."
>>    "Kim gave a bone to the big angry dog.")
>>   86312
>>   2780
>>   728
>>   3008
>>   800
>>   337
>>   635
>>   LKB(24): (loop for edge in *gen-record* collect (edge-score edge))
>>   (2.20243 1.8947849 1.7799873 1.4181345)
>>
>> too bad the adjective ordering ends up unnatural in this example, but
>> the model appears to be doing something (useful).
>>
>>                                                         best  -  oe
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ 
>> +++++++++++
>> +++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; (+47) 2284 0125
>> +++    --- oe at ifi.uio.no; stephan at oepen.net; http://www.emmtee.net/oe/ 
>>  ---
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ 
>> +++++++++++
>>




More information about the lkb mailing list