<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; CHARSET=UTF-8">
<META NAME="GENERATOR" CONTENT="GtkHTML/3.10.3">
</HEAD>
<BODY>
On Thu, 2006-11-09 at 17:39 +1100, David Martinez wrote:
<BLOCKQUOTE TYPE=CITE>
<PRE>
<FONT COLOR="#000000"> Hi,</FONT>
<FONT COLOR="#000000"> I have been playing with the parameters "nsolutions", "results", </FONT>
<FONT COLOR="#000000">and "packing" and I can't get the output I need: a single export tree for </FONT>
<FONT COLOR="#000000">each parsed sentence.</FONT>
</PRE>
</BLOCKQUOTE>
I would guess you need -results=1, packing=15 and -nsolutions >>0 if you need selective unpacking. Please correct me, if I'm wrong. <BR>
<BR>
Otherwise, -packing=7 -results=1 should always give you the globally optimal candidate. That's what we've been using here so far. See also Francis's synopsis. <BR>
<BR>
B<BR>
<BR>
<BLOCKQUOTE TYPE=CITE>
<PRE>
<FONT COLOR="#000000"> When using nsolutions=1 and packing=15 in most cases I obtain a </FONT>
<FONT COLOR="#000000">set of "inactive" trees from the sentence. Is there some way of choosing </FONT>
<FONT COLOR="#000000">the best candidate?</FONT>
<FONT COLOR="#000000"> Thank you,</FONT>
<FONT COLOR="#000000"> David</FONT>
<FONT COLOR="#000000">On Wed, 8 Nov 2006, Berthold Crysmann wrote:</FONT>
<FONT COLOR="#000000">> On Wed, 2006-11-08 at 10:57 +0100, Yi Zhang wrote:</FONT>
<FONT COLOR="#000000">>> Hi,</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> Here are my understanding of the options:</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> -results sits in the output routine and stops it printing all</FONT>
<FONT COLOR="#000000">>> the</FONT>
<FONT COLOR="#000000">>> results. They are still all calculated.</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> I think that's right.</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> -nsolutions asks cheap to only produce the top "n" parses.</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> Due to the use of ambiguity packing, the parsing is splitted into two</FONT>
<FONT COLOR="#000000">>> phases: i) packed parse forest creation; ii) unpacking the readings.</FONT>
<FONT COLOR="#000000">>> `-nsolutions' can have effect in both phases.</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> In the first phase, if `-nsolutions' is set to be non-zero, the forest</FONT>
<FONT COLOR="#000000">>> creation phase will stop when the `first' n (with kind of beam search</FONT>
<FONT COLOR="#000000">>> i think) packed trees are found. If `-nsolutions' is not set or set to</FONT>
<FONT COLOR="#000000">>> be zero, the entire packed parse forest will be created.</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> In the unpacking phase, the effect depends on the unpacking mechanism</FONT>
<FONT COLOR="#000000">>> used:</FONT>
<FONT COLOR="#000000">>> - if `packing=7' (which is the default exhaustive unpacking) is used,</FONT>
<FONT COLOR="#000000">>> all the readings will be unpacked (with lots of unification operations</FONT>
<FONT COLOR="#000000">>> replayed), and sorted according to the scoring model. `-nsolutions'</FONT>
<FONT COLOR="#000000">>> won't have any effect on this phase. So you might finally get more</FONT>
<FONT COLOR="#000000">>> readings than `-nsolutions'.</FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">>> - if `packing=15' (selective unpacking) is used, only the best n</FONT>
<FONT COLOR="#000000">>> readings will be unpacked from the parse forest. But note that</FONT>
<FONT COLOR="#000000">>> `-nsolutions' must be set to >0, otherwise the parser will fall back</FONT>
<FONT COLOR="#000000">>> into exhaustive unpacking like `-packing=7'. Current implementation</FONT>
<FONT COLOR="#000000">>> supports the basic branching and grand-parenting (with arbitrary</FONT>
<FONT COLOR="#000000">>> number of levels) features in the scoring model.</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">> Grandparenting in Pet sounds like a great improvement. What is still</FONT>
<FONT COLOR="#000000">> missing, as compared to lkb/tsdb++? Ngrams?</FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">> Thanks for the description.</FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">> B</FONT>
<FONT COLOR="#000000">></FONT>
<FONT COLOR="#000000">>> I also think the use of `-nsolutions' is particularly vague at the</FONT>
<FONT COLOR="#000000">>> moment. I believe this is partly due to the split of the parsing</FONT>
<FONT COLOR="#000000">>> phases. To PET developers, should the option be splitted for</FONT>
<FONT COLOR="#000000">>> particular phases of parsing?</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> Stephan and Bernd, please correct me if I am wrong :-)</FONT>
<FONT COLOR="#000000">>></FONT>
<FONT COLOR="#000000">>> Best,</FONT>
<FONT COLOR="#000000">>> yi</FONT>
<FONT COLOR="#000000">></FONT>
</PRE>
</BLOCKQUOTE>
</BODY>
</HTML>