<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; CHARSET=UTF-8">
<META NAME="GENERATOR" CONTENT="GtkHTML/3.10.3">
</HEAD>
<BODY>
On Sun, 2006-12-10 at 23:45 +0100, Yi Zhang wrote:<BR>
<BLOCKQUOTE TYPE=CITE>
<FONT COLOR="#000000">Hi all,</FONT><BR>
<BR>
<FONT COLOR="#000000">I would like to ask about the current state of RMRS in PET. </FONT><BR>
<BR>
<FONT COLOR="#000000">The cfrom/cto of EPs are defined to be character spans. And I do get the correct cfrom/cto when using LKB alone. However, with the SVN PET compiled with recent LKB (from the logon cvs, with the patch for file name confliction of `rmrs/convert.lisp'), the cfrom/cto for eps seems to be token-based span, rather than the character span (see the attached outputs from PET). I get the same `token-based' span when I use the `tsdb::export-trees()' of the fine system to export RMRS in xml format from the treebank. </FONT><BR>
<BR>
<FONT COLOR="#000000">It will be appreciated if anyone can explain to me the current state of RMRS development, especially for the use in PET and itsdb. Is it possible at all to get the character span cfrom/cto in PET and/or tsdb `export' functionality? </FONT><BR>
<BR>
<FONT COLOR="#000000">Best,</FONT><BR>
<FONT COLOR="#000000">Yi</FONT><BR>
<BR>
</BLOCKQUOTE>
Hi Yi,<BR>
<BR>
try experimenting with the -tok parameter: the yy_counts and xml_counts parameter should give you character spans. <BR>
You need to provide the input in yy or Pic format, though. <BR>
<BR>
Standard string input appears to give token positions.... Maybe we could have this added as an option? <BR>
<BR>
Berthold<BR>
<BR>
<BLOCKQUOTE TYPE=CITE>
<BR>
<FONT COLOR="#000000">-----begin-----</FONT><BR>
<FONT COLOR="#000000">...</FONT><BR>
<FONT COLOR="#000000">(1) `the dog barks' [0] --- 2 (0.00|0.01s) <7:58> (673.1K) [0.0s]</FONT><BR>
<FONT COLOR="#000000">derivation[1] (0.4371):the dog barks </FONT><BR>
<BR>
<FONT COLOR="#000000"><rmrs cfrom='-1' cto='-1'></FONT><BR>
<FONT COLOR="#000000"><label vid='1'/> </FONT><BR>
<FONT COLOR="#000000"><ep cfrom='0' cto='3'><gpred>prop-or-ques_m_rel</gpred><label vid='1'/><var sort='e' vid='2'/></ep></FONT><BR>
<FONT COLOR="#000000"><ep cfrom='0' cto='2'><realpred lemma='the' pos='q'/><label vid='6'/><var sort='x' vid='7'/></ep> </FONT><BR>
<FONT COLOR="#000000"><ep cfrom='1' cto='2'><realpred lemma='dog' pos='n' sense='1'/><label vid='10'/><var sort='x' vid='7'/></ep></FONT><BR>
<FONT COLOR="#000000"><ep cfrom='2' cto='3'><realpred lemma='bark' pos='v' sense='1'/><label vid='11'/><var sort='e' vid='2'/></ep> </FONT><BR>
<FONT COLOR="#000000"><rarg><rargname>MARG</rargname><label vid='1'/><var sort='h' vid='3'/></rarg></FONT><BR>
<FONT COLOR="#000000"><rarg><rargname>RSTR</rargname><label vid='6'/><var sort='h' vid='8'/></rarg> </FONT><BR>
<FONT COLOR="#000000"><rarg><rargname>BODY</rargname><label vid='6'/><var sort='h' vid='9'/></rarg></FONT><BR>
<FONT COLOR="#000000"><rarg><rargname>ARG1</rargname><label vid='11'/><var sort='x' vid='7'/></rarg> </FONT><BR>
<FONT COLOR="#000000"><hcons hreln='qeq'><hi><var sort='h' vid='3'/></hi><lo><label vid='11'/></lo></hcons></FONT><BR>
<FONT COLOR="#000000"><hcons hreln='qeq'><hi><var sort='h' vid='8'/></hi><lo><label vid='10'/></lo></hcons> </FONT><BR>
<FONT COLOR="#000000"></rmrs></FONT><BR>
<FONT COLOR="#000000">...</FONT><BR>
<FONT COLOR="#000000">-----end-----</FONT>
</BLOCKQUOTE>
</BODY>
</HTML>