<div dir="ltr">G'day,<div><br></div><div>I have been asked about the current state of non-English treebanks. I have answered about Hinoki (9,128 treebanked sentences out of 15,000, slightly old grammar). Are there any other treebanks people think it may be interesting to learn grammars off?</div><div><br></div><div>I will try and summarize replies on the wiki.</div><div><br></div><div>Francis</div><div><br><div class="gmail_quote">---------- Forwarded message ----------<br>From: <b class="gmail_sendername">David R. Mortensen</b> <span dir="ltr"><<a href="mailto:dmortens@cs.cmu.edu">dmortens@cs.cmu.edu</a>></span><br>Date: Thu, Jul 20, 2017 at 10:51 PM<br>Subject: Treebanks with MRS<br>To: <a href="mailto:bond@ieee.org">bond@ieee.org</a>, <a href="mailto:fcbond@ntu.edu.sg">fcbond@ntu.edu.sg</a><br><br><br>
<div bgcolor="#FFFFFF">
<font face="Consolas">Francis,<br>
<br>
We met at CMU earlier this year. It was very interesting to talk
to you. It turns out that I now have some HPSG/MRS related
questions for you: I'm participating in a JSALT workshop
(<a class="gmail-m_-9028170088770340545moz-txt-link-freetext" href="https://www.lti.cs.cmu.edu/2017-jelinek-workshop" target="_blank">https://www.lti.cs.cmu.edu/<wbr>2017-jelinek-workshop</a>) on neural
machine translation and we have become interested in MRS as a
semantic representation for NMT. For English, we have a rich
resource in form of DeepBank. We have found that ERG is not robust
enough to do our MRS parsing but Buys and Blunsom
(<a class="gmail-m_-9028170088770340545moz-txt-link-freetext" href="https://arxiv.org/abs/1704.07092" target="_blank">https://arxiv.org/abs/1704.<wbr>07092</a>) have demonstrated a robust
transition-based parser for MRS that they trained on a subset of
DeepBank, and we may do something similar.<br>
<br>
We are now interested in other resources that contain significant
amounts of MRS. I noticed that you mention MRS in a paper on the
Hinoko Treebank. Is this treebank available? Do you know how many
MRSs it contains? Do you know of other treebanks that contain
significant numbers of hand-annotated/corrected MRS
representations, especially for non-English languages?<span class="gmail-HOEnZb"><font color="#888888"><br>
</font></span></font><span class="gmail-HOEnZb"><font color="#888888">
<pre class="gmail-m_-9028170088770340545moz-signature" cols="72">--
Best,
David R. Mortensen
Research Scientist
Language Technologies Institute
Carnegie Mellon University</pre>
</font></span></div>
</div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature">Francis Bond <<a href="http://www3.ntu.edu.sg/home/fcbond/" target="_blank">http://www3.ntu.edu.sg/home/fcbond/</a>><br>Division of Linguistics and Multilingual Studies<br>Nanyang Technological University<br></div>
</div></div>