[developers] Searching treebanks

Lars Hellan lars.hellan at ntnu.no
Wed Feb 26 14:21:43 CET 2020


Hi Francis,

For Norwegian you can do such things through https://typecraft.org/tc2wiki/Norwegian_Valency_Corpus, a corpus of about 20,000 sentences.


(Not right on your mark, but perhaps not too far from the sphere of "anything" ...)


Best

Lars

________________________________
From: developers-bounces at emmtee.net <developers-bounces at emmtee.net> on behalf of Francis Bond <bond at ieee.org>
Sent: Wednesday, February 26, 2020 2:02:28 PM
To: Stephan Oepen; developers at delph-in.net; Rebecca Dridan; Timothy Baldwin
Subject: [developers] Searching treebanks

G'day,

does anyone know of any way to search Redwoods (or DELPHIN treebanks in general)  for trees of a certain type (using something like the Fangorn interface).  For example, I want to find how often in the treebank 'start' is intransitive vs NP V VP-ving  vs NP V VP-to vs NP V VP NP  (I start; I start lecturing; I start to lecture; I start a lecture).

In fangorn this was "//VP/VB/start[->S/VP/VBG" for NP V VP-ving, ...

I would be ecstatic if there were an online search I can point my students at, but would be interested in anything.



--
Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
Division of Linguistics and Multilingual Studies
Nanyang Technological University
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.delph-in.net/archives/developers/attachments/20200226/36bcb567/attachment.html>


More information about the developers mailing list