[developers] Searching treebanks

Francis Bond bond at ieee.org
Wed Feb 26 14:28:32 CET 2020


Thanks for the tip.    If only we all sensibly annotated our corpora with
typecraft.

On Wed, Feb 26, 2020 at 9:21 PM Lars Hellan <lars.hellan at ntnu.no> wrote:

> Hi Francis,
>
> For Norwegian you can do such things through
> https://typecraft.org/tc2wiki/Norwegian_Valency_Corpus, a corpus of about
> 20,000 sentences.
>
>
> (Not right on your mark, but perhaps not too far from the sphere of
> "anything" ...)
>
>
> Best
>
> Lars
> ------------------------------
> *From:* developers-bounces at emmtee.net <developers-bounces at emmtee.net> on
> behalf of Francis Bond <bond at ieee.org>
> *Sent:* Wednesday, February 26, 2020 2:02:28 PM
> *To:* Stephan Oepen; developers at delph-in.net; Rebecca Dridan; Timothy
> Baldwin
> *Subject:* [developers] Searching treebanks
>
> G'day,
>
> does anyone know of any way to search Redwoods (or DELPHIN treebanks in
> general)  for trees of a certain type (using something like the Fangorn
> interface).  For example, I want to find how often in the treebank 'start'
> is intransitive vs NP V VP-ving  vs NP V VP-to vs NP V VP NP  (I start; I
> start lecturing; I start to lecture; I start a lecture).
>
> In fangorn this was "//VP/VB/start[->S/VP/VBG" for NP V VP-ving, ...
>
> I would be ecstatic if there were an online search I can point my students
> at, but would be interested in anything.
>
>
>
> --
> Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
> Division of Linguistics and Multilingual Studies
> Nanyang Technological University
>


-- 
Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
Division of Linguistics and Multilingual Studies
Nanyang Technological University
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.delph-in.net/archives/developers/attachments/20200226/85844b83/attachment.html>


More information about the developers mailing list