[erg] new version of WikiWoods TreeCache

Stephan Oepen oe at ifi.uio.no
Tue Oct 30 21:02:02 CET 2012


colleagues,

breaking in a new compute cluster at UiO these past few weeks, i
finally generated a new full version of the WikiWoods Treecache,
i.e. a collection of ERG analyses (syntax and semantics) for the
English Wikipedia (as of late 2008).  this version of WikiWoods
was produced with the 1111 release of the ERG.

i still need to update and elaborate the WikiWoods web page (at
‘http://www.delph-in.net/wikiwoods’), but in case anyone is
interested in the data already, it is available for download at:

  http://ltr.uio.no/wikiwoods/1111

the [incr tsdb()] profiles are around 65 gigabytes compressed;
the usual sets of exports (tokens, derivation, labelled tree, MRS,
and EDS) come to about 120 gigabytes compressed.

we are very interested in knowing how WikiWoods is used (if
at all) and would be very grateful for any comments, including
of course suggestions for improvement.

best wishes, oe

nb: for folks at UiO, the individual files of all three WikiWoods
releases are locally visible in ‘/norstore/project/ltr/DELPH-IN/’.

+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
+++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; (+47) 2284 0125
+++    --- oe at ifi.uio.no; stephan at oepen.net; http://www.emmtee.net/oe/ ---
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


