[developers] Treebanking and training with FFT

Francis Bond bond at ieee.org
Wed May 13 15:20:12 CEST 2020


Hi,

We successfully treebanked recently, using (and updating) the wiki page.
 Is the webdir correct?  It should have the files control.js, index.html
and render.js in it.   We found it in  ace-tools-x86.0.9.31/assets  (but
not in 0.9.30).  However 0.9.31 did not work for some reason, so we used
the grammar and fftb from 0.9.30 and the webdir from 0.9.31.   They are
also included somewhere in the logon tree.

I hope this helps.





On Wed, May 13, 2020 at 9:09 PM Petter Haugereid <petterha at gmail.com> wrote:

> Hi everybody,
>
> I have been trying over some days to make treebanking work with FFT.
> Following instructions on the DELPH-IN site, I have given the commands
> below, and I end up with a browser window with the items of the profile I
> attempt to treebank. However, when I click on one of the items, I get an
> error message "404 Not Found". Do any of you know what I am doing wrong?
>
> Here are the commands (with full paths):
> mkprof -s ~/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/ /tmp/mrs-test
> art -f -a '~/tools/ace-0.9.30/ace --disable-generalization -g
> ~/tools/ace-0.9.30/norwegian-small.dat -O' /tmp/mrs-test
> ~/acetools-x86-0.9.30/fftb -g ~/tools/ace-0.9.30/norwegian-small.dat
>  --browser --webdir ~/acetools /tmp/mrs-test/
>
> I am quite keen to get a statistical model for my grammar, so I have tried
> to train a model from a small treebank which I have disambiguated with the
> logon tool. When I try to train with LOGON, only get a lot of garbage
> collection messages, and I eventually have to kill the process. When I try
> to train with FFT with the following commands, I get the messages below:
>
> mkprof -s ~/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/ /tmp/mrs-test/
> art -a '~/tools/ace-0.9.30/ace -g ~/tools/ace-0.9.30/norwegian-small.dat
> -O' -f /tmp/mrs-test/
> FFGRANDPARENT=0 ~/acetools-x86-0.9.30/ffmaster 1 mrs-test-gp0.mem &
> FFGRANDPARENT=0 ~/acetools-x86-0.9.30/ffworker
> ~/tools/ace-0.9.30/norwegian-small.dat /tmp/mrs-test/
> ~/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/ localhost
>
> # loading /tmp/mrs-test/...
> # loading /home/petter/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/...
> # loading gold
> #  ... iid 1 -- gold tree 1 / 1 not in parse forest
> #  ... iid 2 -- gold tree 1 / 1 not in parse forest
> #  ... iid 3 -- gold tree 1 / 1 not in parse forest
> ...
> #  ... iid 68 -- gold tree 1 / 1 not in parse forest
> #  ... iid 69 -- gold tree 1 / 1 not in parse forest
> # loaded 0 ambiguous feature forests with gold trees
> # [1]+  Exit 255                FFGRANDPARENT=0
> ~/acetools-x86-0.9.30/ffmaster 1 mrs-test-gp0.mem
> # Floating point exception (core dumped)
>
> I tried the same commands with the ERG MRS treebank in LOGON, and I was
> able to train a model with it. I suspect the reason I don't succeed, is
> that I have treebanked with LOGON, while Dan has used FFT.
>
> Here are links to
> 1) the MRS treebank
>  https://www.dropbox.com/s/7mj53j644vwhbes/mrs.2020.05.12.tgz?dl=0
> 2) The Norwegian MRS items I have treebanked
> https://www.dropbox.com/s/qfhuqwnxlz0e1pb/mrs.txt?dl=0
> 3) The Norsyg grammar (loading 'lkb/small-script' with the LKB,
> 'ace/config-small.tdl' with ACE is sufficient)
> https://www.dropbox.com/s/rmoy6q40dvz1dxh/norsyg.20-05-13.tgz?dl=0
> 4) A compiled version of the grammar, compiled with ace-0.9.30
> https://www.dropbox.com/s/cb0dq9omuhojlmv/norwegian-small.dat?dl=0
>
> If someone can point me to what I am doing wrong, I would be very greatful!
>
> Best,
>
> Petter
>


-- 
Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
Division of Linguistics and Multilingual Studies
Nanyang Technological University
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.delph-in.net/archives/developers/attachments/20200513/f657bc7c/attachment.html>


More information about the developers mailing list