[developers] Treebanking and training with FFT

Francis Bond bond at ieee.org
Thu May 14 03:38:12 CEST 2020


Great.

I added a bit more to the documentation, just in case.

On Thu, May 14, 2020 at 1:45 AM Petter Haugereid <petterha at gmail.com> wrote:

> Yes, it helped!
> I changed the webdir to ~/logon/lingo/answer/fftb/ (where I found the
> files you mentioned), and then I could treebank with fftb. I was also able
> to train a model.
> Thank you very much!
>
> Petter
>
> On Wed, May 13, 2020 at 3:20 PM Francis Bond <bond at ieee.org> wrote:
>
>> Hi,
>>
>> We successfully treebanked recently, using (and updating) the wiki page.
>>  Is the webdir correct?  It should have the files control.js, index.html
>> and render.js in it.   We found it in  ace-tools-x86.0.9.31/assets  (but
>> not in 0.9.30).  However 0.9.31 did not work for some reason, so we used
>> the grammar and fftb from 0.9.30 and the webdir from 0.9.31.   They are
>> also included somewhere in the logon tree.
>>
>> I hope this helps.
>>
>>
>>
>>
>>
>> On Wed, May 13, 2020 at 9:09 PM Petter Haugereid <petterha at gmail.com>
>> wrote:
>>
>>> Hi everybody,
>>>
>>> I have been trying over some days to make treebanking work with FFT.
>>> Following instructions on the DELPH-IN site, I have given the commands
>>> below, and I end up with a browser window with the items of the profile I
>>> attempt to treebank. However, when I click on one of the items, I get an
>>> error message "404 Not Found". Do any of you know what I am doing wrong?
>>>
>>> Here are the commands (with full paths):
>>> mkprof -s ~/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/ /tmp/mrs-test
>>> art -f -a '~/tools/ace-0.9.30/ace --disable-generalization -g
>>> ~/tools/ace-0.9.30/norwegian-small.dat -O' /tmp/mrs-test
>>> ~/acetools-x86-0.9.30/fftb -g ~/tools/ace-0.9.30/norwegian-small.dat
>>>  --browser --webdir ~/acetools /tmp/mrs-test/
>>>
>>> I am quite keen to get a statistical model for my grammar, so I have
>>> tried to train a model from a small treebank which I have disambiguated
>>> with the logon tool. When I try to train with LOGON, only get a lot of
>>> garbage collection messages, and I eventually have to kill the process.
>>> When I try to train with FFT with the following commands, I get the
>>> messages below:
>>>
>>> mkprof -s ~/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/ /tmp/mrs-test/
>>> art -a '~/tools/ace-0.9.30/ace -g ~/tools/ace-0.9.30/norwegian-small.dat
>>> -O' -f /tmp/mrs-test/
>>> FFGRANDPARENT=0 ~/acetools-x86-0.9.30/ffmaster 1 mrs-test-gp0.mem &
>>> FFGRANDPARENT=0 ~/acetools-x86-0.9.30/ffworker
>>> ~/tools/ace-0.9.30/norwegian-small.dat /tmp/mrs-test/
>>> ~/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/ localhost
>>>
>>> # loading /tmp/mrs-test/...
>>> # loading /home/petter/logon/lingo/lkb/src/tsdb/home/mrs.2020.05.12/...
>>> # loading gold
>>> #  ... iid 1 -- gold tree 1 / 1 not in parse forest
>>> #  ... iid 2 -- gold tree 1 / 1 not in parse forest
>>> #  ... iid 3 -- gold tree 1 / 1 not in parse forest
>>> ...
>>> #  ... iid 68 -- gold tree 1 / 1 not in parse forest
>>> #  ... iid 69 -- gold tree 1 / 1 not in parse forest
>>> # loaded 0 ambiguous feature forests with gold trees
>>> # [1]+  Exit 255                FFGRANDPARENT=0
>>> ~/acetools-x86-0.9.30/ffmaster 1 mrs-test-gp0.mem
>>> # Floating point exception (core dumped)
>>>
>>> I tried the same commands with the ERG MRS treebank in LOGON, and I was
>>> able to train a model with it. I suspect the reason I don't succeed, is
>>> that I have treebanked with LOGON, while Dan has used FFT.
>>>
>>> Here are links to
>>> 1) the MRS treebank
>>>  https://www.dropbox.com/s/7mj53j644vwhbes/mrs.2020.05.12.tgz?dl=0
>>> 2) The Norwegian MRS items I have treebanked
>>> https://www.dropbox.com/s/qfhuqwnxlz0e1pb/mrs.txt?dl=0
>>> 3) The Norsyg grammar (loading 'lkb/small-script' with the LKB,
>>> 'ace/config-small.tdl' with ACE is sufficient)
>>> https://www.dropbox.com/s/rmoy6q40dvz1dxh/norsyg.20-05-13.tgz?dl=0
>>> 4) A compiled version of the grammar, compiled with ace-0.9.30
>>> https://www.dropbox.com/s/cb0dq9omuhojlmv/norwegian-small.dat?dl=0
>>>
>>> If someone can point me to what I am doing wrong, I would be very
>>> greatful!
>>>
>>> Best,
>>>
>>> Petter
>>>
>>
>>
>> --
>> Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
>> Division of Linguistics and Multilingual Studies
>> Nanyang Technological University
>>
>

-- 
Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
Division of Linguistics and Multilingual Studies
Nanyang Technological University
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.delph-in.net/archives/developers/attachments/20200514/9063d6b9/attachment.html>


More information about the developers mailing list