[developers] Options to extract syntax trees from FFTB

Tuấn Anh Lê tuananh.ke at gmail.com
Thu Feb 27 08:46:08 CET 2020


Hi everyone,

We are trying to use FFTB to treebank a small corpus, and we would like to
extract the chosen syntax trees from it. The expected output would be
something like:

It works --> ("S" ("NP" ("NP" ("it"))) ("VP" ("V" ("V" ("works")))))

Is there a way to extract this from the FFTB profile?

Currently I'm collecting the trees manually: I parse the sentences with ACE
using the --report-label option and then split each output line on the
separator " ;  " (space, semicolon, two spaces), but I'm not sure if this
is the best approach.

[erg-trunk]$ ace -g erg-0.9.26.dat --report-label
It works
SENT: It works
[ LTOP: h0 INDEX: e2 [ e SF: prop TENSE: pres MOOD: indicative PROG: - PERF: - ] RELS: < [ pron<0:2> LBL: h4 ARG0: x3 [ x PERS: 3 NUM: sg GEND: n PT: std ] ]  [ pronoun_q<0:2> LBL: h5 ARG0: x3 RSTR: h6 BODY: h7 ]  [ _work_v_1<3:8> LBL: h1 ARG0: e2 ARG1: x3 ARG2: i8 ] > HCONS: < h0 qeq h1 h6 qeq h4 > ICONS: < > ] ;  ("S" ("NP" ("NP" ("it"))) ("VP" ("V" ("V" ("works")))))
NOTE: 1 readings, added 391 / 68 edges to chart (27 fully instantiated, 35 actives used, 18 passives used) RAM: 1880k
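
For reference, the splitting step could be sketched in Python roughly as
follows. This is only a sketch under assumptions: it expects the captured
ACE output as text, with each reading on one unwrapped line of the form
"MRS ;  tree" and each sentence introduced by a "SENT: " line, as in the
transcript above. The names SEPARATOR, SAMPLE and extract_trees are made up
for illustration and are not part of ACE or FFTB.

```python
# Hypothetical helper: split captured ACE --report-label output into trees.
# Assumption: every reading is a single line "MRS ;  tree".
SEPARATOR = " ;  "  # space, semicolon, two spaces

SAMPLE = (
    "SENT: It works\n"
    '[ LTOP: h0 INDEX: e2 RELS: < > HCONS: < > ICONS: < > ] ;  '
    '("S" ("NP" ("NP" ("it"))) ("VP" ("V" ("V" ("works")))))\n'
    "NOTE: 1 readings, added 391 / 68 edges to chart\n"
)


def extract_trees(ace_output: str) -> dict:
    """Map each sentence to the list of label trees reported for it."""
    trees = {}
    current = None
    for line in ace_output.splitlines():
        if line.startswith("SENT: "):
            # A new sentence starts; its readings follow on later lines.
            current = line[len("SENT: "):]
            trees[current] = []
        elif current is not None and SEPARATOR in line:
            # Keep only the part after the last separator (the tree).
            _mrs, tree = line.rsplit(SEPARATOR, 1)
            trees[current].append(tree.strip())
    return trees


if __name__ == "__main__":
    for sentence, readings in extract_trees(SAMPLE).items():
        for tree in readings:
            print(sentence, "-->", tree)
```

Note that this collects every reading ACE reports for a sentence; picking
the one that was chosen in FFTB is still a separate, manual step.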

Thank you
--
Tuan Anh