[developers] lexical filtering rule not firing
Paul Haley
paul at haleyai.com
Sat May 6 23:45:24 CEST 2017
I am stumped by this behavior of some modifications to the ERG using PET and wonder if someone can shed some light on what I must be
missing.
The following lexical item is being left in the chart after lexical processing (full chart of "Some run." is available at
https://1drv.ms/u/s!Am2TXpQ1-kjshyPqQnj5QcjDBOHg)
L [183 1-2 run_n1 (1) 0 {} { : w_period_plr} {}] < blk: 0 dtrs: 58 parents: 299 >
n_pp_c-of_le
This is despite the presence of the following rule:
veto_non-noun_lfr := lexical_filtering_rule & [+CONTEXT <[SYNSEM anti_noun_synsem,TOKENS.+LIST <[+TNT.+TAGS <"NOUN">]>]>,+INPUT
<[SYNSEM [LOCAL.CAT.HEAD noun_or_ttl, PHON.ONSET con_or_voc]]>, +OUTPUT <>, +POSITION "I1 at C1"].
The +INPUT of which matches n_pp_c_of_le and the +CONTEXT of which matches the following (below) which is also left in the chart
(having several variations due to lexical rules).
I am at a loss for why the lexical filtering rule above is not removing the lexical item above. Of course I am most probably wrong
about their "matching"! Nonetheless, any insight would be sincerely appreciated. I have stared at and fiddled with it for far too
many hours.
Thank you,
Paul
P.S. the rules that discard generic LEs given natives and zero probability tokens have been discarded (otherwise the following would
have been pruned rather than, hopefully, leading to the pruning of run as a noun; in this test case)
L [250 1-2 generic_veto_noun (1) 0 {} { : w_period_plr} {}] < blk: 0 dtrs: 64 parents: 274 >
L [249 1-2 generic_veto_noun (1) 0 {} { : v_pas-prt-t_odlr w_period_plr} {}] < blk: 0 dtrs: 64 parents: >
L [248 1-2 generic_veto_noun (1) 0 {} { : v_pas-p-p_odlr w_period_plr} {}] < blk: 0 dtrs: 64 parents: >
L [247 1-2 generic_veto_noun (1) 0 {} { : v_pas-p_odlr w_period_plr} {}] < blk: 0 dtrs: 64 parents: >
L [246 1-2 generic_veto_noun (1) 0 {} { : v_pas_odlr w_period_plr} {}] < blk: 0 dtrs: 64 parents: >
L [245 1-2 generic_veto_noun (1) 0 {} { : v_psp_olr w_period_plr} {}] < blk: 0 dtrs: 64 parents: >
basic_unknown_word
[ ARGS *cons*
[ FIRST sign
[ ARGS *list*,
SYNSEM synsem
[ LOCAL local_min
[ CAT cat_min1
[ HEAD head
[ MINORS minors_basic
[ MIN #1:predsort ],
MOD *list*,
PRD bool,
AUX luk,
INV bool,
CASE case ],
VAL valence_min,
--DT token_head ],
CONT mrs_min0
[ HOOK hook
[ INDEX #2:semarg,
XARG *top*,
LTOP handle,
--SLTOP handle ] ],
AGR *top* ],
--MIN #1,
--SIND #2,
PHON phon_min,
PUNCT punctuation_min,
NONLOC non-local_min,
OPT bool,
LEX luk,
MODIFD xmod_min ],
KEY-ARG bool,
ORTH orthog
[ FIRST *top*,
REST *top*,
FROM #3:"5",
TO string,
FORM string,
LD bracket_list,
LB bracket_list,
RB bracket_list,
RD bracket_list ],
INFLECTD luk,
GENRE genre,
DIALECT dialect,
IDIOM bool,
RNAME basic_ctype ],
REST *top* ],
SYNSEM anti_noun_synsem
[ LOCAL local
[ CAT cat
[ HEAD partn
[ MINORS minors_basic
[ MIN #4:predsort ],
MOD *null*,
PRD bool,
AUX luk,
INV bool,
CASE case,
POSS -,
--BARE luk ],
VAL valence_sp
[ SPCMPS *null*,
SUBJ *null*,
SPR *null*,
COMPS *null*,
SPEC *cons*
[ FIRST anti_synsem_min,
REST *null* ] ],
MC na,
POSTHD bool,
HC-LEX luk,
HS-LEX luk,
NEGPOL luk,
--DT #5:token_head
[ +TI "<5:9>",
+LL ctype
[ -CTYPE- string ],
+TG string ] ],
CONT nom-obj
[ HOOK #6:hook
[ INDEX #7:semarg,
XARG *top*,
LTOP handle,
--SLTOP handle ],
RELS *diff-list*,
HCONS *diff-list*
[ LIST #8:*list*,
LAST #8 ],
ICONS *diff-list*
[ LIST #9:*list*,
LAST #9 ] ],
ARG-S *null*,
CONJ cnil,
AGR #7,
CTXT ctxt_min ],
--MIN #4,
--SIND #7,
PHON phon
[ ONSET unk_onset
[ --TL #10:generic_token_cons
[ FIRST token
[ +CLASS #11:alphabetic
[ +CASE non_capitalized+lower,
+INITIAL - ],
+FROM #3,
+FORM #12:"run.",
+TO "9",
+CARG "run",
+TNT tnt
[ +MAIN #13:tnt_main
[ +TAG "POS",
+PRB "0.99" ],
+TAGS *cons*
[ FIRST "NOUN",
REST *null* ],
+PRBS #14:*cons*
[ FIRST "0.00",
REST *null* ] ],
+PRED "_run/NOUN_u_unknown_rel",
+TRAIT token_trait
[ +UW +,
+IT italics,
+LB #15:bracket_null
[ LIST #16:*list*,
LAST #16 ],
+RB #17:bracket_null
[ LIST #18:*list*,
LAST #18 ],
+LD #19:bracket_null
[ LIST #20:*list*,
LAST #20 ],
+RD #21:bracket_null
[ LIST #22:*list*,
LAST #22 ],
+HD #5 ],
+ID #23:*diff-list*
[ LIST *list*,
LAST *list* ],
+TICK + ],
REST generic_token_null ] ] ],
LKEYS lexkeys
[ KEYREL relation
[ CFROM #3,
CTO #24:"9",
PRED predsort,
LBL handle,
LNK *list* ] ],
PUNCT no_punctuation_min
[ PNCTPR ppair ],
NONLOC non-local
[ REL 0-dlist,
QUE 0-dlist,
SLASH 0-dlist
[ LIST #25:*locallist*,
LAST #25 ] ],
OPT bool,
LEX +,
MODIFD xmod_min,
--BRDG luk ],
KEY-ARG bool,
ORTH orthography
[ FIRST "_veto_NOUN_",
REST *null*,
FROM #3,
CLASS #11,
TO #24,
FORM #12,
LD #19,
LB #15,
RB #17,
RD #21 ],
INFLECTD luk,
GENRE genre,
DIALECT dialect,
IDIOM bool,
RNAME basic_ctype,
C-CONT mrs_min0
[ HOOK #6 ],
ALTS alts_min,
TOKENS tokens
[ +LIST #10,
+LAST token
[ +CLASS #11,
+FROM "5",
+FORM "run.",
+TO #24,
+CARG "run",
+TNT tnt
[ +MAIN #13,
+TAGS *cons*
[ FIRST "NOUN",
REST *null* ],
+PRBS #14 ],
+PRED "_run/NOUN_u_unknown_rel",
+TRAIT token_trait
[ +UW +,
+IT italics,
+LB #15,
+RB #17,
+LD #19,
+RD #21,
+HD #5 ],
+ID #23,
+TICK + ] ] ]
I [64 () 1-2 <5:9> "" "run." { : } {NOUN 0}] < blk: 0 >
token
[ +CLASS alphabetic
[ +CASE non_capitalized+lower,
+INITIAL - ],
+FROM "5",
+FORM "run.",
+TO "9",
+CARG "run",
+TNT tnt
[ +MAIN tnt_main
[ +TAG "POS",
+PRB "0.99" ],
+TAGS *cons*
[ FIRST "NOUN",
REST *null* ],
+PRBS *cons*
[ FIRST "0.00",
REST *null* ] ],
+PRED "_run/NOUN_u_unknown_rel",
+TRAIT token_trait
[ +UW +,
+IT italics,
+LB bracket_null
[ LIST #1:*list*,
LAST #1 ],
+RB bracket_null
[ LIST #2:*list*,
LAST #2 ],
+LD bracket_null
[ LIST #3:*list*,
LAST #3 ],
+RD bracket_null
[ LIST #4:*list*,
LAST #4 ],
+HD token_head
[ +TI "<5:9>",
+LL ctype
[ -CTYPE- string ],
+TG string ] ],
+ID *diff-list*
[ LIST *list*,
LAST *list* ],
+TICK + ]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.delph-in.net/archives/developers/attachments/20170506/c384288d/attachment-0001.html>
More information about the developers
mailing list