[developers] Question about RMRS cfrom/cto in PET
Ben Waldron
bmw20 at cl.cam.ac.uk
Mon Dec 11 12:09:33 CET 2006
Berthold Crysmann wrote:
> Hi Yi,
>
> try experimenting with the -tok parameter: the yy_counts and
> xml_counts parameter should give you character spans.
> You need to provide the input in yy or Pic format, though.
If you are providing a character string as input (rather than
preprocessed YY, PIC or SMAF input) try the -tok=fsr parameter. This
will activate the FSPP tokeniser (which is used by the LKB) and
behaviour should be identical to that you get when running the LKB.
- Ben
> Standard string input appears to give token positions.... Maybe we
> could have this added as an option?
More information about the developers
mailing list