[developers] Question about using PET and the Pet Input Chart with tsdb++
Rebecca Dridan
bec.dridan at gmail.com
Fri Jul 18 17:30:37 CEST 2008
I give batches of sentences to cheap by giving it a file containing
filenames of PIC files, eg:
cat itemlist | cheap -tok=pic ....
where itemlist looks like:
000001.xml
000002.xml
000003.xml
..
The blank lines between filenames seems to be required.
cat *.xml|cheap ...
also seems to work, as long as your xml files have blank lines at the
bottom.
Hope that helps.
Rebecca
Francisco Costa wrote:
> Thank you for the tip.
> I'm running cheap outside tsdb++ with the -tsdbdump option now, and I
> like the results.
>
> Just one more question. At the moment I'm invoking cheap for each
> sentence, which is time consuming even if automatized. Is is possible
> to give a batch of sentences to cheap? What would be the sentence
> delimiter? Or can I use the -server option? What signals the end of an
> item in that case? I've tried running cheap with the -server and
> -tok=xml_counts options and netcatting PIC XML to it, but it doesn't
> seem to do anything. Does it wait for an EOF also in this case?
>
> Thanks again in advance,
>
> Francisco
>
> Rebecca Dridan wrote:
>> Hi Francisco,
>>
>> I eventually gave up trying to parse PIC input from within [incr
>> tsdb()]. From memory, I think the problem was cheap expected EOF at
>> the end of each item, and there didn't seem to be a way of sending
>> that. It's possible there have been changes to PET since (I was
>> trying this March 07), but I ended up running cheap outside [incr
>> tsdb()] with the -tsdbdump option and copying the results in place
>> to view with [incr tsdb()]. It's not quite as convenient, but it works.
>>
>> Rebecca
>>
More information about the developers
mailing list