[developers] Question about using PET and the Pet Input Chart with tsdb++
Francisco Costa
fcosta at di.fc.ul.pt
Fri Jul 18 17:58:24 CEST 2008
That's exactly what I needed. Thank you.
Francisco
Rebecca Dridan wrote:
> I give batches of sentences to cheap by giving it a file containing
> filenames of PIC files, e.g.:
>
> cat itemlist | cheap -tok=pic ....
>
> where itemlist looks like:
>
> 000001.xml
>
> 000002.xml
>
> 000003.xml
>
> ..
>
> The blank lines between filenames seem to be required.
>
> cat *.xml|cheap ...
>
> also seems to work, as long as your xml files have blank lines at the
> bottom.
>
> Hope that helps.
> Rebecca
>
> Francisco Costa wrote:
>
>> Thank you for the tip.
>> I'm running cheap outside tsdb++ with the -tsdbdump option now, and I
>> like the results.
>>
>> Just one more question. At the moment I'm invoking cheap for each
>> sentence, which is time-consuming even if automated. Is it possible
>> to give a batch of sentences to cheap? What would be the sentence
>> delimiter? Or can I use the -server option? What signals the end of an
>> item in that case? I've tried running cheap with the -server and
>> -tok=xml_counts options and netcatting PIC XML to it, but it doesn't
>> seem to do anything. Does it wait for an EOF also in this case?
>>
>> Thanks again in advance,
>>
>> Francisco
>>
>> Rebecca Dridan wrote:
>>
>>> Hi Francisco,
>>>
>>> I eventually gave up trying to parse PIC input from within [incr
>>> tsdb()]. From memory, I think the problem was cheap expected EOF at
>>> the end of each item, and there didn't seem to be a way of sending
>>> that. It's possible there have been changes to PET since (I was
>>> trying this March 07), but I ended up running cheap outside [incr
>>> tsdb()] with the -tsdbdump option and copying the results in place
>>> to view with [incr tsdb()]. It's not quite as convenient, but it works.
>>>
>>> Rebecca
>>>
>
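For anyone setting up the same batch run, here is a minimal sketch of how the itemlist Rebecca describes could be generated rather than written by hand. The function name make_itemlist and the directory name pic_dir are made up for illustration. It assumes one PIC item per .xml file, and that a blank line after the last filename is harmless (the thread only says that the blank lines between filenames seem to be required).

```shell
#!/bin/sh
# Print every .xml file in the given directory (default: current directory),
# one name per line, each followed by a blank line, matching the itemlist
# layout shown in the quoted message.
make_itemlist() {
    dir=${1:-.}
    for f in "$dir"/*.xml; do
        # If no .xml files exist, the pattern stays unexpanded; skip it.
        [ -e "$f" ] || continue
        printf '%s\n\n' "$f"
    done
}

# Hypothetical usage; the remaining cheap options are elided, as in the thread:
#   make_itemlist pic_dir | cheap -tok=pic ....
```

The shell expands the glob in sorted order, so zero-padded names such as 000001.xml, 000002.xml come out in sequence.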