[developers] Question about using PET and the Pet Input Chart with tsdb++

Francisco Costa fcosta at di.fc.ul.pt
Fri Jul 18 17:58:24 CEST 2008


That's exactly what I needed. Thank you.

Francisco
Rebecca Dridan wrote:
> I give batches of sentences to cheap by giving it a file containing  
> filenames of PIC files, eg:
> 
> cat itemlist | cheap -tok=pic ....
> 
> where itemlist looks like:
> 
> 000001.xml
> 
> 000002.xml
> 
> 000003.xml
> 
> ..
> 
> The blank lines between filenames seems to be required.
> 
> cat *.xml|cheap ...
> 
> also seems to work, as long as your xml files have blank lines at the 
> bottom.
> 
> Hope that helps.
> Rebecca
> 
> Francisco Costa wrote:
> 
>> Thank you for the tip.
>> I'm running cheap outside tsdb++ with the -tsdbdump option now, and I 
>> like the results.
>>
>> Just one more question. At the moment I'm invoking cheap for each 
>> sentence, which is time consuming even if automatized. Is is possible 
>> to give a batch of sentences to cheap? What would be the sentence 
>> delimiter? Or can I use the -server option? What signals the end of an 
>> item in that case? I've tried running cheap with the -server and 
>> -tok=xml_counts options and netcatting PIC XML to it, but it doesn't 
>> seem to do anything. Does it wait for an EOF also in this case?
>>
>> Thanks again in advance,
>>
>> Francisco
>>
>> Rebecca Dridan wrote:
>>
>>> Hi Francisco,
>>>
>>> I eventually gave up trying to parse PIC input from within [incr 
>>> tsdb()].  From memory, I think the problem was cheap expected EOF at 
>>> the end of each item, and there didn't seem to be a way of sending 
>>> that. It's possible there have been changes to PET since (I was 
>>> trying this March 07), but I ended up running cheap outside [incr 
>>> tsdb()]  with the -tsdbdump option and copying the results in place 
>>> to view with [incr tsdb()].  It's not quite as convenient, but it works.
>>>
>>> Rebecca
>>>
> 



More information about the developers mailing list