[developers] Malformed RMRS XML output from an ugly but valid PIC:

Andrew MacKinlay admackin at gmail.com
Tue Nov 17 16:15:48 CET 2009


Using the chart mapping Pet, I've got a particular PIC which has a  
whole lot of problems with it but is nonetheless valid input AFAICT.

In the output RMRS, I get the following:

<ep cfrom='83' cto='130'><realpred lemma='`/usr/portage/distfiles/ 
cdemu-0.6' pos='beta.tar.bz2'' sense='jj'/><label vid='7'/><anchor  
vid='10040'/><var sort='e' vid='89' sf='prop'/></ep>

As you can see, the POS and sense are incorrect and the POS includes  
an unescaped quote that will break XML parsers.

The corresponding 'w' element in the PIC is:
     <w cend="130" constant="no" cstart="83" id="W013">
         <surface>`/usr/portage/distfiles/cdemu-0.6_beta.tar.bz2'</ 
surface>
         <pos prio="1.0" tag="JJ"/>
     </w>

Any thoughts as to why this is occurring? Looks like some combination  
of the '_' and the quotes in the surface string to me.

I've attached the full RMRS and PIC XML if anyone is interested.

Thanks,
Andy.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: broken-rmrs-pic.xml
Type: text/xml
Size: 2315 bytes
Desc: not available
URL: <http://lists.delph-in.net/archives/developers/attachments/20091117/1e43b3bd/attachment.xml>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: broken-rmrs.xml
Type: text/xml
Size: 12550 bytes
Desc: not available
URL: <http://lists.delph-in.net/archives/developers/attachments/20091117/1e43b3bd/attachment-0001.xml>


More information about the developers mailing list