[developers] LKB splitting on apostrophe

Emily M. Bender ebender at uw.edu
Mon Jan 23 00:06:23 CET 2012


Thanks, Stephan.  We will look into this.
Emily

On Sun, Jan 22, 2012 at 12:03 AM, Stephan Oepen <oe at ifi.uio.no> wrote:
> i would be surprised if the LKB had changed in this respect in recent years?  anyway, the REPP pre-processing language is stable (see ReppTop on the wiki) and supported in any self-respecting system (LKB, PET, ACE, agree).  so i would maybe start having Matrix-derived grammars include a vanilla ‘tokenizer.rpp’.  this way, things will be more transparent, and students (who need to) have full control over string-level pre-processing.
>
> best, oe
>
>
>
> On 22. jan. 2012, at 03:41, "Emily M. Bender" <ebender at uw.edu> wrote:
>
>> Dear all,
>>
>> Students in my class this term are reporting that the LKB is splitting
>> words on the character ', even when it's not in the value of
>> *punctuation-characters*.  Any idea why this might be?  Anything
>> we can do about it?
>>
>> Thanks,
>> Emily
>>
>> --
>> Emily M. Bender
>> Associate Professor
>> Department of Linguistics
>> Check out CLMS on facebook! http://www.facebook.com/uwclma



-- 
Emily M. Bender
Associate Professor
Department of Linguistics
Check out CLMS on facebook! http://www.facebook.com/uwclma




More information about the developers mailing list