[itsdb] [lkb] the fine system and unicode

Stephan Oepen oe at csli.Stanford.EDU
Fri Feb 17 18:29:42 CET 2006


hi stefan,

> But if this is possible, may be the fine system could check for
> itselves. Then it would be possible to work with various test suite
> data bases in different encodings without having to change the locale
> before looking at another one. I put together a CD rom with a Chinese
> and a German grammar. At the current stage one either gets Umlaute or
> Chinese characters.

i am assuming you are among the people who mean [incr tsdb()] when they
say `the fine system'?  if so, then i am sad to report that there is no
support for variation of profile encoding within the same [incr tsdb()]
session.  i have a script to convert a complete profile tree to UTF-8,
though.

in general, for DELPH-IN and our LOGON project here in norway, we have
moved to strongly encouraging UTF-8 (even for german or norwegian where
the local tradition used to be ISO-8859-1).  maybe that would be a good
option for your CD-ROM universe too?  more and more Linux distributions
have long made UTF-8 the default.

                                                       all best  -  oe

+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
+++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; (+47) 2285 7989
+++     CSLI Stanford; Ventura Hall; Stanford, CA 94305; (+1 650) 723 0515
+++       --- oe at csli.stanford.edu; oe at ifi.uio.no; stephan at oepen.net ---
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++



More information about the itsdb mailing list