[itsdb] [lkb] the fine system and unicode
Stephan Oepen
oe at csli.Stanford.EDU
Fri Feb 17 18:29:42 CET 2006
hi stefan,
> But if this is possible, may be the fine system could check for
> itselves. Then it would be possible to work with various test suite
> data bases in different encodings without having to change the locale
> before looking at another one. I put together a CD rom with a Chinese
> and a German grammar. At the current stage one either gets Umlaute or
> Chinese characters.
i am assuming you are among the people who mean [incr tsdb()] when they
say `the fine system'? if so, then i am sad to report that there is no
support for variation of profile encoding within the same [incr tsdb()]
session. i have a script to convert a complete profile tree to UTF-8,
though.
in general, for DELPH-IN and our LOGON project here in norway, we have
moved to strongly encouraging UTF-8 (even for german or norwegian where
the local tradition used to be ISO-8859-1). maybe that would be a good
option for your CD-ROM universe too? more and more Linux distributions
have long made UTF-8 the default.
all best - oe
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
+++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; (+47) 2285 7989
+++ CSLI Stanford; Ventura Hall; Stanford, CA 94305; (+1 650) 723 0515
+++ --- oe at csli.stanford.edu; oe at ifi.uio.no; stephan at oepen.net ---
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
More information about the itsdb
mailing list