[developers] no utf-8 in parse browser.
C.J. Adams-Collier
cjac at colliertech.org
Sat Jan 23 06:44:56 CET 2010
Aha... I think it might be the combining characters I use...
x̣ is x plus ̣
I'm thinking that the lkb doesn't like U+0232, because:
(lkb::do-parse-tty "x̣ʷil̕ ti č̓ač̓as")
No analysis found corresponding to token 0-1 X
No analysis found corresponding to token 1-2 IL
No analysis found corresponding to token 2-3 TI
No analysis found corresponding to token 3-4 Č
No analysis found corresponding to token 4-5 AČ
No analysis found corresponding to token 5-6 AS
No parses found
it doesn't mention anything about X̣ or x̣
Any idea how we can fix this? Or perhaps I'm still wrong ;)
On Fri, 2010-01-22 at 15:34 -0800, C.J. Adams-Collier wrote:
> This might also be of interest...
>
> Old-style lkb windows don't render the unicode, but the metacity window
> title is rendered correctly
>
> [incr tsdb()] results browser windows don't render the unicode
>
> New-style (lui) lkb windows render the unicode correctly
>
> You can see unicode rendering in the gnome-terminal emacs window and in
> the character map tool.
>
>
>
>
> On Fri, 2010-01-22 at 06:07 -0800, C.J. Adams-Collier wrote:
> > Hey there Stephan,
> >
> > I used the auto-install script from the delph-in wiki:
> >
> > http://wiki.delph-in.net/moin/LkbInstallation#head-5ec0b1f7b5bc196bd756c0a2028f4acba96ec9ac
> >
> > All of my LANG vars are set to en_US.UTF-8 and I've run locale-gen
> > en_US.UTF-8 as well. All other characters look fine in X,
> > gnome-terminal, gedit, evolution, etc.:
> >
> > $ ls -l /home/cjac/delphin/Archives/*.tgz
> > -rw-r--r-- 1 cjac cjac 232166 2006-01-23 10:50 /home/cjac/delphin/Archives/eli.tgz
> > -rw-r--r-- 1 cjac cjac 34733534 2009-04-27 07:27 /home/cjac/delphin/Archives/erg.tgz
> > -rw-r--r-- 1 cjac cjac 36562 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_capi.tgz
> > -rw-r--r-- 1 cjac cjac 1645517 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_data.tgz
> > -rw-r--r-- 1 cjac cjac 550424 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_documentation.tgz
> > -rw-r--r-- 1 cjac cjac 8196618 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_libraries.tgz
> > -rw-r--r-- 1 cjac cjac 2711095 2009-04-27 07:46 /home/cjac/delphin/Archives/itsdb_linux.x86.64.tgz
> > -rw-r--r-- 1 cjac cjac 390042 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_source.tgz
> > -rw-r--r-- 1 cjac cjac 486396 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_tsdb.tgz
> > -rw-r--r-- 1 cjac cjac 266620 2009-04-27 07:23 /home/cjac/delphin/Archives/lkb_data.tgz
> > -rw-r--r-- 1 cjac cjac 13977191 2009-04-27 07:46 /home/cjac/delphin/Archives/lkb_linux.x86.64.tgz
> > -rw-r--r-- 1 cjac cjac 1284070 2009-04-27 07:23 /home/cjac/delphin/Archives/lkb_source.tgz
> > $ sudo locale-gen en_US.UTF-8
> > Generating locales...
> > en_US.UTF-8... up-to-date
> > Generation complete.
> > $ set | grep en_US.UTF-8
> > GDM_LANG=en_US.UTF-8
> > LANG=en_US.UTF-8
> > LANGUAGE=en_US.UTF-8
> > LC_ALL=en_US.UTF-8
> > $ lsb_release -a
> > No LSB modules are available.
> > Distributor ID: Ubuntu
> > Description: Ubuntu 9.10
> > Release: 9.10
> > Codename: karmic
> >
> >
> > On Fri, 2010-01-22 at 10:27 +0100, Stephan Oepen wrote:
> > > hi c.j.,
> > >
> > > this looks rather weird, which language is this? and which version of
> > > [incr tsdb()] are you using, i.e. how did you obtain and install it? i
> > > would suggest you make sure LANG is set to a UTF-8 locale, ideally even
> > > for the X server (i.e. system-wide, depending on how you start X). in
> > > my experience, on a system generally enabled for UniCode, there rarely
> > > are display issues in [incr tsdb()]; this is mainly a standard Tk 8.4.
> > >
> > > best - oe
> > >
> > > +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > > +++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; (+47) 2284 0125
> > > +++ --- oe at ifi.uio.no; stephan at oepen.net; http://www.emmtee.net/oe/ ---
> > > +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
URL: <http://lists.delph-in.net/archives/developers/attachments/20100122/bbbf0f3d/attachment.sig>
More information about the developers
mailing list