[developers] no utf-8 in parse browser.

C.J. Adams-Collier cjac at colliertech.org
Sat Jan 23 06:44:56 CET 2010


Aha... I think it might be the combining characters I use...

x̣ is x plus ̣

I'm thinking that the lkb doesn't like U+0232, because:

(lkb::do-parse-tty "x̣ʷil̕ ti č̓ač̓as")

No analysis found corresponding to token 0-1 X
No analysis found corresponding to token 1-2 IL
No analysis found corresponding to token 2-3 TI
No analysis found corresponding to token 3-4 Č
No analysis found corresponding to token 4-5 AČ
No analysis found corresponding to token 5-6 AS
No parses found

it doesn't mention anything about X̣ or x̣

Any idea how we can fix this?  Or perhaps I'm still wrong ;)

On Fri, 2010-01-22 at 15:34 -0800, C.J. Adams-Collier wrote: 
> This might also be of interest...
> 
> Old-style lkb windows don't render the unicode, but the metacity window
> title is rendered correctly
> 
> [incr tsdb()] results browser windows don't render the unicode
> 
> New-style (lui) lkb windows render the unicode correctly
> 
> You can see unicode rendering in the gnome-terminal emacs window and in
> the character map tool.
> 
> 
> 
> 
> On Fri, 2010-01-22 at 06:07 -0800, C.J. Adams-Collier wrote: 
> > Hey there Stephan,
> > 
> > I used the auto-install script from the delph-in wiki:
> > 
> > http://wiki.delph-in.net/moin/LkbInstallation#head-5ec0b1f7b5bc196bd756c0a2028f4acba96ec9ac
> > 
> > All of my LANG vars are set to en_US.UTF-8 and I've run locale-gen
> > en_US.UTF-8 as well.  All other characters look fine in X,
> > gnome-terminal, gedit, evolution, etc.:
> > 
> > $ ls -l /home/cjac/delphin/Archives/*.tgz
> > -rw-r--r-- 1 cjac cjac   232166 2006-01-23 10:50 /home/cjac/delphin/Archives/eli.tgz
> > -rw-r--r-- 1 cjac cjac 34733534 2009-04-27 07:27 /home/cjac/delphin/Archives/erg.tgz
> > -rw-r--r-- 1 cjac cjac    36562 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_capi.tgz
> > -rw-r--r-- 1 cjac cjac  1645517 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_data.tgz
> > -rw-r--r-- 1 cjac cjac   550424 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_documentation.tgz
> > -rw-r--r-- 1 cjac cjac  8196618 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_libraries.tgz
> > -rw-r--r-- 1 cjac cjac  2711095 2009-04-27 07:46 /home/cjac/delphin/Archives/itsdb_linux.x86.64.tgz
> > -rw-r--r-- 1 cjac cjac   390042 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_source.tgz
> > -rw-r--r-- 1 cjac cjac   486396 2009-04-27 07:27 /home/cjac/delphin/Archives/itsdb_tsdb.tgz
> > -rw-r--r-- 1 cjac cjac   266620 2009-04-27 07:23 /home/cjac/delphin/Archives/lkb_data.tgz
> > -rw-r--r-- 1 cjac cjac 13977191 2009-04-27 07:46 /home/cjac/delphin/Archives/lkb_linux.x86.64.tgz
> > -rw-r--r-- 1 cjac cjac  1284070 2009-04-27 07:23 /home/cjac/delphin/Archives/lkb_source.tgz
> > $ sudo locale-gen en_US.UTF-8
> > Generating locales...
> >   en_US.UTF-8... up-to-date
> > Generation complete.
> > $ set | grep en_US.UTF-8
> > GDM_LANG=en_US.UTF-8
> > LANG=en_US.UTF-8
> > LANGUAGE=en_US.UTF-8
> > LC_ALL=en_US.UTF-8
> > $ lsb_release -a
> > No LSB modules are available.
> > Distributor ID: Ubuntu
> > Description: Ubuntu 9.10
> > Release: 9.10
> > Codename: karmic
> > 
> > 
> > On Fri, 2010-01-22 at 10:27 +0100, Stephan Oepen wrote: 
> > > hi c.j.,
> > > 
> > > this looks rather weird, which language is this?  and which version of
> > > [incr tsdb()] are you using, i.e. how did you obtain and install it?  i
> > > would suggest you make sure LANG is set to a UTF-8 locale, ideally even
> > > for the X server (i.e. system-wide, depending on how you start X).  in
> > > my experience, on a system generally enabled for UniCode, there rarely
> > > are display issues in [incr tsdb()]; this is mainly a standard Tk 8.4.
> > > 
> > >                                                            best  -  oe
> > > 
> > > +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > > +++ Universitetet i Oslo (IFI); Boks 1080 Blindern; 0316 Oslo; (+47) 2284 0125
> > > +++    --- oe at ifi.uio.no; stephan at oepen.net; http://www.emmtee.net/oe/ ---
> > > +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > 
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
URL: <http://lists.delph-in.net/archives/developers/attachments/20100122/bbbf0f3d/attachment.sig>


More information about the developers mailing list