[developers] no utf-8 in parse browser.

C.J. Adams-Collier cjac at colliertech.org
Fri Jan 22 14:55:21 CET 2010


Hey there Francis,

Thanks for the quick response.  I'll attach the associated files.
Daniel and I are doing Lushootseed (lut).  I'm using combining
characters rather than looking up the full character in the utf-8 pages.
I'm still kinda' new to this.

Cheers,

C.J.


On Fri, 2010-01-22 at 10:23 +0800, Francis Bond wrote: 
> G'day,
> 
> I get utf8 characters displayed (Japanese and Korean) without any
> problems.  You also seem to get some characters displayed (accented a)
> What do the sentences look like in the actual item files?  What are
> the characters that don't get displayes?
> 
> 2010/1/22 C.J. Adams-Collier <cjac at colliertech.org>:
> > Hey folks,
> >
> > It's a little difficult to view our parses in the parse browser.  The
> > characters are being escaped rather than rendered in utf-8.
> >
> > Is there anything we can do to get the utf-8 chars pushed all the way
> > through the pipeline here and displayed on the [incr tsdb()] window?
> >
> > Cheers,
> >
> > C.J.
> >
> >
> 
> 
> 

-------------- next part --------------
ʔ - glottal stop
λ̕  - lamda with comma above, right
á - lower-case a with acute
à - lower-case a with grave
í - lower-case i with acute
č - c with caron
č̓ - c with caron and comma
c̕ - c with comma above, right
ə - lower-case schwa
gʷ - g with raised lower-case w
ǰ - j with caron
kʷ - k with raised lower-case w
k̕ - k with comma above, right
k̕ʷ - k with comma above, right and raised lower-case w
ł - lower-case l with stroke
l̕ - l with comma above, right
p̕ - p with comma above, right
qʷ - q with raised lower-case w
q̕ - q with comma above, right
q̕ʷ - q with comma above, right and raised lower-case w
š - s with caron
t̕ - t with comma above, right
w̕ - w with comma above, right
xʷ - x with raised lower-case w
x̣ʷ - x with raised lower-case w and dot below
y̕ - with comma above, right
-------------- next part --------------
1 at author@@@1@{wo}@x̣ʷil̕ t-i č̓ač̓as at 1@3@'the child gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
2 at author@@@1@{wo}@t-i č̓ač̓as x̣ʷil̕@0 at 3@'The child gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
3 at author@@@1@{wo}@ʔik̕ʷi t-i č̓ač̓as t-i ʔalʔəl at 1@5@'the child sweeps the house'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
4 at author@@@1@{wo}@ʔik̕ʷi t-i ʔalʔəl t-i č̓ač̓as at 1@5@'the house sweeps the child'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
5 at author@@@1@{wo}@t-i ʔalʔəl ʔik̕ʷi t-i č̓ač̓as at 0@5@'the child sweeps the house'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
6 at author@@@1@{wo}@t-i  č̓ač̓as ʔik̕ʷi t-i ʔalʔəl at 0@5@'the child sweeps the house'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
7 at author@@@1@{wo}@t-i  č̓ač̓as t-i ʔalʔəl ʔik̕ʷi at 0@5@'the child sweeps the house'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
8 at author@@@1@{wo}@t-i ʔalʔəl t-i č̓ač̓as ʔik̕ʷi at 0@5@'the child sweeps the house'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
9 at author@@@1@{pn}@x̣ʷil̕ ʔəcá@1 at 2@'I get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
10 at author@@@1@{pn}@ʔəcá x̣ʷil̕@1 at 2@'It is I that gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
11 at author@@@1@{pn}@x̣ʷil̕ díbəł@1 at 2@'We get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
12 at author@@@1@{pn}@díbəł x̣ʷil̕@1 at 2@'It is we that get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
13 at author@@@1@{pn}@x̣ʷil̕ dəgʷí@1 at 2@'You get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
14 at author@@@1@{pn}@dəgʷí x̣ʷil̕@1 at 2@'It is you that gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
15 at author@@@1@{pn}@x̣ʷil̕ gʷəlápu at 1@2@'Y'all get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
16 at author@@@1@{pn}@gʷəlápu x̣ʷil̕@1 at 2@'It is y'all that get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
17 at author@@@1@{pn}@x̣ʷil̕ díł@1 at 2@'He/she/it/them get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
18 at author@@@1@{pn}@díł x̣ʷil̕@1 at 2@'It is he/she/it/them that get(s) lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
19 at author@@@1@{pn}@x̣ʷil̕ č-ad at 1@2@'I get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
20 at author@@@1@{pn}@x̣ʷil̕ č-ałi at 1@2@'We get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
21 at author@@@1@{pn}@x̣ʷil̕ č-axʷ@1 at 2@'You get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
22 at author@@@1@{pn}@x̣ʷil̕ č-aləp at 1@2@'Y'all get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
23 at author@@@1@{det}@x̣ʷil̕ t-i č̓ač̓as at 1@3@'the child (masc) gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
24 at author@@@1@{det}@x̣ʷil̕ t-s-i č̓ač̓as at 1@3@'the child (fem) gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
25 at author@@@1@{det}@x̣ʷil̕ č̓ač̓as at 0@2@'the child gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
26 at author@@@1@{det}@x̣ʷil̕ č̓ač̓as t-i at 0@3@'the child gets lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
27 at author@@@1@{det}@x̣ʷil̕ t-i č-ad at 0@3@'I get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
28 at author@@@1@{det}@x̣ʷil̕ t-i ʔəcá@0 at 3@'I get lost'@CJ Adams-Collier and Daniel Mills at Jan 13, 2010
-------------- next part --------------
A non-text attachment was scrubbed...
Name: make_item.pl
Type: application/x-perl
Size: 8013 bytes
Desc: not available
URL: <http://lists.delph-in.net/archives/developers/attachments/20100122/e347e265/attachment.pl>
-------------- next part --------------
Language: Lushootseed
Language code: lut
Author: CJ Adams-Collier and Daniel Mills
Date: Jan 13, 2010
Source a: snohomish_grammatical_structure.pdf

#----Word Order----#

#1
Source: author
Vetted: f
Judgment: g
Phenomena: {wo}
x̣ʷil̕ ti č̓ač̓as
x̣ʷil̕ t-i č̓ač̓as
get.lost det.def-dir child
'the child gets lost'

#2 Incorrect word order: VS instead of SV
Source: author
Vetted: f
Judgment: u
Phenomena: {wo}
ti č̓ač̓as x̣ʷil̕
t-i č̓ač̓as x̣ʷil̕
det.def-dir child get.lost
'The child gets lost'

#3
Source: author
Vetted: f
Judgment: g
Phenomena: {wo}
ʔik̕ʷi ti č̓ač̓as ti ʔalʔəl
ʔik̕ʷi t-i č̓ač̓as t-i ʔalʔəl
sweep det.def-dir child det.def-dir house 
'the child sweeps the house'

#4 Correct, but with wrong meaning: VOS instead of VSO
Source: author
Vetted: f
Judgment: g
Phenomena: {wo}
ʔik̕ʷi ti ʔalʔəl ti č̓ač̓as
ʔik̕ʷi t-i ʔalʔəl t-i č̓ač̓as
sweep det.def-dir house det.def-dir child
'the house sweeps the child'

#5 Incorrect word order: OVS instead of VSO
Source: author
Vetted: f
Judgment: u
Phenomena: {wo}
ti ʔalʔəlʔ ʔik̕ʷi ti č̓ač̓as
t-i ʔalʔəl ʔik̕ʷi t-i č̓ač̓as
det.def-dir house sweep det.def-dir child
'the child sweeps the house'

#6 Incorrect word order: SVO instead of VSO
Source: author
Vetted: f
Judgment: u
Phenomena: {wo}
ti č̓ač̓as ʔik̕ʷi ti ʔalʔəlʔ
t-i  č̓ač̓as ʔik̕ʷi t-i ʔalʔəl
det.def-dir child sweep det.def-dir house
'the child sweeps the house'

#7 Incorrect word order: SOV instead of VSO
Source: author
Vetted: f
Judgment: u
Phenomena: {wo}
ti č̓ač̓as ti ʔalʔəlʔ ʔik̕ʷi
t-i  č̓ač̓as t-i ʔalʔəl ʔik̕ʷi
det.def-dir child det.def-dir house sweep
'the child sweeps the house'

#8 Incorrect word order: OSV instead of VSO
Source: author
Vetted: f
Judgment: u
Phenomena: {wo}
ti ʔalʔəlʔ ti č̓ač̓as ʔik̕ʷi
t-i ʔalʔəl t-i č̓ač̓as ʔik̕ʷi
det.def-dir house det.def-dir child sweep
'the child sweeps the house'

#----Pronouns----#
#Note: There is not agreement morphology with pronouns, so everything in this section is grammatical

#1
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ ʔəcá
x̣ʷil̕ ʔəcá
get.lost I
'I get lost'

#2
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
ʔəcá x̣ʷil̕
ʔəcá x̣ʷil̕
I get.lost
'It is I that gets lost'

#3
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ díbəł
x̣ʷil̕ díbəł
get.lost we
'We get lost'

#4
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
díbəł x̣ʷil̕
díbəł x̣ʷil̕
We get.lost
'It is we that get lost'

#5
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ dəgʷí
x̣ʷil̕ dəgʷí
get.lost you
'You get lost'

#6
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
dəgʷí x̣ʷil̕
dəgʷí x̣ʷil̕
You get.lost
'It is you that gets lost'

#7
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ gʷəlápu
x̣ʷil̕ gʷəlápu
get.lost y'all
'Y'all get lost'

#8
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
gʷəlápu x̣ʷil̕
gʷəlápu x̣ʷil̕
Y'all get.lost
'It is y'all that get lost'

#9
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ díł
x̣ʷil̕ díł
get.lost 3rd.pro
'He/she/it/them get lost'

#10
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
díł x̣ʷil̕
díł x̣ʷil̕
Y'all get.lost
'It is he/she/it/them that get(s) lost'

#11
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ čad
x̣ʷil̕ č-ad
get.lost ind.pred-I
'I get lost'

#12
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ čałi
x̣ʷil̕ č-ałi
get.lost ind.pred-we
'We get lost'

#13
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ čaxʷ
x̣ʷil̕ č-axʷ
get.lost ind.pred-you
'You get lost'

#14
Source: author
Vetted: f
Judgment: g
Phenomena: {pn}
x̣ʷil̕ čaləp
x̣ʷil̕ č-aləp
get.lost ind.pred-yall
'Y'all get lost'


#Lushootseed does not have case, so this section is omitted#

#----The rest of the NP----#

#1
Source: author
Vetted: f
Judgment: g
Phenomena: {det}
x̣ʷil̕ ti č̓ač̓as
x̣ʷil̕ t-i č̓ač̓as
get.lost det.def-dir child
'the child (masc) gets lost'

#2
Source: author
Vetted: f
Judgment: g
Phenomena: {det}
x̣ʷil̕ tsi č̓ač̓as
x̣ʷil̕ t-s-i č̓ač̓as
get.lost det.def-FEM-dir child
'the child (fem) gets lost'

#3 Incorrect: Missing determiner
Source: author
Vetted: f
Judgment: u
Phenomena: {det}
x̣ʷil̕ č̓ač̓as
x̣ʷil̕ č̓ač̓as
get.lost child
'the child gets lost'

#4 Incorrect: Determiner should precede noun
Source: author
Vetted: f
Judgment: u
Phenomena: {det}
x̣ʷil̕ č̓ač̓as ti
x̣ʷil̕ č̓ač̓as t-i
get.lost child det.def-dir
'the child gets lost'

#5 Incorrect: Particles do not take determiners
Source: author
Vetted: f
Judgment: u
Phenomena: {det}
x̣ʷil̕ ti čad
x̣ʷil̕ t-i č-ad
get.lost det.def-dir ind.pred-I
'I get lost'

#6 Incorrect: Pronouns do not take determiners
Source: author
Vetted: f
Judgment: u
Phenomena: {det}
x̣ʷil̕ ti ʔəcá
x̣ʷil̕ t-i ʔəcá
get.lost det.def-dir I
'I get lost'
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
URL: <http://lists.delph-in.net/archives/developers/attachments/20100122/e347e265/attachment.sig>


More information about the developers mailing list