<div dir="ltr"><div>To answer question 2 on the wiki, instances are formally different from types.</div><div><br></div><div>After all type definitions have been given, the type hierarchy can be defined along with all its constraints (or fail to be defined, if there's a conflict somewhere). An instance is a feature structure that must follow the type hierarchy and its constraints. A type is given a name, and this becomes part of the type hierarchy. An instance is given a name, but this is for ease of reference and doesn't affect the hierarchy.<br></div><div><br></div><div>For example, in defining a type, we are free to combine any supertypes to create multiple inheritance in the type hierarchy. However, in defining an instance, we must choose which type it instantiates. The definition of an instance gives its type, not a supertype. (The type's feature structure will subsume the instance's feature structure, but both feature structures have the same type.) Now, in terms of syntax, there are two options:</div><div><br></div><div>1. An instance must be defined as instantiating one specific type (so "multiple supertype" syntax is not allowed).</div><div><br></div><div>2. An instance can be defined as instantiating multiple types, in which case the TDL processor must perform unification, either finding the glb if it exists, or failing.</div><div><br></div><div>I'm not sure there's a use case for 2. It would seem to be clearer to directly state the type of the instance, rather than leaving it implicit. In principle, option 2 would allow instantiating a glb type that isn't explicitly defined... but I don't know if anyone ever has a need for that, and if the glb is important, it would probably be clearer to define it explicitly. So I think option 1 is better.<br></div><div><br></div><div><br></div><div>For questions 3, 5, and 6:</div><div><br></div><div>Requiring types to come first seems sensible. I think everyone already does this, anyway?</div><div><br></div><div>Fixing the position of docstrings seems sensible. Is there any variation in current code?</div><div><br></div><div>Allowing extra whitespace and comments seems sensible.<br></div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">2018-07-11 2:01 GMT+01:00 Michael Wayne Goodman <span dir="ltr"><<a href="mailto:goodmami@uw.edu" target="_blank">goodmami@uw.edu</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div>I attempted to define a BNF-like description of TDL syntax on the wiki: <a href="http://moin.delph-in.net/TdlRfc" target="_blank">http://moin.delph-in.net/<wbr>TdlRfc</a></div><div>I tried to follow the partial BNF in the LKB source and often referred to the lisp code itself in order to fill out the rest of the description.<br></div><div><br></div><div>My 3 questions above are concisely repeated at the bottom of the wiki along with some others.</div><div><br></div><div>I welcome corrections and discussion (here or on the wiki) from any TDL nerds or authorities (especially if you've written a TDL parser).<br></div></div><div class="gmail_extra"><div><div class="h5"><br><div class="gmail_quote">On Mon, Jul 9, 2018 at 12:49 PM, Michael Wayne Goodman <span dir="ltr"><<a href="mailto:goodmami@uw.edu" target="_blank">goodmami@uw.edu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div>Hi developers,</div><div><br></div><div>I'm taking a closer look at the syntax of TDL files and the situation is a bit of a mess. Can anyone help me clarify some things? (I'll restrict myself to 3 questions for now)<br></div><div><br></div><div>The Copestake 2002 reference (Implementing TFS Grammars) has a BNF for TDL, but it's a bit out of date and, according to comments in the LKB source code, incorrect in parts. The LKB source comments are scattered, incomplete, inconsistent, and also a bit outdated. There is not much on the wiki. There is some discussion in the mailing list archives (much from before my time in DELPH-IN), but it's not clear how current those descriptions are.</div><div><br></div><div>Q1: Are supertypes special in a definition?</div><div><br></div><div>The BNF (in the LKB source) says this:</div><div><br></div><div> Type-def -> Type { Avm-def | Subtype-def} . | <br> Type { Avm-def | Subtype-def}.<br></div><div> Avm-def -> := Conjunction | Comment Conjunction<br> Conjunction -> Term { & Term } *<br> Term -> Type | Feature-term | Diff-list | List | Coreference<br></div><div><br></div><div>That makes it sound like I could do this:</div><div><br></div><div> mytype := [ FEAT val ] & supertype.</div><div><br></div><div>or even:</div><div><br></div><div> mytype := <! diff list.. !> & #coref & supertype.</div><div><br></div><div>But elsewhere it seems like a list of parents is special and appears before the rest of the conjunction. E.g., at read-tdl-avm-def of lingo/lkb/src/io-tdl/tdltypein<wbr>put.lsp I see this alternate definition of Avm-def:<br></div><div><br></div><div> ;;; Avm-def -> := Parents Conjunction | Parents Comment Conjunction |<br> ;;; Parents | Parents Comment<br></div><div><br></div><div>It seems that both ACE and PET are fine with putting supertypes after the feature list (and some other variations). I'm fine with this, but I wonder what it means for docstrings (see Q3 below), which (I think) are supposed to appear after the list of parents and before the feature list.<br></div><div><br></div><div><br></div><div>Q2: Subtype-def is now just a variant of Avm-def, yes?</div><div><br></div><div>The BNF still describes subtyping (with the :< operator) as only taking a single parent:</div><div><br></div><div> Subtype-def -> :< type</div><div><br></div><div>But I believe the consensus is that this is unnecessary (it's equivalent to using := with only a supertype), so :< is treated as equivalent to := (to avoid breaking backward compatibility). Is this interpretation used by all processors?</div><div><br></div><div><br></div><div>Q3: What's the final word with type comments / docstrings?</div><div><br></div><div>I find evidence of 3 proposed variants: (1) a block of ";" comments before a typename (LTDB-style); (2) a block of ";" comments within a type description; and (3) a "doc string" within a type description. Furthermore, there is a question as to whether comments or strings within a type go after the ":=" or after the list of supertypes. I think #| ... |# comments were not considered for this purpose.</div><div><br></div><div>My guess is this:</div><div><br></div><div>* LTDB-style comments (before the type identifier) are processed separately from TDL-parsing</div><div>* type-internal comments can go anywhere but are discarded</div><div>* type-internal doc strings must appear after the list of supertypes and are later available for inspection (they are included as a non-functional part of a type)</div><div><br></div><div>ACE seems happy with my assumptions, although PET doesn't seem to like doc strings at all.<br></div><div><br></div><div><br></div><div>Thanks!<span class="m_-3015204524010399632HOEnZb"><font color="#888888"><br></font></span></div><span class="m_-3015204524010399632HOEnZb"><font color="#888888"><div><br></div><div>-- <br><div class="m_-3015204524010399632m_-3591690003182791182gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr">Michael Wayne Goodman</div></div></div></div></div></div>
</div></font></span></div>
</blockquote></div><br><br clear="all"><br></div></div><span class="HOEnZb"><font color="#888888">-- <br><div class="m_-3015204524010399632gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr">Michael Wayne Goodman</div></div></div></div></div></div>
</font></span></div>
</blockquote></div><br></div>