[developers] Questions from Korea - Yong-hun Lee
Emily M. Bender
ebender at uw.edu
Fri Jul 8 05:12:14 CEST 2011
You can find information on setting up a MT system with
relatively small grammars here:
The lab assumes that you have a grammar you've been working
on, but it provides two grammars (for English and Italian),
which should be able to be used together for an MT system
(in either direction).
On Thu, Jul 7, 2011 at 8:08 PM, Francis Bond <bond at ieee.org> wrote:
> I am CCing this to developers, with permission, in order keep the
> information widely available.
>> I send this e-mail to ask some questions on machine translation system using the LOGON system. Since I am also a member of the project, I nowadays study how the machine translation system works in the LOGON system, using the DELPH-IN wiki. However, I have many difficulties to understand the basic mechanism of the system. Accordingly, I decided to ask some questions to you. I am very sorry, but please give me the answers for the following questions.
>>  Is it impossible to make a machine translation system without the LinGO Grammar Matrix? I know that the LinGO Grammar Matrix is helpful to the machine translation system. What I want to know is if it is possible to make a machine translation system without the elements in the LinGO Grammar Matrix.
> It is possible to make an MT system without the LinGO Grammar Matrix.
> You do not even have to have HPSG grammars on both sides.
>>  How “Rephrase” is different from “Translation” in the basic algorithm? Here, “Rephrase” means the mechanism which can generate (from the sentence “John eats an apple”) “John ate an apple”, “An apple John eats”, “An apple John ate”, and so on. Is it possible to make a “Rephrase” system without “mt.lsp”, “transfer.tdl”, “trigger.mtr”, “semi.vpm”, and so on? Which file is necessary for “Rephrase”, which file is necessary for “Translation”, and which file is necessary for both?
> Rephrase uses a small set of transfer (paraphrase) rules that go from
> one language into the same language. As far as I know, only the ERG
> uses this, and I don't know any details of how it is set up or whether
> it is supported. Dan Flickinger may know more.
> semi.vpm and trigger.mtr are used even by monolingual generation.
> mt.lsp and transfer.tdl are used by the transfer machinery (and I
> guess the "rephrase" engine).
>>  If the answer for  is Yes, how can I make a toy machine translation system? For example, can I make a toy machine translation system based on the “g7sem” grammar which is included in the LKB system? That is, can I make a toy machine translation system from “g7sem” grammar to “g7sem-trans” grammar? In the “g7sem-trans” grammar, all the ‘_rel’ in the lexical items are changed into ‘_trans’. For example, the word “the” has ‘the_rel’ in “g7sem” grammar, whereas the same word has ‘the_trans’ in “g7sem-trans” grammar. My question is if I can make a toy machine translation system which maps ‘the_rel’ in “g7sem” grammar into ‘the_trans’ in “g7sem-trans” grammar? If possible, how can I do it? Likewise, can I make a toy “Rephrase” system based on the “g7sem” grammar? If possible, how can I do it?
> I am afraid that I only know how to make a relatively heavy MT system
> such as JaEn or the EnEn paraphrasing system. Sanghoun and I (and
> others) set up a prototype KoEn system using the KRG and ERG, but it
> is still a little buggy.
>> In fact, both KRG and ERG are too big for me to understand the basic algorithms of the “Rephrase” and ”Translation” system. If there is a toy grammar which implemented the basic algorithms, it will be very helpful (to me) to understand the basic algorithms of the “Rephrase” and ”Translation” system. If you have such one, please let me know.
> I think the transfer systems produced by Emily Bender in her class may
> be more light weight, perhaps she or one of her students could help
>> I am very grateful for your reading my mail, and I want your comments to my questions.
> Sorry that I can't answer all of your questions, I hope someone else can,
> Francis Bond <http://www3.ntu.edu.sg/home/fcbond/>
> Division of Linguistics and Multilingual Studies
> Nanyang Technological University
Emily M. Bender
Department of Linguistics
Check out CLMA on facebook! http://www.facebook.com/uwclma
More information about the developers