[sdp] Final Call for Participation: CoNLL 2020 Shared Task on Cross-Framework Meaning Representation Parsing

Stephan Oepen oe at ifi.uio.no
Tue Jun 23 22:57:42 CEST 2020


[with apologies for cross-posting]

We are excited to invite participants to the Shared Task at the 2020
Conference on Computational Natural Language Learning (CoNLL):

  Cross-Framework Meaning Representation Parsing (MRP 2020)

For background on the nature of the task and its schedule, please see:

  http://mrp.nlpl.eu

A sample of sentences annotated with MRP graphs in five frameworks:

  http://svn.nlpl.eu/mrp/2020/public/sample.tgz

Any potentially interested parties, please sign up for future updates:

  http://lists.nlpl.eu/mailman/listinfo/mrp-users


OBJECTIVES

The goal of the task is to advance data-driven parsing into
graph-structured representations of sentence meaning.  All things
semantic have received heightened attention in recent years, and
despite remarkable advances in vector-based (continuous and
distributed) encodings of meaning, ‘classic’ (discrete and
hierarchically structured) semantic representations will continue to
play an important role in ‘making sense’ of natural language.  While
parsing has long been dominated by tree-structured target
representations, there is now growing interest in general graphs as
more expressive and arguably more adequate target structures.

For the first time, this task combines formally and linguistically
different approaches to meaning representation in graph form in a
uniform training and evaluation setup.  Participants are invited to
develop parsing systems that support up to five distinct semantic
graph frameworks, all of which encode core predicate–argument
structure, among other things.  Training and evaluation data are
provided for all five frameworks.  Participants are asked to design
and train a system that
predicts sentence-level meaning representations in multiple (and
potentially all) frameworks in parallel.  Architectures that utilize
complementary knowledge sources (e.g. via parameter sharing and
multi-task learning) are encouraged (though not required).  Learning
from multiple flavors of meaning representation in tandem has hardly
been explored.
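
One very simple form of such parameter sharing, namely a shared
encoder with framework-specific output layers, might be sketched in
PyTorch as below.  This is only a schematic token-labeling sketch
(not a graph parser), and all names and dimensions are invented for
illustration; it merely shows how one network could serve several
frameworks at once.

  import torch.nn as nn

  class SharedEncoderModel(nn.Module):
      # Illustrative only: one encoder shared by all frameworks, plus a
      # separate (framework-specific) classification head per framework.
      def __init__(self, vocab_size, hidden, label_counts):
          super().__init__()
          self.embed = nn.Embedding(vocab_size, hidden)
          self.encoder = nn.LSTM(hidden, hidden, batch_first=True,
                                 bidirectional=True)
          self.heads = nn.ModuleDict({
              framework: nn.Linear(2 * hidden, n_labels)
              for framework, n_labels in label_counts.items()})

      def forward(self, token_ids, framework):
          # token_ids: (batch, length) tensor of token indices.
          states, _ = self.encoder(self.embed(token_ids))
          return self.heads[framework](states)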

The task seeks to reduce framework-specific ‘balkanization’ in the
field of meaning representation parsing.  Expected outcomes include
(a) a unifying formal model over different semantic graph banks, (b)
uniform representations and framework-agnostic scoring, (c) systematic
contrastive evaluation across frameworks, and (d) increased
cross-fertilization via transfer and multi-task learning.  We hope to
engage the combined community of parser developers for
graph-structured output representations, including participants in six
prior framework-specific tasks at the Semantic Evaluation (SemEval)
exercises between 2014 and 2019.  Owing to the scarcity of semantic
annotations across
frameworks, the shared task is organized into two tracks: (a)
cross-framework MRP, regrettably limited to English for the time
being, and (b) cross-lingual MRP, with one additional language for
four distinct frameworks.


FRAMEWORKS

The task combines five frameworks for graph-based meaning
representation, each with its specific formal and linguistic
assumptions.

+ Abstract Meaning Representation (Banarescu et al., 2013)
+ Discourse Representation Graphs (Bos et al., 2017)
+ Elementary Dependency Structures (Oepen & Lønning, 2006)
+ Prague Tectogrammatical Graphs (Hajič et al., 2012)
+ Universal Conceptual Cognitive Annotation (Abend & Rappoport, 2013)

For the shared task, we have repackaged different graph banks into a
uniform and normalized abstract representation with a common
serialization format (in JSON).  Training data comprising semantic
graphs over a total of some 3.5 million tokens in running English text
is now available to participants; additional, cross-lingual data
comprises gold-standard meaning representation graphs in Chinese,
Czech, and German for four of the five frameworks.  High-quality
tokenization, PoS tagging, lemmatization, and Universal Dependency
parse trees are provided as an optional ‘companion’ resource.  For all
frameworks, both in- and out-of-domain evaluation data will be
provided in the same unified format.
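
To give a concrete impression of the unified serialization, the
minimal Python sketch below reads one graph per line from an MRP file
and prints a brief summary.  The field names used here ('id',
'framework', 'nodes', and 'edges') follow the published MRP format,
but the file name is only a placeholder; please verify all details
against the released data and its documentation.

  import json

  def load_mrp_graphs(path):
      # One JSON object per line, each encoding a single semantic graph
      # with (among others) 'id', 'framework', 'input', 'tops', 'nodes',
      # and 'edges' fields.
      with open(path, encoding="utf-8") as stream:
          for line in stream:
              line = line.strip()
              if line:
                  yield json.loads(line)

  # 'sample.mrp' is a placeholder; substitute a file from the released
  # training or sample data.
  for graph in load_mrp_graphs("sample.mrp"):
      print(graph.get("id"), graph.get("framework"),
            len(graph.get("nodes", [])), len(graph.get("edges", [])))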


SCHEDULE

+ June 22, 2020: All Training and Companion Data Available
+ July 20–August 3, 2020: Evaluation Period (Held-Out Data)
+ September 7, 2020: Submission of System Descriptions
+ November 19–20, 2020: Presentation of Results at CoNLL


EVALUATION

For each of the individual frameworks, there are common ways of
evaluating the quality of parser outputs in terms of graph similarity
to gold-standard target representations.  There is broad similarity
between the framework-specific evaluation metrics used to date,
although there are some subtle differences too.  In a nutshell,
meaning representation parsing is commonly evaluated in terms of a
graph similarity F1 score at the level of individual node–edge–node
triples, i.e. ‘atomic’ dependencies.
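
As a purely illustrative sketch, assuming that gold and system graphs
have already been reduced to sets of labeled (source, label, target)
triples with aligned node identities (which the actual evaluation does
not presuppose), per-dependency precision, recall, and F1 can be
computed as follows:

  def labeled_f1(gold, system):
      # 'gold' and 'system' are sets of (source, label, target) triples.
      correct = len(gold & system)
      precision = correct / len(system) if system else 0.0
      recall = correct / len(gold) if gold else 0.0
      if precision + recall == 0.0:
          return precision, recall, 0.0
      return precision, recall, 2 * precision * recall / (precision + recall)

  # Toy triples, invented for illustration only.
  gold = {(1, 'ARG1', 2), (1, 'ARG2', 3), (3, 'BV', 4)}
  system = {(1, 'ARG1', 2), (3, 'BV', 4), (3, 'ARG1', 5)}
  print(labeled_f1(gold, system))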

For the shared task, we provide a (relatively straightforward)
generalization of existing, framework-specific metrics that (a) is
applicable across different flavors of semantic graphs, (b)
distinguishes separate ‘types’ of information, (c) does not require
matching node anchoring in the underlying string, but (d) takes
advantage of node ordering when available.  Labeled per-dependency
scores, macro-averaged across all frameworks, will be the official
metric for the task; but we will also provide per-framework rankings,
additional cross-framework evaluation perspectives, as well as scoring
in established framework-specific metrics.
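
Because the official score is a macro-average over frameworks, the
ranking computation itself is trivial; the sketch below (with invented
scores and framework abbreviations, purely for illustration) shows the
unweighted mean of per-framework F1 values:

  def macro_average(per_framework_f1):
      # Unweighted mean of per-framework F1 scores.
      if not per_framework_f1:
          return 0.0
      return sum(per_framework_f1.values()) / len(per_framework_f1)

  # Invented scores, for illustration only.
  print(macro_average({'amr': 0.71, 'drg': 0.80, 'eds': 0.88,
                       'ptg': 0.84, 'ucca': 0.75}))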


INVOLVEMENT

The shared task comprises multiple meaning representation frameworks
and languages.  Partial submissions, however, will be very welcome
(even for just one framework and a single language), with or without
cross-framework transfer learning.  We expect that the parsing problem
defined by the shared task for each framework will provide a stable
benchmark for at least a few years to come; thus, we hope to
document and advance the current state of the art in meaning
representation parsing.  We invite all interested parties to
self-subscribe to the mailing list for this task; the subscription
link and access information for the training data are available from
the task web site:

  http://mrp.nlpl.eu

Please do not hesitate to contact the task organizers for questions or
clarifications, using the joint email address provided on the task web
pages.  And stay safe and healthy!


Omri Abend, Lasha Abzianidze, Johan Bos, Jan Hajič,
Daniel Hershcovich, Bin Li, Stephan Oepen,
Tim O'Gorman, Nianwen Xue, and Dan Zeman


