[developers] ACL Anthology Searchbench

Ulrich Schaefer ulrich.schaefer at dfki.de
Thu Jul 14 21:48:50 CEST 2011

with apologies for cross-posting --

Dear all,

the ACL Anthology Searchbench at

is now also reachable from the ACL Anthology start page 
aclweb.org/anthology .

The Searchbench combines semantic, full text and bibliographic search
in more than 19,000 Computational Linguistics papers of the ACL
Anthology from the past 47 years, including the complete Journal,
-- parsed with the ERG and DELPH-IN tools!

Highlights are

- "statements" search: you can search for subject-predicate-object
   triples in millions of sentences, where predicates can also be
   synonyms, and taking passives and sentence negation into account

- combination with bibliographic and full text filters

- autosuggest search fields, faceted search

- search result (filter) URLs can be bookmarked or emailed

- display of search result sentences in original PDF layout.
   This requires the Adobe Acrobat Reader browser plug-in with
   Preferences/Search/"external highlight server" enabled and doesn't
   work well on older, scanned papers (page should always be correct).

The Searchbench itself requires a recent web browser with JavaScript
enabled, for details see http://aclasb.dfki.de/help.html .

The Searchbench is not perfect -- it is a milestone in an ongoing
research project (TAKE).  There was no manual correction of OCR or NLP
errors.  Missing author affiliation data of 2010 and 2011 papers will
be added later.

However, we hope you find it a useful tool also for your scientific
work.  Your feedback is welcome ("Feedback" button at left bottom)!

-- The TAKE Searchbench team Ulrich Schäfer, Bernd Kiefer, Christian
Spurk, Jörg Steffen and Rui Wang

   ...with thanks to all others who have contributed to this endeavor
   (see "About" at left bottom, also contains a link to the ACL paper
   describing the Searchbench internals).

The Searchbench has been developed in the context of the BMBF-funded
project TAKE, the DFG Cluster of Excellence on Multimodal Computing
and Interaction (MMCI) and the international DELPH-IN collaboration.

Dr. Ulrich Schäfer http://dfki.de/~uschaefer phone:+49681857755154
     DFKI Language Technology Lab, D-66123 Saarbruecken, Germany
    Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH
      Trippstadter Strasse 122, D-67663 Kaiserslautern, Germany
    Geschaeftsfuehrung: Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster
(Vorsitzender), Dr. Walter Olthoff. Vorsitzender des Aufsichtsrats:
Prof. Dr. h.c. Hans A. Aukes. Amtsgericht Kaiserslautern, HRB 2313

More information about the developers mailing list