Title: Biomedical event extraction from abstracts and full papers using search-based structured prediction
Authors: Vlachos, Andreas
Craven, Mark
Issue Date: 26-Jun-2012
Abstract: Abstract Background Biomedical event extraction has attracted substantial attention as it can assist researchers in understanding the plethora of interactions among genes that are described in publications in molecular biology. While most recent work has focused on abstracts, the BioNLP 2011 shared task evaluated the submitted systems on both abstracts and full papers. In this article, we describe our submission to the shared task which decomposes event extraction into a set of classification tasks that can be learned either independently or jointly using the search-based structured prediction framework. Our intention is to explore how these two learning paradigms compare in the context of the shared task. Results We report that models learned using search-based structured prediction exceed the accuracy of independently learned classifiers by 8.3 points in F-score, with the gains being more pronounced on the more complex Regulation events (13.23 points). Furthermore, we show how the trade-off between recall and precision can be adjusted in both learning paradigms and that search-based structured prediction achieves better recall at all precision points. Finally, we report on experiments with a simple domain-adaptation method, resulting in the second-best performance achieved by a single system. Conclusions We demonstrate that joint inference using the search-based structured prediction framework can achieve better performance than independently learned classifiers, thus demonstrating the potential of this learning paradigm for event extraction and other similarly complex information-extraction tasks.
Description: RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.
URI: http://www.dspace.cam.ac.uk/handle/1810/243406
Other Identifiers: http://dx.doi.org/10.1186/1471-2105-13-S11-S5
Appears in Collections:Scholarly works - Computer Laboratory

Files in This Item:

File Description SizeFormat
1471-2105-13-S11-S5.xml95.62 kBXMLView/Open
1471-2105-13-S11-S5.pdf710.03 kBAdobe PDFThumbnail
View/Open
Additional resources for this item
search for alternative versions in eresources@cambridge
retrieve citation metadata in EndNote format

This item has been accessed 407 times.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.