Title: Natural Language Processing in aid of FlyBase curators
Authors: Karamanis, Nikiforos
Seal, Ruth
Lewin, Ian
McQuilton, Peter
Vlachos, Andreas
Gasperin, Caroline
Drysdale, Rachel
Briscoe, Ted
Issue Date: 14-Apr-2008
Citation: BMC Bioinformatics 2008, 9:193
Abstract:
Background: Despite increasing interest in applying Natural Language Processing (NLP) to biomedical text, whether this technology can facilitate tasks such as database curation remains unclear.
Results: PaperBrowser is the first NLP-powered interface developed under a user-centered approach to improve the way in which FlyBase curators navigate an article. In this paper, we first discuss how observing curators at work informed the design and evaluation of PaperBrowser. We then present how we appraise PaperBrowser's navigational functionalities in a user-based study using a text highlighting task and evaluation criteria from Human-Computer Interaction. Our results show that PaperBrowser reduces the number of interactions between two highlighting events and therefore improves navigational efficiency by about 58% compared to the navigational mechanism that was previously available to the curators. Moreover, PaperBrowser is shown to enhance navigational utility by over 74%, irrespective of the different ways in which curators highlight text in the article.
Conclusion: We show that state-of-the-art performance in certain NLP tasks such as Named Entity Recognition and Anaphora Resolution can be combined with the navigational functionalities of PaperBrowser to support curation quite successfully.
Description: Rights: This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief, you may copy, distribute, and display the work; make derivative works; or make commercial use of the work, under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.
URI: http://www.dspace.cam.ac.uk/handle/1810/237975
http://dx.doi.org/10.1186/1471-2105-9-193
Appears in Collections: Scholarly works - Genetics

Files in This Item:

1471-2105-9-193.xml (XML, 84.7 kB)
1471-2105-9-193-S1.PDF (Adobe PDF, 66.95 kB)
1471-2105-9-193.pdf (Adobe PDF, 2.15 MB)