A sentence sliding window approach to extract protein annotations from biomedical articles

DSpace Home
→
Harvested articles مقالات مستوردة من مؤسسات وجامعات عالمية
→
Digital Csic
→
View Item

dc.creator	Krallinger, Martin
dc.creator	Padron, Maria
dc.creator	Valencia, Alfonso
dc.date	2007-05-08T15:52:46Z
dc.date	2007-05-08T15:52:46Z
dc.date	2005-05-24
dc.date.accessioned	2017-01-31T00:57:12Z
dc.date.available	2017-01-31T00:57:12Z
dc.identifier	BMC Bioinformatics 2005, 6(Suppl 1):S19
dc.identifier	1471-2105
dc.identifier	http://hdl.handle.net/10261/1439
dc.identifier	10.1186/1471-2105-6-S1-S19
dc.identifier.uri	http://dspace.mediu.edu.my:8181/xmlui/handle/10261/1439
dc.description	From A critical assessment of text mining methods in molecular biology
dc.description	[Background] Within the emerging field of text mining and statistical natural language processing (NLP) applied to biomedical articles, a broad variety of techniques have been developed during the past years. Nevertheless, there is still a great ned of comparative assessment of the performance of the proposed methods and the development of common evaluation criteria. This issue was addressed by the Critical Assessment of Text Mining Methods in Molecular Biology (BioCreative) contest. The aim of this contest was to assess the performance of text mining systems applied to biomedical texts including tools which recognize named entities such as genes and proteins, and tools which automatically extract protein annotations.
dc.description	[Results] The "sentence sliding window" approach proposed here was found to efficiently extract text fragments from full text articles containing annotations on proteins, providing the highest number of correctly predicted annotations. Moreover, the number of correct extractions of individual entities (i.e. proteins and GO terms) involved in the relationships used for the annotations was significantly higher than the correct extractions of the complete annotations (protein-function relations).
dc.description	[Conclusion] We explored the use of averaging sentence sliding windows for information extraction, especially in a context where conventional training data is unavailable. The combination of our approach with more refined statistical estimators and machine learning techniques might be a way to improve annotation extraction for future biomedical text mining applications.
dc.description	This work was sponsored by DOC, the doctoral scholarship programme of the Austrian Academy of Sciences and the ORIEL (IST-2001-32688) and TEMBLOR (QLRT-2001-00015) projects.
dc.description	Peer reviewed
dc.language	eng
dc.publisher	BioMed Central
dc.relation	Publisher’s version
dc.rights	openAccess
dc.title	A sentence sliding window approach to extract protein annotations from biomedical articles
dc.type	Artículo

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Digital Csic

Show simple item record

Search DSpace

Advanced Search

Browse

All of DSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

A sentence sliding window approach to extract protein annotations from biomedical articles

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection

My Account