DSpace Repository

An analysis of the Sargasso Sea resource and the consequences for database composition

Show simple item record

dc.creator Tress, Michael L.
dc.creator Cozzetto, Domenico
dc.creator Tramontano, Anna
dc.creator Valencia, Alfonso
dc.date 2007-05-08T11:22:13Z
dc.date 2007-05-08T11:22:13Z
dc.date 2006-04-19
dc.date.accessioned 2017-01-31T00:57:06Z
dc.date.available 2017-01-31T00:57:06Z
dc.identifier BMC Bioinformatics 7 (213): 2006
dc.identifier http://dx.doi.org/10.1186/1471-2105-7-213
dc.identifier 1471-2105
dc.identifier http://hdl.handle.net/10261/1399
dc.identifier 10.1186/1471-2105-7-213
dc.identifier.uri http://dspace.mediu.edu.my:8181/xmlui/handle/10261/1399
dc.description [Background] The environmental sequencing of the Sargasso Sea has introduced a huge new resource of genomic information. Unlike the protein sequences held in the current searchable databases, the Sargasso Sea sequences originate from a single marine environment and have been sequenced from species that are not easily obtainable by laboratory cultivation. The resource also contains very many fragments of whole protein sequences, a side effect of the shotgun sequencing method. These sequences form a significant addendum to the current searchable databases but also present us with some intrinsic difficulties. While it is important to know whether it is possible to assign function to these sequences with the current methods and whether they will increase our capacity to explore sequence space, it is also interesting to know how current bioinformatics techniques will deal with the new sequences in the resource.
dc.description [Results] The Sargasso Sea sequences seem to introduce a bias that decreases the potential of current methods to propose structure and function for new proteins. In particular the high proportion of sequence fragments in the resource seems to result in poor quality multiple alignments.
dc.description [Conclusion] These observations suggest that the new sequences should be used with care, especially if the information is to be used in large scale analyses. On a positive note, the results may just spark improvements in computational and experimental methods to take into account the fragments generated by environmental sequencing techniques.
dc.description This work was funded by grants from BioSapiens (LSHC-CT-2003-505265) and GENEFUN (LSHGCT- 2004-503567).
dc.description Peer reviewed
dc.language eng
dc.publisher BioMed Central
dc.relation Publisher’s version
dc.rights openAccess
dc.rights http://creativecommons.org/licenses/by/2.0/
dc.title An analysis of the Sargasso Sea resource and the consequences for database composition
dc.type Artículo


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account