MAO: a Multiple Alignment Ontology for nucleic acid and protein sequences.

Fiche publication


Date publication

janvier 2005

Auteurs

Membres identifiés du Cancéropôle Est :
Dr MORAS Dino, Dr POCH Olivier


Tous les auteurs :
Thompson JD, Holbrook SR, Katoh K, Koehl P, Moras D, Westhof E, Poch O

Résumé

The application of high-throughput techniques such as genomics, proteomics or transcriptomics means that vast amounts of heterogeneous data are now available in the public databases. Bioinformatics is responding to the challenge with new integrated management systems for data collection, validation and analysis. Multiple alignments of genomic and protein sequences provide an ideal environment for the integration of this mass of information. In the context of the sequence family, structural and functional data can be evaluated and propagated from known to unknown sequences. However, effective integration is being hindered by syntactic and semantic differences between the different data resources and the alignment techniques employed. One solution to this problem is the development of an ontology that systematically defines the terms used in a specific domain. Ontologies are used to share data from different resources, to automatically analyse information and to represent domain knowledge for non-experts. Here, we present MAO, a new ontology for multiple alignments of nucleic and protein sequences. MAO is designed to improve interoperation and data sharing between different alignment protocols for the construction of a high quality, reliable multiple alignment in order to facilitate knowledge extraction and the presentation of the most pertinent information to the biologist.

Référence

Nucleic Acids Res. 2005 Jul 25;33(13):4164-71. Print 2005.