Exon analysis with CAbase : a novel Carbonic Anhydrases database tool.
Zuniga Norman, Sebastian (2020-12-21)
Exon analysis with CAbase : a novel Carbonic Anhydrases database tool.
Zuniga Norman, Sebastian
(21.12.2020)
Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.
avoin
Julkaisun pysyvä osoite on:
https://urn.fi/URN:NBN:fi-fe20201223102869
https://urn.fi/URN:NBN:fi-fe20201223102869
Tiivistelmä
The Carbonic Anhydrases are a superfamily of proteins, a type of metalloenzyme that are found ubiquitously throughout the most diverse clades in the tree of life. Since first identified as a catalyst for the reversible hydration and dehydration of carbon dioxide almost 90 years ago, the interest in research for its pharmacological and industrial applications is constantly growing and counting with tools like the one here discussed becomes prevailing.
When assembly errors or gaps are in exonic regions, misses or misplacing exons occur and it is because of these errors that approaches like homology-based gene models fail due to the lack of correct data. Instead, a similar length of poorly matching sequence may be predicted. Performing exon analysis for the carbonic anhydrases in the context of CAbase aims to identify the mis predicted exons and refers to a set of actions that are possible to perform over them, be they computationally or humanly conducted. Analysis like the comparison of any relevant exon to a desired prototype, the curated addition of labels to them, indicating different important features, or simply, a visual guide to understand their distribution in transcripts or genes of interest.
CAbase is a web-based Third Party Annotation System (TPA). A toolkit that allows to browse, annotate and perform analysis over sets of data related to the Carbonic Anhydrases protein family. CAbase is an effort to centralise, normalise and analyse information and annotations over the CA family.
To date, 47 CAs (14 Human) genes have been incorporated and currently, more than 50 thousand exons are possible to be characterized based on their splicing site as well as analyse their predicted translations and alignment to the precursor homolog.
CAbase continues to be developed, more improvements are included and in conjunction with the usage by the experts in the field of CAs, this inferal type of TPA, could become a central tool in the CA research community, creating better consensus data on this ever-growing protein family.
When assembly errors or gaps are in exonic regions, misses or misplacing exons occur and it is because of these errors that approaches like homology-based gene models fail due to the lack of correct data. Instead, a similar length of poorly matching sequence may be predicted. Performing exon analysis for the carbonic anhydrases in the context of CAbase aims to identify the mis predicted exons and refers to a set of actions that are possible to perform over them, be they computationally or humanly conducted. Analysis like the comparison of any relevant exon to a desired prototype, the curated addition of labels to them, indicating different important features, or simply, a visual guide to understand their distribution in transcripts or genes of interest.
CAbase is a web-based Third Party Annotation System (TPA). A toolkit that allows to browse, annotate and perform analysis over sets of data related to the Carbonic Anhydrases protein family. CAbase is an effort to centralise, normalise and analyse information and annotations over the CA family.
To date, 47 CAs (14 Human) genes have been incorporated and currently, more than 50 thousand exons are possible to be characterized based on their splicing site as well as analyse their predicted translations and alignment to the precursor homolog.
CAbase continues to be developed, more improvements are included and in conjunction with the usage by the experts in the field of CAs, this inferal type of TPA, could become a central tool in the CA research community, creating better consensus data on this ever-growing protein family.