Misra_2002_Genome.Biol_3_RESEARCH0083

Reference

Title : Annotation of the Drosophila melanogaster euchromatic genome: a systematic review - Misra_2002_Genome.Biol_3_RESEARCH0083
Author(s) : Misra S , Crosby MA , Mungall CJ , Matthews BB , Campbell KS , Hradecky P , Huang Y , Kaminker JS , Millburn GH , Prochnik SE , Smith CD , Tupy JL , Whitfied EJ , Bayraktaroglu L , Berman BP , Bettencourt BR , Celniker SE , de Grey AD , Drysdale RA , Harris NL , Richter J , Russo S , Schroeder AJ , Shu SQ , Stapleton M , Yamada C , Ashburner M , Gelbart WM , Rubin GM , Lewis SE
Ref : Genome Biol , 3 :RESEARCH0083 , 2002
Abstract :

BACKGROUND: The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences.
RESULTS: Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes.
CONCLUSIONS: Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations.

PubMedSearch : Misra_2002_Genome.Biol_3_RESEARCH0083
PubMedID: 12537572
Gene_locus related to this paper: drome-a1z6g9 , drome-abhd2 , drome-ACHE , drome-CG8058 , drome-CG8093 , drome-CG8233 , drome-CG8425 , drome-CG9059 , drome-CG9186 , drome-CG9542 , drome-CG10982 , drome-CG11309 , drome-CG11406 , drome-CG11598 , drome-CG17097 , drome-glita , drome-KRAKEN , drome-nrtac , drome-OME , drome-q7k274 , drome-q9vux3

Related information

Gene_locus drome-a1z6g9    drome-abhd2    drome-ACHE    drome-CG8058    drome-CG8093    drome-CG8233    drome-CG8425    drome-CG9059    drome-CG9186    drome-CG9542    drome-CG10982    drome-CG11309    drome-CG11406    drome-CG11598    drome-CG17097    drome-glita    drome-KRAKEN    drome-nrtac    drome-OME    drome-q7k274    drome-q9vux3
Gene_locus_frgt drome-a1z6g9    drome-abhd2    drome-ACHE    drome-CG8058    drome-CG8093    drome-CG8233    drome-CG8425    drome-CG9059    drome-CG9186    drome-CG9542    drome-CG10982    drome-CG11309    drome-CG11406    drome-CG11598    drome-CG17097    drome-glita    drome-KRAKEN    drome-nrtac    drome-OME    drome-q7k274    drome-q9vux3    drome-f172a

Citations formats

Misra S, Crosby MA, Mungall CJ, Matthews BB, Campbell KS, Hradecky P, Huang Y, Kaminker JS, Millburn GH, Prochnik SE, Smith CD, Tupy JL, Whitfied EJ, Bayraktaroglu L, Berman BP, Bettencourt BR, Celniker SE, de Grey AD, Drysdale RA, Harris NL, Richter J, Russo S, Schroeder AJ, Shu SQ, Stapleton M, Yamada C, Ashburner M, Gelbart WM, Rubin GM, Lewis SE (2002)
Annotation of the Drosophila melanogaster euchromatic genome: a systematic review
Genome Biol 3 :RESEARCH0083

Misra S, Crosby MA, Mungall CJ, Matthews BB, Campbell KS, Hradecky P, Huang Y, Kaminker JS, Millburn GH, Prochnik SE, Smith CD, Tupy JL, Whitfied EJ, Bayraktaroglu L, Berman BP, Bettencourt BR, Celniker SE, de Grey AD, Drysdale RA, Harris NL, Richter J, Russo S, Schroeder AJ, Shu SQ, Stapleton M, Yamada C, Ashburner M, Gelbart WM, Rubin GM, Lewis SE (2002)
Genome Biol 3 :RESEARCH0083