(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Eukaryota: NE > Euglenozoa: NE > Kinetoplastida: NE > Trypanosomatidae: NE > Leishmaniinae: NE > Leishmania [genus]: NE > Leishmania [subgenus]: NE > Leishmania major species complex: NE > Leishmania major: NE
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MTGDLSASGAAAPLLCRVPFLSKLKFLCLHVFCALVVDLVLRLTRFLSRA QPGIQTTATHGSLQAKSFFKLDRLDKVPYPDPPYSGHVSTILCAFRPRRS IPYQRVVHPGADGNPMHLDWMLTDSRAAKGVFLIIPGLASWSGTNYIEHF VWSAFTHHFHCGVFNSRGMGNTPIETPRLMSGKWTDDLRAVLRDGPFSRA AIEERCGAGIPIIGVGFSLGGVILSKYVGEECLAGRELVMDAVMVVNSPL DCLDSNAVISRGISKVLYQPSMAGSLTAYARRHAKVLKDLPGLSPDVRAA FASGRLEKILAQVKTVHDFDRLITAPTLGFATPEAYYHHISPIQWLPHFS VPVLCISAADDPVTGEPRMESLDNTMRSNPNVALLVIPHGGHLGYIRSVR DEWLGRETMMEKIIYEVAAAITPRR
Leishmania species cause a spectrum of human diseases in tropical and subtropical regions of the world. We have sequenced the 36 chromosomes of the 32.8-megabase haploid genome of Leishmania major (Friedlin strain) and predict 911 RNA genes, 39 pseudogenes, and 8272 protein-coding genes, of which 36% can be ascribed a putative function. These include genes involved in host-pathogen interactions, such as proteolytic enzymes, and extensive machinery for synthesis of complex surface glycoconjugates. The organization of protein-coding genes into long, strand-specific, polycistronic clusters and lack of general transcription factors in the L. major, Trypanosoma brucei, and Trypanosoma cruzi (Tritryp) genomes suggest that the mechanisms regulating RNA polymerase II-directed transcription are distinct from those operating in other eukaryotes, although the trypanosomatids appear capable of chromatin remodeling. Abundant RNA-binding proteins are encoded in the Tritryp genomes, consistent with active posttranscriptional regulation of gene expression.