(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Eukaryota: NE > Euglenozoa: NE > Kinetoplastida: NE > Trypanosomatidae: NE > Trypanosoma: NE > Trypanozoon: NE > Trypanosoma brucei: NE
Warning: This entry is a compilation of different species or line or strain with more than 90% amino acide identity. You can retrieve all strain data
(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) Trypanosoma brucei TREU927: N, E.
Trypanosoma brucei gambiense: N, E.
Trypanosoma brucei brucei strain 927/4 GUTat10.1: N, E.
Trypanosoma brucei brucei: N, E.
Trypanosoma brucei gambiense DAL972: N, E.
Trypanosoma brucei brucei TREU927: N, E.
Trypanosoma brucei equiperdum: N, E.
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MSEPVDQLLKTYLQSCDYSIDSSSRKKALEMSCLCCHVYGGEGHLPDGWL VCTREVEGLKKRDESCGFRSELYTNGSKYVLAFAGVHDNRSAFESALQLV GKSDAYKLAAANAALVVSAFGLSNVSFTGHSLGGGLATAAAVFTGAPAIT FNPAWLSSSTRSELLKFPSVEVINYVIFAEALDVFQRHPQLLNSVPAGAF FAGLLSNSKIQQFGTFKYIYCKVIHDRPHYIDAHLIETIIEELRKENGEK ISASDLAASSLHEDVMGGMAQLVQQKMSVIMEVVASVMSKQFSAGGFGSS
The African trypanosome, Trypanosoma brucei, causes sleeping sickness in humans in sub-Saharan Africa. Here we report the sequence and analysis of the 1.1 Mb chromosome I, which encodes approximately 400 predicted genes organised into directional clusters, of which more than 100 are located in the largest cluster of 250 kb. A 160-kb region consists primarily of three gene families of unknown function, one of which contains a hotspot for retroelement insertion. We also identify five novel gene families. Indeed, almost 20% of predicted genes are members of families. In some cases, tandemly arrayed genes are 99-100% identical, suggesting an active process of amplification and gene conversion. One end of the chromosome consists of a putative bloodstream-form variant surface glycoprotein (VSG) gene expression site that appears truncated and degenerate. The other chromosome end carries VSG and expression site-associated genes and pseudogenes over 50 kb of subtelomeric sequence where, unusually, the telomere-proximal VSG gene is oriented away from the telomere. Our analysis includes the cataloguing of minor genetic variations between the chromosome I homologues and an estimate of crossing-over frequency during genetic exchange. Genetic polymorphisms are exceptionally rare in sequences located within and around the strand-switches between several gene clusters.