(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Eukaryota: NE > Viridiplantae: NE > Streptophyta: NE > Streptophytina: NE > Embryophyta: NE > Tracheophyta: NE > Euphyllophyta: NE > Spermatophyta: NE > Magnoliophyta: NE > Mesangiospermae: NE > eudicotyledons: NE > Gunneridae: NE > Pentapetalae: NE > rosids: NE > fabids: NE > Fabales: NE > Fabaceae: NE > Papilionoideae: NE > Trifolieae: NE > Medicago: NE > Medicago truncatula: NE
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MGGGLVGDNIGLNEEVIKKTCSLSFKAHNSENEPYISELFRISSDLTEYL VLSFPGSWVETDWFVRKSFGETKIDLAKFPLLKSVGNDETALVNQAFFNR FERLLKLSSFVAEVKKAIAEGKQVVFTGHSSGAVLAILATFWALEEYLYP TKIQIQHKSPMCVTFGSPLVGNHIFSHASNREKWSHHFIHFVMQYDIVPR IFLAPFSSIEKLISPVLQLLTPNNNSLTQDPIRDSVTCEFYSTVMRNAAT VTSHVACNLMGSTNLLLETMTNFVELSPYRPFGTYIFCNGNGQMIVVKNS DAVLQLMFHIAQLRDLAQLSEVANKSILQHLAYEAELEESLGMQNVVYLN KLDDLPLSSGEGHDTDIAAALDSLGLSARARLCLRAAGELEKQKERNEEK IKKEFQDKAVPSMRDLEEYKSTCEINNGKGYYDAFKVQKEPKDFQANVKR LVLAGVWDEIIEMLKRYELPDEFEGKKEWIEHGTRFRRLVEPLDIANYHR HLKNEDTGPYMNKARPKRYRYTQRWLEHANRSPKAEITESTFWAEVEELC SWTSNNKPFEDIKERVLKLEQDIKMWTDKEVLTKDVLSKDPTFIKLWETL PQEHKATSCISTLFTVKE
BACKGROUND: Medicago truncatula, a close relative of alfalfa, is a preeminent model for studying nitrogen fixation, symbiosis, and legume genomics. The Medicago sequencing project began in 2003 with the goal to decipher sequences originated from the euchromatic portion of the genome. The initial sequencing approach was based on a BAC tiling path, culminating in a BAC-based assembly (Mt3.5) as well as an in-depth analysis of the genome published in 2011. RESULTS: Here we describe a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG. The ALLPATHS-LG scaffolds were anchored onto the pseudomolecules on the basis of alignments to both the optical map and the genotyping-by-sequencing (GBS) map. The Mt4.0 pseudomolecules encompass ~360 Mb of actual sequences spanning 390 Mb of which ~330 Mb align perfectly with the optical map, presenting a drastic improvement over the BAC-based Mt3.5 which only contained 70% sequences (~250 Mb) of the current version. Most of the sequences and genes that previously resided on the unanchored portion of Mt3.5 have now been incorporated into the Mt4.0 pseudomolecules, with the exception of ~28 Mb of unplaced sequences. With regard to gene annotation, the genome has been re-annotated through our gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences. A total of 50,894 genes (31,661 high confidence and 19,233 low confidence) are included in Mt4.0 which overlapped with ~82% of the gene loci annotated in Mt3.5. Of the remaining genes, 14% of the Mt3.5 genes have been deprecated to an "unsupported" status and 4% are absent from the Mt4.0 predictions. CONCLUSIONS: Mt4.0 and its associated resources, such as genome browsers, BLAST-able datasets and gene information pages, can be found on the JCVI Medicago web site (http://www.jcvi.org/medicago). The assembly and annotation has been deposited in GenBank (BioProject: PRJNA10791). The heavily curated chromosomal sequences and associated gene models of Medicago will serve as a better reference for legume biology and comparative genomics.
Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Myr ago). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species. Medicago truncatula is a long-established model for the study of legume biology. Here we describe the draft sequence of the M. truncatula euchromatin based on a recently completed BAC assembly supplemented with Illumina shotgun sequence, together capturing approximately 94% of all M. truncatula genes. A whole-genome duplication (WGD) approximately 58 Myr ago had a major role in shaping the M. truncatula genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the M. truncatula genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max and Lotus japonicus. M. truncatula is a close relative of alfalfa (Medicago sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.