BACKGROUND: Medicago truncatula, a close relative of alfalfa, is a preeminent model for studying nitrogen fixation, symbiosis, and legume genomics. The Medicago sequencing project began in 2003 with the goal to decipher sequences originated from the euchromatic portion of the genome. The initial sequencing approach was based on a BAC tiling path, culminating in a BAC-based assembly (Mt3.5) as well as an in-depth analysis of the genome published in 2011. RESULTS: Here we describe a further improved and refined version of the M. truncatula genome (Mt4.0) based on de novo whole genome shotgun assembly of a majority of Illumina and 454 reads using ALLPATHS-LG. The ALLPATHS-LG scaffolds were anchored onto the pseudomolecules on the basis of alignments to both the optical map and the genotyping-by-sequencing (GBS) map. The Mt4.0 pseudomolecules encompass ~360 Mb of actual sequences spanning 390 Mb of which ~330 Mb align perfectly with the optical map, presenting a drastic improvement over the BAC-based Mt3.5 which only contained 70% sequences (~250 Mb) of the current version. Most of the sequences and genes that previously resided on the unanchored portion of Mt3.5 have now been incorporated into the Mt4.0 pseudomolecules, with the exception of ~28 Mb of unplaced sequences. With regard to gene annotation, the genome has been re-annotated through our gene prediction pipeline, which integrates EST, RNA-seq, protein and gene prediction evidences. A total of 50,894 genes (31,661 high confidence and 19,233 low confidence) are included in Mt4.0 which overlapped with ~82% of the gene loci annotated in Mt3.5. Of the remaining genes, 14% of the Mt3.5 genes have been deprecated to an "unsupported" status and 4% are absent from the Mt4.0 predictions. CONCLUSIONS: Mt4.0 and its associated resources, such as genome browsers, BLAST-able datasets and gene information pages, can be found on the JCVI Medicago web site (http://www.jcvi.org/medicago). The assembly and annotation has been deposited in GenBank (BioProject: PRJNA10791). The heavily curated chromosomal sequences and associated gene models of Medicago will serve as a better reference for legume biology and comparative genomics.
Paraoxonase (PON) is a high-density lipoprotein (HDL)-associated esterase, which may prevent the transformation of low-density lipoproteins (LDL) into biologically active, atherogenic particles. PON concentration and activity are affected by PON1 gene polymorphisms and found to be altered in type 2 diabetes patients with retinopathy. We investigated serum PON concentration, in vitro activity and polymorphism at position 54 (L/M, Leu-Met54) in 193 Caucasian adolescents and young adults (88 males, 105 females) with type 1 diabetes mellitus, as well as its relationship to the presence of retinopathy. An inverse linear correlation was found between blood glucose levels and both serum PON concentration (r = -.20, P =.017) and its activity (r = -0.17, P =.037). Patients with elevated blood glucose values (> or =10 mmol/L) had significantly lower levels of both PON concentration (P =.003) and activity (P =.028) than those with lower glucose levels. After adjusting for blood glucose and diabetes duration, PON activity was significantly higher in patients with different stages of retinopathy compared with those without retinopathy (P =.003). The L/L genotype was closely associated with the presence of retinopathy (P <.0001). These data show that young people with type 1 diabetes and the L/L polymorphism at position 54 of PON1 gene are more susceptible to retinal complications. However, the role of serum PON concentration and activity as a possible marker for monitoring late microvascular complications in these patients has to be established.
The genome of the flowering plant Arabidopsis thaliana has five chromosomes. Here we report the sequence of the largest, chromosome 1, in two contigs of around 14.2 and 14.6 megabases. The contigs extend from the telomeres to the centromeric borders, regions rich in transposons, retrotransposons and repetitive elements such as the 180-base-pair repeat. The chromosome represents 25% of the genome and contains about 6,850 open reading frames, 236 transfer RNAs (tRNAs) and 12 small nuclear RNAs. There are two clusters of tRNA genes at different places on the chromosome. One consists of 27 tRNA(Pro) genes and the other contains 27 tandem repeats of tRNA(Tyr)-tRNA(Tyr)-tRNA(Ser) genes. Chromosome 1 contains about 300 gene families with clustered duplications. There are also many repeat elements, representing 8% of the sequence.