(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) > cellular organisms: NE > Eukaryota: NE > Viridiplantae: NE > Streptophyta: NE > Streptophytina: NE > Embryophyta: NE > Tracheophyta: NE > Euphyllophyta: NE > Spermatophyta: NE > Magnoliophyta: NE > Mesangiospermae: NE > Liliopsida: NE > Petrosaviidae: NE > commelinids: NE > Poales: NE > Poaceae: NE > BOP clade: NE > Oryzoideae: NE > Oryzeae: NE > Oryzinae: NE > Oryza: NE > Oryza sativa: NE > Oryza sativa Japonica Group: NE
Warning: This entry is a compilation of different species or line or strain with more than 90% amino acide identity. You can retrieve all strain data
(Below N is a link to NCBI taxonomic web page and E link to ESTHER at designed phylum.) Oryza sativa Japonica Group: N, E.
Oryza nivara: N, E.
Oryza glumipatula: N, E.
Oryza rufipogon: N, E.
Oryza meridionalis: N, E.
LegendThis sequence has been compared to family alignement (MSA) red => minority aminoacid blue => majority aminoacid color intensity => conservation rate title => sequence position(MSA position)aminoacid rate Catalytic site Catalytic site in the MSA MRPVSSAEEVGALLSRSDSSGRRRRSSPVQSASPRPAGCGCGGPRRQSSF RDDVGHAASETYLVTRLTFSLLQYLGLGYRWMSQLLALTIYAILLMPGFL QVGYYYFFSSQVRRSIVYGEQPRNRLDLYIPKDINRPCPVVAFVTGGAWI IGYKAWGSLLGRRLAERGIIVACIDYRNFPQGTIGDMVSDASQGISYVCN NIASYGGDPNRIYLVGQSAGAHIAACALIEQAVKESSGQSISWSVTQIKA YFGLSGGYNMHSLVDHFHERGLNRSIFFSIMEGEESLSRYSPEIVVKQSS SQTIALLPPIVLMHGTEDYSIPSSASQTFADVLQQAGAQAKLQLYEGKTH TDIFIQDPLRGGRDPLVEDVLSIIHVDDEITQEKIALAPAPRRLVFEWQL QLARRFSPF
We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, or almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped super-scaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000-40,000. Only 2%-3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family.
The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence indicates the importance of a high-quality finished sequence.