Budiman_2000_Genome.Res_10_129

Reference

Title : A deep-coverage tomato BAC library and prospects toward development of an STC framework for genome sequencing - Budiman_2000_Genome.Res_10_129
Author(s) : Budiman MA, Mao L, Wood TC, Wing RA
Ref : Genome Res, 10:129, 2000
Abstract :

Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10^-6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed.

PubMedID: 10645957
Gene_locus related to this paper: sollc-e0ycs4 , sollc-e0ycs5 , sollc-k4b383 , sollc-k4b389 , sollc-k4b6v8 , sollc-k4b6w2 , sollc-k4bg91 , sollc-k4bhg7 , sollc-k4crx1 , sollc-k4cnu5 , sollc-k4bhq5 , sollc-k4db70 , sollc-k4d480 , sollc-k4czd6 , sollc-k4b1g3 , sollc-k4cmm3 , sollc-k4cmm0 , sollc-k4cmm1 , sollc-k4d0n0 , sollc-k4bf33 , sollc-k4cj71 , soltu-m1c8d8 , sollc-k4ddh0 , sollc-k4ci68 , sollc-k4cmm4 , sollc-k4db68 , sollc-k4c685 , sollc-k4cui8 , sollc-k4cvn7

Related information

Gene_locus_frgt sollc-k4b6z1    sollc-k4cpl1    sollc-k4b390    sollc-k4b397    sollc-k4b6v9    sollc-k4b6w0    sollc-k4cml3    sollc-k4cml7

Citation formats

Budiman MA, Mao L, Wood TC, Wing RA (2000)
A deep-coverage tomato BAC library and prospects toward development of an STC framework for genome sequencing
Genome Res 10:129

Budiman MA, Mao L, Wood TC, Wing RA (2000)
Genome Res 10:129