The vast majority of wine fermentations are performed principally by Saccharomyces cerevisiae. However, there are a growing number of instances in which other species of Saccharomyces play a predominant role. Interestingly, the presence of these other yeast species generally occurs via the formation of interspecific hybrids that contain genomic contributions from both S. cerevisiae and non-S. cerevisiae species. However, despite the large number of wine strains that are characterized at the genomic level, there remains limited information regarding the detailed genomic structure of hybrids used in winemaking. To address this, we describe the genome sequence of the thiol-releasing commercial wine yeast hybrid VIN7. VIN7 is shown to be an almost complete allotriploid interspecific hybrid that is comprised of a heterozygous diploid complement of S. cerevisiae chromosomes and a haploid Saccharomyces kudriavzevii genomic contribution. Both parental strains appear to be of European origin, with the S. cerevisiae parent being closely related to, but distinct from, the commercial wine yeasts QA23 and EC1118. In addition, several instances of chromosomal rearrangement between S. cerevisiae and S. kudriavzevii sequences were observed that may mark the early stages of hybrid genome consolidation.
Human intervention has subjected the yeast Saccharomyces cerevisiae to multiple rounds of independent domestication and thousands of generations of artificial selection. As a result, this species comprises a genetically diverse collection of natural isolates as well as domesticated strains that are used in specific industrial applications. However the scope of genetic diversity that was captured during the domesticated evolution of the industrial representatives of this important organism remains to be determined. To begin to address this, we have produced whole-genome assemblies of six commercial strains of S. cerevisiae (four wine and two brewing strains). These represent the first genome assemblies produced from S. cerevisiae strains in their industrially-used forms and the first high-quality assemblies for S. cerevisiae strains used in brewing. By comparing these sequences to six existing high-coverage S. cerevisiae genome assemblies, clear signatures were found that defined each industrial class of yeast. This genetic variation was comprised of both single nucleotide polymorphisms and large-scale insertions and deletions, with the latter often being associated with ORF heterogeneity between strains. This included the discovery of more than twenty probable genes that had not been identified previously in the S. cerevisiae genome. Comparison of this large number of S. cerevisiae strains also enabled the characterization of a cluster of five ORFs that have integrated into the genomes of the wine and bioethanol strains on multiple occasions and at diverse genomic locations via what appears to involve the resolution of a circular DNA intermediate. This work suggests that, despite the scrutiny that has been directed at the yeast genome, there remains a significant reservoir of ORFs and novel modes of genetic transmission that may have significant phenotypic impact in this important model and industrial species.
Many industrial strains of Saccharomyces cerevisiae have been selected primarily for their ability to convert sugars into ethanol efficiently despite exposure to a variety of stresses. To begin investigation of the genetic basis of phenotypic variation in industrial strains of S. cerevisiae, we have sequenced the genome of a wine yeast, AWRI1631, and have compared this sequence with both the laboratory strain S288c and the human pathogenic isolate YJM789. AWRI1631 was found to be substantially different from S288c and YJM789, especially at the level of single-nucleotide polymorphisms, which were present, on average, every 150 bp between all three strains. In addition, there were major differences in the arrangement and number of Ty elements between the strains, as well as several regions of DNA that were specific to AWRI1631 and that were predicted to encode proteins that are unique to this industrial strain.