Domestication, diversity and evolution in the light of 1,011 yeast genomes

Elucidating the roots of the astonishing phenotypic diversity observed in natural populations is a major challenge in biology. An essential step in this process is to explore genetic diversity at the species-wide level. In this context, a large-scale study was conducted by the teams of Joseph Schacherer (Université de Strasbourg / CNRS), Gianni Liti (Université Côte d’Azur, CNRS, INSERM, IRCAN) and the Genoscope (Institut de biologie François Jacob du CEA, CNRS, Université d’Evry, Université Paris-Saclay) as part of a flagship project selected by the France Génomique program, with the goal of generating a detailed map of genetic diversity in the classic model yeast Saccharomyces cerevisiae. The completion of whole-genome sequencing of 1,011 natural isolates, together with the accompanying phenotyping efforts, has led to one of the best understandings of population-level natural genetic and phenotypic diversity in any eukaryotic model system. This study was published in the journal Nature.

Sequenced strains were collected worldwide to sample as much diversity as possible in terms of geographic location (including all continents) as well as ecological source (both human-related sources, such as dairy products, wine, sake and bread, and wild niches, such as trees, insects, flowers and soil). These isolates were also phenotyped: their growth fitness was determined under different conditions affecting various physiological and cellular responses, giving a global view of the phenotypic landscape of the species.

Altogether, the generated datasets made it possible to highlight key points of the species’ evolutionary history, its genome evolution and the impact of that evolution on the genotype-phenotype relationship. First, the exquisite detail with which the pattern of polymorphism was examined made it possible to dissect the species’ history. The study provides novel and clear evidence of an East Asian origin and strongly suggests that S. cerevisiae began to disperse around the world from a single out-of-China event. As a result of human activity, S. cerevisiae then underwent substantial genomic and phenotypic changes during multiple independent domestication events tied to specific human processes (e.g. wine, sake and beer fermentation). Interestingly, these domestication events affected genome evolution in different ways. Whereas the sake and wine populations are characterized by low genetic diversity, beer populations show higher genetic diversity as well as more complex genomic diversity. Furthermore, human-related environments foster the expansion and loss of genes, resulting in rampant variation in genome content. By contrast, wild isolates share a similar genome content, and their genetic diversity is mainly generated through the accumulation of mutations.
Second, the study provides an overview of the respective importance of the various genomic features (e.g. ploidy, aneuploidy, introgressions, genetic variants) that shape genome evolution and, consequently, the species-wide phenotypic landscape. For example, it was possible to define the core genome (4,940 genes present in all 1,011 sequenced strains) and the variable genome (2,908 genes present in only a fraction of the population). Gene content therefore varies among isolates, with a set of dispensable genes subject to segregation, introgression (from closely related species) and horizontal gene transfer. Horizontal gene transfer events, for instance, are mostly restricted to S. cerevisiae isolates from domestic fermentative environments. In addition, ploidy and aneuploidy levels vary between subpopulations and depend on their ecological origin.
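The distinction between core and variable genome comes down to simple set logic: a gene is core if it is found in every strain, and variable otherwise. The sketch below illustrates that definition only; it is not the pipeline used in the study. The function name `split_pangenome`, the strain labels and the gene names are all hypothetical, and the input is assumed to be a mapping from each strain to the set of genes detected in it.

```python
from typing import Dict, Set, Tuple


def split_pangenome(gene_presence: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Split the genes seen across strains into core and variable sets.

    gene_presence maps a strain name to the set of gene names detected
    in that strain (a hypothetical input format, chosen for illustration).
    """
    if not gene_presence:
        return set(), set()
    gene_sets = list(gene_presence.values())
    # Pangenome: every gene seen in at least one strain.
    pangenome = set().union(*gene_sets)
    # Core genome: genes present in every strain.
    core = set.intersection(*gene_sets)
    # Variable genome: genes missing from at least one strain.
    variable = pangenome - core
    return core, variable


# Toy data with made-up strain and gene names.
toy = {
    "wine_isolate": {"geneA", "geneB", "geneC"},
    "sake_isolate": {"geneA", "geneB", "geneD"},
    "oak_isolate": {"geneA", "geneB"},
}
core, variable = split_pangenome(toy)
print(sorted(core))
print(sorted(variable))
```

In this toy case the two genes shared by all three strains form the core set, and the two genes found in a single strain each form the variable set. Applied to real presence/absence calls, the same partition would depend heavily on how gene presence is detected, which this sketch does not address.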
Finally, this study sheds new light on the genotype-phenotype relationship in a natural population. S. cerevisiae presents a high level of genetic diversity, much greater than that found in humans. Among the 1,011 genomes, most of the detected genetic polymorphisms are very low-frequency variants, a trend similar to the one observed in the human population, which raises questions about the impact and importance of rare variants on the phenotypic landscape within a population. Genome-wide association analyses highlighted the importance of copy number variants, which explain a larger proportion of the phenotypic variance and have greater effects on phenotype than single nucleotide polymorphisms. Beyond the analyses reported in the paper, this resource will enable powerful genetic and genomic studies in a key model system.

Figure – © Eric Rottinger
Yeast colonies grown on a solid agar plate from the S. cerevisiae isolates whose genomes were sequenced and described in the manuscript. The picture is overlaid with a world map because this panel of 1,011 strains was collected worldwide and one of the results presented in the manuscript is the Asian geographic origin of S. cerevisiae.


Joseph Schacherer –
Université de Strasbourg / CNRS, UMR 7156
28, rue Goethe
67000 Strasbourg
Tel: +33 (0)3 68 85 19 61

Gianni Liti –
Université Côte d’Azur, CNRS, INSERM, IRCAN
28 Avenue de Valombrose
06107 NICE Cedex 2
Tel: +33 (0)4 93 37 76 72