S. cerevisiae strain DBVPG6044 genome sequence. 02/03/2015 This directory contains 6 files describing the genome of S. cerevisiae strain DBVPG6044, a haploid derivative strain of CBS405, which was isolated in West Africa from bili wine fermented from the flowering plant Dissotis (Osbeckia) grandiflora (Accession: JRIG00000000; ATCC Number: 10604). genome fasta file: DBVPG6044_Stanford_2014_JRIG00000000.fsa nucleic acid coding sequence file: DBVPG6044_JRIG00000000_cds.fsa protein sequence file: DBVPG6044_JRIG00000000_pep.fsa GFF file: DBVPG6044_JRIG00000000.gff snv variant call format file: DBVPG6044.gatk.vcf.gz indel variant call format file: DBVPG6044.indel.gatk.vcf.gz FASTA headers in each of the files include the following pieces of information: genome fasta file: GenBank GI number GenBank accession.version species name strain name scaffold number contig length nucleic acid coding sequence file: ORF name_strain name gene name feature type feature classification (Verified, Uncharacterized, or Dubious) feature description protein sequence file: ORF name_strain name gene name feature type feature classification (Verified, Uncharacterized, or Dubious) feature description Feature definitions (for coding sequences and features in the gff file) were assigned via homology-based annotation using LASTZ and AUGUSTUS vs. the R64-1 S288C reference sequence in conjuction with ab initio annotation using MAKER and BLASTX validation. Feature definitions (for coding sequences and features in the gff file) were assigned via homology-based annotation using LASTZ and AUGUSTUS vs. the R64-1 S288C reference sequence in conjuction with ab initio annotation using MAKER and BLASTX validation. Assembly statistics ================================ Fold coverage: 176 Number of scaffolds: 819 Assembly size: 11642411 Longest scaffold: 134064 Scaffold N50: 36171 Number of ORFs: 5297 Reference for this S. cerevisiae DBVPG6044 genome sequence, assembly and annotation: Song G., et al., 2014. AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae. PLoS ONE, in press. For more information about DBVPG6044: Liti, G., Barton, D.B. & Louis, E.J. Sequence diversity, reproductive isolation and species concepts in Saccharomyces. Genetics 174, 839-50 (2006). PMID: 16951060 Liti, G., Peruffo, A., James, S.A., Roberts, I.N. & Louis, E.J. Inferences of evolutionary relationships from a population survey of LTR-retrotransposons and telomeric-associated sequences in the Saccharomyces sensu stricto complex. Yeast 22, 177-92 (2005). PMID: 15704235 For more information about other yeast strains: Engel SR, Cherry JM. 2013. The new modern era of yeast genomics: community sequencing and the resulting annotation of multiple Saccharomyces cerevisiae strains at the Saccharomyces Genome Database. Database (Oxford). 2013 Mar 13;2013:bat012. doi: 10.1093/database/bat012. PMID: 23487186 For more information about the S288C Reference Genome: Engel SR, Dietrich FS, Fisk DG, Binkley G, Balakrishnan R, Costanzo MC, Dwight SS, Hitz BC, Karra K, Nash RS, Weng S, Wong ED, Lloyd P, Skrzypek MS, Miyasato SR, Simison M, Cherry JM. 2014. The reference genome sequence of Saccharomyces cerevisiae: then and now. G3 (Bethesda). 2014 Mar 20;4(3):389-98. doi: 10.1534/g3.113.008995. PMID: 24374639 To download the S288C Reference Genome sequence and annotation: http://www.yeastgenome.org/download-data/sequence