This directory contains download files of the Saccharomyces
cerevisiae genome sequence and associated annotations. The data
is based on sequence dated June 2008 in the Saccharomyces Genome
Database (http://www.yeastgenome.org/) and was obtained from the site
http://downloads.yeastgenome.org/sequence/genomic_sequence/chromosomes/fasta/
The S288C strain was used in this sequencing project.

Files included in this directory:

sacCer2.2bit - contains the complete genome sequence in the 2bit file format.
    The utility program, twoBitToFa (available from the kent src tree),
    can be used to extract .fa file(s) from this file.  A pre-compiled
    version of the command line tool can be found at:
        http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/
    See also:
        http://genome.ucsc.edu/admin/git.html
	http://genome.ucsc.edu/admin/jk-install.html

chromAgp.tar.gz - contains the list of accession identifiers for
    each chromosome, unpacking to one file per chromosome.

chromFa.tar.gz - The assembly sequence in one file per chromosome.
    No masking has been applied to these sequences.

There are NO RepeatMasker .out files for this assembly.

chromTrf.tar.gz - Tandem Repeats Finder locations, filtered to keep repeats
    with period less than or equal to 12, and translated into UCSC's BED
    format (one file per chromosome).

est.fa.gz - S. cerevisiae ESTs in GenBank. This sequence data is updated once a
    week via automatic GenBank updates.

md5sum.txt - checksums of files in this directory

mrna.fa.gz - S. cerevisiae mRNA from GenBank. This sequence data is updated
    once a week via automatic GenBank updates.

sgdGene.upstream*.fa.gz - Saccharomyces Genome Database genes upstream
	sequences, 1000, 2000 and 5000 bases


sacCer2.chrom.sizes - Two-column tab-separated text file containing assembly
    sequence names and sizes.

------------------------------------------------------------------
If you plan to download a large file or multiple files from this
directory, we recommend that you use ftp rather than downloading the
files via our website. To do so, ftp to hgdownload.cse.ucsc.edu
[username: anonymous, password: your email address], then cd to the
directory goldenPath/sacCer2/bigZips. To download multiple files, use
the "mget" command:

    mget <filename1> <filename2> ...
    - or -
    mget -a (to download all the files in the directory)

Alternate methods to ftp access.

Using an rsync command to download the entire directory:
    rsync -avzP rsync://hgdownload.cse.ucsc.edu/goldenPath/sacCer2/bigZips/ .
For a single file, e.g. chromFa.tar.gz
    rsync -avzP 
        rsync://hgdownload.cse.ucsc.edu/goldenPath/sacCer2/bigZips/chromFa.tar.gz .

Or with wget, all files:
    wget --timestamping 
        'ftp://hgdownload.cse.ucsc.edu/goldenPath/sacCer2/bigZips/*'
With wget, a single file:
    wget --timestamping 
        'ftp://hgdownload.cse.ucsc.edu/goldenPath/sacCer2/bigZips/chromFa.tar.gz' 
        -O chromFa.tar.gz

To unpack the *.tar.gz files:
    tar xvzf <file>.tar.gz
To uncompress the fa.gz files:
    gunzip <file>.fa.gz


All the tables in this directory are freely available for public use.
      Name                       Last modified      Size  Description
Parent Directory - chromAgp.tar.gz 2009-02-24 15:40 711 chromFa.tar.gz 2009-02-24 15:40 3.6M chromTrf.tar.gz 2009-02-24 15:40 20K est.fa.gz 2019-10-17 21:04 6.2M est.fa.gz.md5 2019-10-17 21:04 44 genes/ 2020-02-05 13:47 - md5sum.txt 2012-01-09 13:19 434 mrna.fa.gz 2019-10-17 21:00 111K mrna.fa.gz.md5 2019-10-17 21:00 45 sacCer2.2bit 2009-02-03 14:05 2.9M sacCer2.chrom.sizes 2009-02-03 14:05 242 sacCer2.fa.gz 2020-01-23 02:26 3.6M sgdGene.upstream1000.fa.gz 2009-07-28 12:37 2.1M sgdGene.upstream2000.fa.gz 2009-07-28 12:37 3.8M sgdGene.upstream5000.fa.gz 2009-07-28 12:37 8.1M upstream1000.fa.gz 2019-10-17 21:04 16K upstream1000.fa.gz.md5 2019-10-17 21:04 53 upstream2000.fa.gz 2019-10-17 21:04 30K upstream2000.fa.gz.md5 2019-10-17 21:04 53 upstream5000.fa.gz 2019-10-17 21:04 73K upstream5000.fa.gz.md5 2019-10-17 21:04 53 xenoRefMrna.fa.gz 2019-10-17 21:04 331M xenoRefMrna.fa.gz.md5 2019-10-17 21:04 52