This directory contains FASTA files of human genomic sequence for the
ENCODE regions, taken from the May 2004 (UCSC hg17, NCBI Build 35)
assembly of the human genome.
The sequences are in the files:

    hg17.fa.gz          unmasked sequence (all upper case)
    hg17.msk.fa.gz      soft-masked sequence (repeats in lower case)
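
The difference between the two files can be checked directly: in the
soft-masked file, repeat bases are lower case, so the lower-case fraction
of each record measures how much of it is masked. The following Python
sketch is illustrative only. The function names (iter_fasta,
masked_fraction, report) are made up for this example, the demo record is
not real ENCODE sequence, and it is assumed that the files have been
downloaded into the current directory.

```python
import gzip
import io


def iter_fasta(handle):
    """Yield (header, sequence) pairs from an open text-mode FASTA handle."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif line:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def masked_fraction(seq):
    """Return the fraction of bases in lower case (soft-masked)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base.islower()) / len(seq)


def report(path):
    """Print name, length and masked fraction for each record in a .fa.gz file."""
    with gzip.open(path, "rt") as handle:
        for name, seq in iter_fasta(handle):
            print(name, len(seq), f"{masked_fraction(seq):.3f}")


# In-memory demo with a made-up record (not real ENCODE sequence).
demo = io.StringIO(">demo_region\nACGTacgt\nNNnn\n")
for name, seq in iter_fasta(demo):
    print(name, len(seq), masked_fraction(seq))
```

Calling report("hg17.msk.fa.gz") would print one line per record. For
hg17.fa.gz every fraction should be 0, since that file is all upper case.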

In October 2005, the ENCODE project is transitioning from
the initial reference build (NCBI Build 34, UCSC hg16) to this one.

For background on the ENCODE project, see: 

NHGRI: The ENCODE Project: ENCyclopedia Of DNA Elements

For the list of primary and backup regions see:

ENCODE Target Regions

    Name              Last modified       Size
    md5sum.txt        2005-09-26 11:18      94
    hg17.fa.gz        2005-09-26 16:36    8.7M
    hg17.msk.fa.gz    2005-09-26 16:36    9.3M
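
The md5sum.txt file listed above can be used to check that the downloads
are intact. The Python sketch below is a hedged example, not an official
procedure: it assumes md5sum.txt uses the usual md5sum layout (one
"checksum  filename" pair per line) and that all three files are in the
current directory. The function names are hypothetical.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_md5sum_lines(lines):
    """Map filename -> expected digest, assuming 'checksum  filename' lines."""
    expected = {}
    for line in lines:
        parts = line.split()
        if len(parts) >= 2:
            # md5sum marks binary mode with a leading '*' on the filename.
            expected[parts[-1].lstrip("*")] = parts[0].lower()
    return expected


def verify(md5_path="md5sum.txt"):
    """Return {filename: True/False} for every file named in md5_path."""
    with open(md5_path) as handle:
        expected = parse_md5sum_lines(handle)
    return {name: md5_of_file(name) == digest
            for name, digest in expected.items()}
```

Calling verify() returns a dictionary with one entry per listed file; a
False value indicates a file that should be downloaded again.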