This directory contains FASTA files of human genomic sequence for the
ENCODE regions, from the March 2006 (UCSC hg18, NCBI Build 36) build of
the human genome.

The sequences are in the files:

    hg18.fa.gz      unmasked sequence (all upper case)
    hg18.msk.fa.gz  soft-masked sequence (repeats in lower case)

In July 2007, the ENCODE project transitioned from the previous
reference build (Build 35) to this one.

For background on the ENCODE project, see:

    NHGRI: The ENCODE Project: ENCyclopedia Of DNA Elements
    http://www.genome.gov/10005107

For the list of primary and backup regions, see:

    ENCODE Target Regions
    http://genome.ucsc.edu/ENCODE/regions.html
Name              Last modified      Size  Description
Parent Directory                        -
hg18.fa.gz        2007-09-21 11:00   8.7M
hg18.msk.fa.gz    2007-09-21 11:00   9.3M
md5sum.txt        2007-09-21 11:00     94
hg18_count.txt    2007-09-21 11:01    626
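
Because the soft-masked file marks repeats in lower case, the masked fraction of each sequence can be estimated by counting lower-case letters. The following is a minimal sketch, not part of the original distribution: the function names (read_fasta, masked_fraction, summarize) are hypothetical, and it assumes the files are ordinary gzip-compressed FASTA with ">" header lines whose first whitespace-delimited token is the record name.

```python
import gzip
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from an iterable of FASTA text lines."""
    name = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield name, "".join(chunks)
            fields = line[1:].split()
            name = fields[0] if fields else ""
            chunks = []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def masked_fraction(seq: str) -> float:
    """Return the fraction of characters in lower case (soft-masked)."""
    if not seq:
        return 0.0
    lower = sum(1 for base in seq if base.islower())
    return lower / len(seq)


def summarize(path: str) -> Dict[str, float]:
    """Map each record name in a gzipped FASTA file to its masked fraction."""
    with gzip.open(path, "rt") as handle:
        return {name: masked_fraction(seq) for name, seq in read_fasta(handle)}
```

Assuming hg18.msk.fa.gz has been downloaded to the working directory, summarize("hg18.msk.fa.gz") would return one entry per sequence. Running the same function on hg18.fa.gz should give 0.0 for every record if that file is entirely upper case, as described above.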