cytoBandIdeo Chromosome Band (Ideogram) bed 4 + Chromosome Bands Localized by FISH Mapping Clones (for Ideogram) 1 0.1 0 0 0 127 127 127 0 0 0 map 1 group map\ longLabel Chromosome Bands Localized by FISH Mapping Clones (for Ideogram)\ priority .1\ shortLabel Chromosome Band (Ideogram)\ track cytoBandIdeo\ type bed 4 +\ visibility dense\ netRBestGorGor1 Gorilla RBest Net netAlign gorGor1 chainGorGor1 Gorilla (Oct. 2008 (Sanger 0.1/gorGor1)) Reciprocal Best Alignment Net 0 1 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Reciprocal Best Alignment Net\ otherDb gorGor1\ parent rBestNet\ priority 1\ shortLabel $o_Organism RBest Net\ spectrum on\ track netRBestGorGor1\ type netAlign gorGor1 chainGorGor1\ visibility hide\ netSyntenyPanTro2 Chimp Syn Net netAlign panTro2 chainPanTro2 Chimp (Mar. 2006 (CGSC 2.1/panTro2)) Syntenic Alignment Net 0 1 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb panTro2\ parent syntenicNet\ priority 1\ shortLabel $o_Organism Syn Net\ spectrum on\ track netSyntenyPanTro2\ type netAlign panTro2 chainPanTro2\ visibility hide\ encodeEgaspPartAceCons ACEScan Cons Alt genePred ACEScan Conserved Alternative Exon Predictions 0 1 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel ACEScan Conserved Alternative Exon Predictions\ parent encodeEgaspPartial\ priority 1\ shortLabel ACEScan Cons Alt\ track encodeEgaspPartAceCons\ encodeEgaspFullAceview AceView genePred AceView Gene Predictions 0 1 22 150 20 138 202 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 22,150,20\ longLabel AceView Gene Predictions\ parent encodeEgaspFull\ priority 1\ shortLabel AceView\ track encodeEgaspFullAceview\ encodeAffyChIpHl60PvalBrg1Hr00 Affy Brg1 RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 0hrs) P-Value 0 1 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 1\ shortLabel Affy Brg1 RA 0h\ subGroups factor=Brg1 time=0h\ track encodeAffyChIpHl60PvalBrg1Hr00\ encodeTransFragsAffyDistal Affy Dist bed 4 Affy Distal 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Affy Distal\ parent encodeTransFrags\ priority 1\ shortLabel Affy Dist\ track encodeTransFragsAffyDistal\ affyExonTissues Affy Exon Tissues expRatio Affymetrix Exon Array 1.0: Normal Tissues 2 1 0 0 0 127 127 127 0 0 0
\ RNA (from a commercial source) from 11 tissues were hybridized to Affymetrix \ Human Exon 1.0 ST arrays. For each tissue, 3 replicate experiments were \ performed \ for a total of 33 arrays. The raw intensity signal from the arrays\ was normalized\ with a quantile normalization method, then run through the PLIER algorithm.\ The normalized data were then converted to median-centered log-ratios, \ which are displayed as green for negative log-ratios (below-median expression),\ and red for positive (above-median expression).
\ \The probe sets for this microarray track are shown in the \ the Affy Exon Probes track.
\ \\ This track was produced by Andy Pohl, Kayla Smith, and Pauline Fujita of the \ genome browser group at UCSC, Melissa Cline of the \ Ares lab at UCSC, and \ Chuck Sugnet at Affymetrix, based on \ \ sample exon array data available from Affymetrix, produced by Tyson Clark. \
\ \\ Pohl AA, Sugnet CW, Clark TA, Smith K, Fujita PA, Cline MS.\ Affy Exon Tissues: Exon Levels in Normal Tissues in Human, Mouse, and Rat.\ Bioinformatics.2009 Sept 15;25(18):2442-3. \
\\ The chromosome band track represents the approximate \ location of bands seen on Giemsa-stained chromosomes.\ Chromosomes are displayed in the browser with the short arm first. \ Cytologically identified bands on the chromosome are numbered outward \ from the centromere on the short (p) and long (q) arms. At low resolution, \ bands are classified using the nomenclature \ [chromosome][arm][band], where band is a \ single digit. Examples of bands on chromosome 3 include 3p2, 3p1, cen, 3q1, \ and 3q2. At a finer resolution, some of the bands are subdivided into \ sub-bands, adding a second digit to the band number, e.g. 3p26. This \ resolution produces about 500 bands. A final subdivision into a \ total of 862 sub-bands is made by adding a period and another digit to the \ band, resulting in 3p26.3, 3p26.2, etc.
\ \\ A full description of the method by which the chromosome band locations are \ estimated can be found in Furey and Haussler, 2003.\
\ Barbara Trask, Vivian Cheung, Norma Nowak and others in the BAC Resource\ Consortium used fluorescent in-situ hybridization (FISH) to determine a \ cytogenetic location for large genomic clones on the chromosomes.\ The results from these experiments are the primary source of information used\ in estimating the chromosome band locations.\ For more information about the process, see the paper, BAC Resource Consortium, \ et al., 2001. and the accompanying web site,\ Human BAC Resource.
\\ BAC clone placements in the human sequence are determined at UCSC using a \ combination of full BAC clone sequence, BAC end sequence, and STS marker \ information.
\ \\ We would like to thank all the labs that have contributed to this resource:\
\ Furey TS and Haussler D.\ Integration of the cytogenetic map with the draft human genome \ sequence, Hum Molec Genet. 2003;12(9):1037-44.\
\ \\ BAC Resource Consortium, Cheung VG, Nowak N, Jang W, Kirsch IR, Zhao S, \ Chen XN, Furey TS, Kim UJ, Kuo WL et al..\ Integration of cytogenetic landmarks into the draft sequence of\ the human genome, Nature. 2001;409:953-98.\
\ \ map 1 group map\ longLabel Chromosome Bands Localized by FISH Mapping Clones\ priority 1\ shortLabel Chromosome Band\ track cytoBand\ type bed 4 +\ visibility hide\ snp131Common Common SNPs (131) bed 6 + Simple Nucleotide Polymorphisms (dbSNP build 131) from HapMap and/or 1000Genomes 3 1 0 0 0 127 127 127 0 0 0 http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?type=rs&rs=$$ varRep 1 longLabel Simple Nucleotide Polymorphisms (dbSNP build 131) from HapMap and/or 1000Genomes\ parent snp131Composite\ priority 1\ shortLabel Common SNPs (131)\ subGroups view=common\ track snp131Common\ visibility pack\ encodePseudogeneConsensus Consensus Pseudogenes genePred Consensus of Yale, Havana-Gencode, UCSC and GIS ENCODE Pseudogenes 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, http://vega.sanger.ac.uk/Homo_sapiens/geneview?transcript=$$ encodeGenes 1 longLabel Consensus of Yale, Havana-Gencode, UCSC and GIS ENCODE Pseudogenes\ parent encodePseudogene\ priority 1\ shortLabel Consensus Pseudogenes\ track encodePseudogeneConsensus\ url http://vega.sanger.ac.uk/Homo_sapiens/geneview?transcript=$$\ url2 http://www.pseudogene.org/cgi-bin/search-results.cgi?tax_id=9606&set_search=25&criterion0=pgene_by_acc&operator0=%3D&sort=0&output=html&searchValue0_0=$$\ url2Label Yale Pseudogene Link:\ urlLabel Vega Genes Link:\ urlName gene\ kiddEichlerDiscAbc14 Discordant ABC14 bed 12 HGSV Individual ABC14 (CEPH) Discordant Clone End Alignments 0 1 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC14 (CEPH) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 1\ shortLabel Discordant ABC14\ track kiddEichlerDiscAbc14\ encodeDNDSsmall dN/dS 0.0 to 0.2 bed 4 + ENCODE Exons dN/dS 0.0 to 0.2 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel ENCODE Exons dN/dS 0.0 to 0.2\ parent encodeDNDS\ priority 1\ shortLabel dN/dS 0.0 to 0.2\ track encodeDNDSsmall\ encodeNhgriDnaseHsNonAct DNase CD4 Unact. bed 5 . NHGRI DNaseI-Hypersensitive Sites (CD4+ T-Cells Unactivated) 0 1 0 0 0 127 127 127 1 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 longLabel NHGRI DNaseI-Hypersensitive Sites (CD4+ T-Cells Unactivated)\ parent encodeNhgriDnaseHs\ priority 1\ shortLabel DNase CD4 Unact.\ track encodeNhgriDnaseHsNonAct\ encodeAffyEc1BrainCerebellumSignal EC1 Sgnl BrainC wig 0 62385 Affy Ext Trans Signal (1-base window) (Brain Cerebellum) 0 1 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (1-base window) (Brain Cerebellum)\ parent encodeAffyEcSignal\ priority 1\ shortLabel EC1 Sgnl BrainC\ track encodeAffyEc1BrainCerebellumSignal\ encodeAffyEc1BrainCerebellumSites EC1 Sites BrainC bed 3 . Affy Ext Trans Sites (1-base window) (Brain Cerebellum) 0 1 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (1-base window) (Brain Cerebellum)\ parent encodeAffyEcSites\ priority 1\ shortLabel EC1 Sites BrainC\ track encodeAffyEc1BrainCerebellumSites\ eioJcviNASPos EIO/JCVI CD34+ NAS bed 3 . CD34+ cells Nuclease Accessible sites 0 1 100 30 150 177 142 202 0 0 0 regulation 1 color 100,30,150\ longLabel CD34+ cells Nuclease Accessible sites\ parent eioJcviNAS\ priority 1\ shortLabel EIO/JCVI CD34+ NAS\ track eioJcviNASPos\ encodeUncFaireSignal FAIRE Signal bedGraph 4 University of North Carolina FAIRE Signal 0 1 20 150 20 50 100 50 0 0 21 chr1,chr4,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chr10,chrX, encodeChrom 0 longLabel University of North Carolina FAIRE Signal\ parent encodeUncFaire\ priority 1\ shortLabel FAIRE Signal\ track encodeUncFaireSignal\ fox2ClipSeq FOX2 CLIP-seq bed 9 . FOX2 adaptor-trimmed CLIP-seq reads 3 1 0 0 0 127 127 127 0 0 0 regulation 1 itemRgb on\ longLabel FOX2 adaptor-trimmed CLIP-seq reads\ noInherit on\ noScoreFilter .\ parent fox2ClipSeqCompViewreads\ priority 1\ shortLabel FOX2 CLIP-seq\ subGroups view=reads\ track fox2ClipSeq\ type bed 9 .\ encodeAllGencodeExonic Gencode Exonic bed 4 Consensus Gencode Exonic 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Consensus Gencode Exonic\ parent encodeWorkshopSelections\ priority 1\ shortLabel Gencode Exonic\ track encodeAllGencodeExonic\ encodeGencodeGeneKnownMar07 Gencode Ref genePred Gencode Reference Genes 0 1 33 91 51 144 173 153 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 33,91,51\ longLabel Gencode Reference Genes\ parent encodeGencodeGeneMar07\ priority 1\ shortLabel Gencode Ref\ track encodeGencodeGeneKnownMar07\ encodeGencodeGeneKnownOct05 Gencode Ref genePred Gencode Reference Genes 0 1 33 91 51 144 173 153 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 33,91,51\ longLabel Gencode Reference Genes\ parent encodeGencodeGeneOct05\ priority 1\ shortLabel Gencode Ref\ track encodeGencodeGeneKnownOct05\ encodeGisRnaPetMCF7 GIS RNA MCF7 bed 12 Gene Identification Signature Paired-End Tags of PolyA+ RNA (log phase MCF7) 0 1 0 0 0 127 127 127 0 0 23 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 longLabel Gene Identification Signature Paired-End Tags of PolyA+ RNA (log phase MCF7)\ parent encodeGisRnaPet\ priority 1\ shortLabel GIS RNA MCF7\ track encodeGisRnaPetMCF7\ snpRecombHotspotHapmap HapMap bed 3 . Oxford Recombination Hotspots from HapMap Phase I Release 16c.1 0 1 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Oxford Recombination Hotspots from HapMap Phase I Release 16c.1\ parent snpRecombHotspot\ priority 1\ shortLabel HapMap\ track snpRecombHotspotHapmap\ encodeHapMapCovCEU HapMap Cov CEU wig 0.0 100.0 HapMap Resequencing Coverage CEPH (CEU) 0 1 0 0 0 127 127 127 0 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 0 longLabel HapMap Resequencing Coverage CEPH (CEU)\ parent encodeHapMapCov\ priority 1\ shortLabel HapMap Cov CEU\ track encodeHapMapCovCEU\ snpRecombRateHapmap HapMap Phase I bedGraph 4 Oxford Recombination Rates from HapMap Phase I Release 16c.1 0 1 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Oxford Recombination Rates from HapMap Phase I Release 16c.1\ parent snpRecombRate\ priority 1\ shortLabel HapMap Phase I\ track snpRecombRateHapmap\ hapmapSnpsASW HapMap SNPs ASW bed 6 + HapMap SNPs from the ASW Population (African Ancestry in SouthWestern United States) 0 1 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the ASW Population (African Ancestry in SouthWestern United States)\ parent hapmapSnps\ priority 1\ shortLabel HapMap SNPs ASW\ track hapmapSnpsASW\ hgdpHzyAfrica Hetzgty Africa bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (Africa) 0 1 224 0 0 239 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 224, 0, 0\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (Africa)\ parent hgdpHzy\ priority 1\ shortLabel Hetzgty Africa\ track hgdpHzyAfrica\ hgdpIhsBantu iHS Bantu bedGraph 4 Human Genome Diversity Project iHS (Bantu populations in Africa) 0 1 224 0 0 239 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 color 224, 0, 0\ longLabel Human Genome Diversity Project iHS (Bantu populations in Africa)\ parent hgdpIhs\ priority 1\ shortLabel iHS Bantu\ track hgdpIhsBantu\ encodeGencodeIntronsProximal Intronic Prox bed 4 . Gencode Intronic Proximal Regions 0 1 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Gencode Intronic Proximal Regions\ parent encodeGencodeRegions\ priority 1\ shortLabel Intronic Prox\ track encodeGencodeIntronsProximal\ iscaRetrospectiveBenign ISCA Ret Benign gvf Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Benign) 0 1 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Benign)\ parent iscaRetrospectiveComposite\ priority 1\ shortLabel ISCA Ret Benign\ track iscaRetrospectiveBenign\ L1_LINE L1_LINE bed 5 L1 LINEs for Intersection 0 1 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel L1 LINEs for Intersection\ parent encodeWorkshopIntersections\ priority 1\ shortLabel L1_LINE\ track L1_LINE\ hapmapLdYri LD YRI bed 4 + Linkage Disequilibrium for the Yoruba (YRI) 0 1 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Linkage Disequilibrium for the Yoruba (YRI)\ parent hapmapLd\ priority 1\ shortLabel LD YRI\ track hapmapLdYri\ encodeUcsdChipHeLaH3H4dmH3K4_p0 LI H3K4me2 -gIF bedGraph 4 Ludwig Institute ChIP-chip: H3K4me2 ab, HeLa cells, no gamma interferon 0 1 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3K4me2 ab, HeLa cells, no gamma interferon\ parent encodeLIChIPgIF\ priority 1\ shortLabel LI H3K4me2 -gIF\ track encodeUcsdChipHeLaH3H4dmH3K4_p0\ encodeUcsdNgHeLaH3K4me3_p0 LI Ng H3K4m3 -gIF bedGraph 4 Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, H3K4me3, no gamma interferon 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, H3K4me3, no gamma interferon\ parent encodeUcsdNgGif\ priority 1\ shortLabel LI Ng H3K4m3 -gIF\ track encodeUcsdNgHeLaH3K4me3_p0\ encodeUcsdChipRnapHela_f LI Pol2 HeLa bedGraph 4 Ludwig Institute ChIP-chip: Pol2 8WG16 ab, HeLa cells 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Ludwig Institute ChIP-chip: Pol2 8WG16 ab, HeLa cells\ parent encodeLIChIP\ priority 1\ shortLabel LI Pol2 HeLa\ track encodeUcsdChipRnapHela_f\ jaxQtlAsIs MGI Mouse QTL bed 4 . MGI Mouse QTLs Coarsely Mapped to Human 0 1 200 100 0 227 177 127 0 0 0 http://www.informatics.jax.org/searches/accession_report.cgi?id=$$ phenDis 1 color 200,100,0\ longLabel MGI Mouse QTLs Coarsely Mapped to $Organism\ parent jaxQtlMapped\ priority 1\ shortLabel MGI Mouse QTL\ track jaxQtlAsIs\ encodeMlaganPhastConsEl MLAGAN PhastCons bed 5 . MLAGAN PhastCons Conserved Elements 0 1 170 100 50 212 177 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 170,100,50\ longLabel MLAGAN PhastCons Conserved Elements\ parent encodeMlaganElements\ priority 1\ shortLabel MLAGAN PhastCons\ track encodeMlaganPhastConsEl\ encodeMlaganPhastCons MLAGAN PhastCons wig 0.0 1.0 MLAGAN PhastCons Conservation 0 1 170 100 50 212 177 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 0 autoScale Off\ color 170,100,50\ longLabel MLAGAN PhastCons Conservation\ maxHeightPixels 100:25:11\ noInherit on\ parent encodeMlaganCons\ priority 1\ shortLabel MLAGAN PhastCons\ track encodeMlaganPhastCons\ type wig 0.0 1.0\ windowingFunction mean\ encodeNhgriDnaseHs2 NHGRI DNAseI HS bed 4 NHGRI DNAseI HS 0 1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel NHGRI DNAseI HS\ parent encodeDnase\ priority 1\ shortLabel NHGRI DNAseI HS\ track encodeNhgriDnaseHs2\ nimhBipolarUs NIMH Bipolar Us chromGraph NIMH Bipolar disorder (US) -log10 P-value 0 1 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel NIMH Bipolar disorder (US) -log10 P-value\ parent nimhBipolar\ priority 1\ shortLabel NIMH Bipolar Us\ track nimhBipolarUs\ numtS NumtS bed 6 . Human NumtS 0 1 0 60 120 127 157 187 1 0 0\ NumtS (Nuclear mitochondrial sequences) are mitochondrial fragments inserted in nuclear genomic sequences. The most credited hypothesis concerning their generation suggests that in presence of mutagenic agents or under stress conditions fragments of mtDNA escape from mitochondria, reach the nucleus and insert into chromosomes during break repair, although NumtS can derive from duplication of genomic fragments. NumtS may be cause of contamination during human mtDNA sequencing and hence frequent false low heteroplasmic evidences have been reported.\ The Bioinformatics group chaired by M.Attimonelli (Bari, Italy) has produced the RHNumtS compilation annotating more than 500 Human NumtS. To allow the scientific community to access to the compilation and to perform genomics comparative analyses inclusive of the NumtS data, the group has designed the Human NumtS tracks below described.\
\ \\ The NumtS tracks show the High Score Pairs (HSPs) obtained by aligning the mitochondrial reference genome (NC_012920) with the hg18 release of the human genome.\
\\ The "NumtS mitochondrial sequences" track shows the mapping of the HSPs returned by BlastN on the nuclear genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the mitochondrial mapping is provided, thus allowing a fast cross among the NumtS genomic contexts.\
\\ The "NumtS assembled" track shows items obtained by assembling HSPs annotated in the "NumtS" track fulfilling the following conditions:\
\ Exceptions for the second condition arise when a long repetitive element is present between two HSPs.\
\\ The "NumtS on mitochondrion" track shows the mapping of the HSPs on the mitochondrial genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the nuclear mapping is provided.\
\ \\ The "NumtS on mitochondrion with chromosome placement" shows the mapping of the HSPs on the mitochondrial genome, but the items are coloured according to the colours assigned to each human chromosome on the UCSC genome browser. No shading is here provided. For every item, a link pointing to the nuclear mapping is provided.\
\\ NumtS mappings were obtained by running Blast2seq (program: BlastN) between each chromosome of of the Human Genome hg18 build and the human mitochondrial reference sequence (rCRS, AC: NC_012920), fixing the e-value threshold to 1e-03. The assembling of the HSPs was performed with spreadsheet interpolation and manual inspection.\
\ \\ NumtS predicted in silico were validated by carrying out PCR amplification and sequencing on blood-extracted DNA of a healthy individual of European origin. PCR amplification was successful for 275 NumtS and provided amplicons of the expected length. All PCR fragments were sequenced on both strands, and submitted to the EMBL databank.\
\\ Furthermore, 541 NumtS were validated by merging NumtS nuclear coordinates with HapMap annotations. Our analysis has been carried on eight HapMap individuals (NA18517, NA18507, NA18956, NA19240, NA18555, NA12878, NA19129, NA12156). For each sample, clones with a single best concordant placement (according to the fosmid end-sequence-pair analysis described in Kidd et al., 2008), have been considered. The analysis showed that 541 NumtS (at least 30bp for each one) had been sequenced in such samples.\
\ \\ These data were provided by Domenico Simone and Marcella Attimonelli at Department of Biochemistry and Molecular Biology "Ernesto Quagliariello" (University of Bari, Italy). Primer designing was carried out by Francesco Calabrese and Giuseppe Mineccia. PCR validation was carried out by Martin Lang, Domenico Simone and Giuseppe Gasparre. Merging with HapMap annotations has been performed by Domenico Simone.\
\ \\ Simone D, Calabrese FM, Lang M, Gasparre G, Attimonelli M: Validation and UCSC tracks of the extended RHNumtS compilation (submitted). \
\ \\ Lascaro D, Castellana S, Gasparre G, Romeo G, Saccone S, Attimonelli M. The RHNumtS compilation: features and bioinformatics approaches to locate and quantify Human NumtS. BMC\ Genomics. 2008 June 3;9:267.\
\ \\ Kidd JM, Cooper GM, Donahue WF, et al.\ \ Mapping and sequencing of structural variation from eight human genomes.\ Nature, 2008, 453(7191):56-64.\
\ \ \ \ varRep 1 color 0,60,120\ html numtSeq\ longLabel Human NumtS\ parent numtSeq\ priority 1\ shortLabel NumtS\ track numtS\ type bed 6 .\ useScore 1\ hapmapLdPhYri Phased YRI ld2 Linkage Disequilibrium for the Yoruba (YRI) from phased genotypes 0 1 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 longLabel Linkage Disequilibrium for the Yoruba (YRI) from phased genotypes\ parent hapmapLdPh\ priority 1\ shortLabel Phased YRI\ track hapmapLdPhYri\ encodeGencodeRaceFragsPrimer RACEfrags Primer genePred Gencode 5' RACE primer 0 1 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 longLabel Gencode 5' RACE primer\ parent encodeGencodeRaceFrags\ priority 1\ shortLabel RACEfrags Primer\ track encodeGencodeRaceFragsPrimer\ encodeRikenCagePlus Riken CAGE + bedGraph 4 Riken CAGE Plus Strand - Predicted Gene Start Sites 0 1 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 109,51,43\ longLabel Riken CAGE Plus Strand - Predicted Gene Start Sites\ parent encodeRikenCage\ priority 1\ shortLabel Riken CAGE +\ track encodeRikenCagePlus\ encodeRikenCageMappedTagsPositive Riken CAGE MT + bedGraph 4 Riken CAGE Mapped Tags overlap count, Plus strand - TEST TRACK ONLY 0 1 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 109,51,43\ longLabel Riken CAGE Mapped Tags overlap count, Plus strand - TEST TRACK ONLY\ parent encodeRikenCageMappedTagsScore\ priority 1\ shortLabel Riken CAGE MT +\ track encodeRikenCageMappedTagsPositive\ decodeSexAveraged Sex Average bigWig 0.0 108.804 deCODE recombination map, sex-average 2 1 109 51 43 182 153 149 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 109,51,43\ configurable on\ longLabel deCODE recombination map, sex-average\ parent avgView\ priority 1\ shortLabel Sex Average\ subGroups view=avg\ track decodeSexAveraged\ type bigWig 0.0 108.804\ cnpSharp Sharp CNPs bed 4 + Copy Number Polymorphisms from BAC Microarray Analysis (Sharp) 0 1 0 0 0 127 127 127 0 0 0\ This track shows 160 regions detected as putative copy number \ polymorphisms by BAC microarray analysis in a population of 47 individuals, \ comprising 8 Chinese, 4 Japanese, 10 Czech, 2 Druze, 7 Biaka, 9 Mbuti, \ and 7 Amerindians.
\ \\ Hybridizations were all performed in duplicate incorporating a dye-reversal \ using a custom array consisting of 2194 end-sequence or FISH confirmed BACs, \ targeted to regions of the genome flanked by segmental duplications. \ The false positive rate was estimated as ~3 clones per 4,000 tested.
\ \\ Sharp, A.J., Locke D..P, McGrath S.D., Cheng Z., Bailey J.A., Samonte R.V., \ Pertz L.M., Clark R.A., Schwartz S., Segraves R., Oseroff V.V., Albertson D.G., \ Pinkel D. and Eichler E..E \ Segmental duplications and copy number variation in the human genome. \ Am J Hum Genet 77(1), 78-88 (2005).
\ varRep 1 longLabel Copy Number Polymorphisms from BAC Microarray Analysis (Sharp)\ noInherit on\ parent cnp\ priority 1\ shortLabel Sharp CNPs\ track cnpSharp\ type bed 4 +\ encodeSangerChipH3K4me1 SI H3K4m1 GM06990 bedGraph 4 Sanger Institute ChIP/Chip (H3K4me1 ab, GM06990 cells) 0 1 10 10 130 132 132 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,130\ longLabel Sanger Institute ChIP/Chip (H3K4me1 ab, GM06990 cells)\ parent encodeSangerChipH3H4\ priority 1\ shortLabel SI H3K4m1 GM06990\ track encodeSangerChipH3K4me1\ tajdSnpAd SNPs AD bed 4 . SNPs from African Descent 0 1 200 100 0 0 100 200 0 0 0 varRep 1 altColor 0,100,200\ color 200,100,0\ longLabel SNPs from African Descent\ parent tajdSnp\ priority 1\ shortLabel SNPs AD\ track tajdSnpAd\ stanfordChipGMO6990GABP Stan GMO6690 GABP bedGraph 4 Stanford ChIP-chip (GMO6990 cells, GABP ChIP) 0 1 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (GMO6990 cells, GABP ChIP)\ parent stanfordChip\ priority 1\ shortLabel Stan GMO6690 GABP\ track stanfordChipGMO6990GABP\ encodeStanfordChipGMO6990GABP Stan GMO6690 GABP bedGraph 4 Stanford ChIP-chip (GMO6990 cells, GABP ChIP) 0 1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (GMO6990 cells, GABP ChIP)\ parent encodeStanfordChipJohnson\ priority 1\ shortLabel Stan GMO6690 GABP\ track encodeStanfordChipGMO6990GABP\ encodeStanfordChipHCT116Sp1 Stan HCT116 Sp1 bedGraph 4 Stanford ChIP-chip (HCT116 cells, Sp1 ChIP) 0 1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HCT116 cells, Sp1 ChIP)\ parent encodeStanfordChip\ priority 1\ shortLabel Stan HCT116 Sp1\ track encodeStanfordChipHCT116Sp1\ encodeStanfordMethBe2C Stan Meth Be2C bedGraph 4 Stanford Methylation Digest (Be2C cells) 0 1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (Be2C cells)\ parent encodeStanfordMeth\ priority 1\ shortLabel Stan Meth Be2C\ track encodeStanfordMethBe2C\ encodeStanfordMethSmoothedBe2C Stan Meth Sc Be2C bedGraph 4 Stanford Methylation Digest Smoothed Score (Be2C cells) 0 1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (Be2C cells)\ parent encodeStanfordMethSmoothed\ priority 1\ shortLabel Stan Meth Sc Be2C\ track encodeStanfordMethSmoothedBe2C\ encodeStanfordPromotersAGS Stan Pro AGS bed 9 + Stanford Promoter Activity (AGS cells) 0 1 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (AGS cells)\ parent encodeStanfordPromoters\ priority 1\ shortLabel Stan Pro AGS\ track encodeStanfordPromotersAGS\ encodeStanfordChipSmoothedHCT116Sp1 Stan Sc HCT116 Sp1 bedGraph 4 Stanford ChIP-chip Smoothed Score (HCT116 cells, Sp1 ChIP) 0 1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip Smoothed Score (HCT116 cells, Sp1 ChIP)\ parent encodeStanfordChipSmoothed\ priority 1\ shortLabel Stan Sc HCT116 Sp1\ track encodeStanfordChipSmoothedHCT116Sp1\ encodeStanfordNRSFEnriched Stanf NRSF Enriched bed 6 . Stanford NRSF/REST Enriched 0 1 0 128 0 127 191 127 0 0 23 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,chrY,chrM, encodeChip 1 color 0,128,0\ longLabel Stanford NRSF/REST Enriched\ parent encodeStanfordNRSF\ priority 1\ shortLabel Stanf NRSF Enriched\ track encodeStanfordNRSFEnriched\ encodeTbaPhastCons TBA PhastCons wig 0.0 1.0 TBA PhastCons Conservation 0 1 170 100 50 212 177 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 0 autoScale Off\ color 170,100,50\ longLabel TBA PhastCons Conservation\ maxHeightPixels 100:25:11\ noInherit on\ parent encodeTbaCons\ priority 1\ shortLabel TBA PhastCons\ track encodeTbaPhastCons\ type wig 0.0 1.0\ windowingFunction mean\ encodeTbaPhastConsEl TBA PhastCons bed 5 . TBA PhastCons Conserved Elements 0 1 170 100 50 212 177 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 170,100,50\ longLabel TBA PhastCons Conserved Elements\ parent encodeTbaElements\ priority 1\ shortLabel TBA PhastCons\ track encodeTbaPhastConsEl\ hiSeqDepthTopPt1Pct Top 0.001 Depth bed 3 Top 0.001 of Read Depth Distribution 0 1 139 69 19 197 162 137 0 0 0 map 1 longLabel Top 0.001 of Read Depth Distribution\ parent hiSeqDepth\ priority 1\ shortLabel Top 0.001 Depth\ track hiSeqDepthTopPt1Pct\ encodeAllUnionEl Union bed 5 . TBA and MLAGAN PhastCons/BinCons/GERP Union Conserved Elements 0 1 80 70 180 167 162 217 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,70,180\ longLabel TBA and MLAGAN PhastCons/BinCons/GERP Union Conserved Elements\ parent encodeAllElements\ priority 1\ shortLabel Union\ track encodeAllUnionEl\ encodeUtexChipHeLaMycRaw UT Myc HeLa bedGraph 4 University of Texas, Austin ChIP-chip (c-Myc, HeLa) 0 1 120 30 50 187 142 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 120,30,50\ longLabel University of Texas, Austin ChIP-chip (c-Myc, HeLa)\ parent encodeUtexChip\ priority 1\ shortLabel UT Myc HeLa\ subGroups dataType=raw\ track encodeUtexChipHeLaMycRaw\ encodeUvaDnaRep0 UVa DNA Rep 0h bed 3 . University of Virginia Temporal Profiling of DNA Replication (0-2 hrs) 0 1 60 75 60 10 130 10 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 longLabel University of Virginia Temporal Profiling of DNA Replication (0-2 hrs)\ parent encodeUvaDnaRep\ priority 1\ shortLabel UVa DNA Rep 0h\ track encodeUvaDnaRep0\ encodeUvaDnaRepEarly UVa DNA Rep Early bed 3 . University of Virginia Temporal Profiling of DNA Replication (Early) 0 1 50 50 100 152 152 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 color 50,50,100\ longLabel University of Virginia Temporal Profiling of DNA Replication (Early)\ parent encodeUvaDnaRepSeg\ priority 1\ shortLabel UVa DNA Rep Early\ track encodeUvaDnaRepEarly\ encodeUvaDnaRepOriginsNSGM UVa Ori-NS GM bed 3 . University of Virginia DNA Replication Origins, Ori-NS, GM06990 0 1 205 0 0 230 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 color 205,0,0\ dataVersion May 2007\ longLabel University of Virginia DNA Replication Origins, Ori-NS, GM06990\ origAssembly hg17\ parent encodeUvaDnaRepOrigins\ priority 1\ shortLabel UVa Ori-NS GM\ track encodeUvaDnaRepOriginsNSGM\ kiddEichlerValidAbc14 Validated ABC14 bed 9 HGSV Individual ABC14 (CEPH) Validated Sites of Structural Variation 0 1 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual ABC14 (CEPH) Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 1\ shortLabel Validated ABC14\ track kiddEichlerValidAbc14\ hgdpXpehhBantu XP-EHH Bantu bedGraph 4 Human Genome Diversity Project XP-EHH (Bantu populations in Africa) 0 1 224 0 0 239 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 224, 0, 0\ longLabel Human Genome Diversity Project XP-EHH (Bantu populations in Africa)\ parent hgdpXpehh\ priority 1\ shortLabel XP-EHH Bantu\ track hgdpXpehhBantu\ encodeYaleChIPSTAT1HeLaMaskLess36mer36bpPval Yale 36-36 PVal bedGraph 4 Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 36-mer, 36bp Win, P-Values 0 1 50 50 200 152 152 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,50,200\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 36-mer, 36bp Win, P-Values\ parent encodeYaleChIPSTAT1Pval\ priority 1\ shortLabel Yale 36-36 PVal\ track encodeYaleChIPSTAT1HeLaMaskLess36mer36bpPval\ encodeYaleChIPSTAT1HeLaMaskLess36mer36bpSig Yale 36-36 Sig bedGraph 4 Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 36-mer, 36bp Win, Signal 0 1 112 63 175 224 66 81 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 altColor 224,66,81\ color 112,63,175\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 36-mer, 36bp Win, Signal\ parent encodeYaleChIPSTAT1Sig\ priority 1\ shortLabel Yale 36-36 Sig\ track encodeYaleChIPSTAT1HeLaMaskLess36mer36bpSig\ encodeYaleChIPSTAT1HeLaMaskLess36mer36bpSite Yale 36-36 Sites bed . Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 36-mer, 36bp Win, Binding Sites 0 1 200 50 50 50 50 200 0 0 18 chr1,chr10,chr11,chr13,chr14,chr15,chr16,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 altColor 50,50,200\ color 200,50,50\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 36-mer, 36bp Win, Binding Sites\ parent encodeYaleChIPSTAT1Sites\ priority 1\ shortLabel Yale 36-36 Sites\ track encodeYaleChIPSTAT1HeLaMaskLess36mer36bpSite\ encodeYaleMASNB4RNANprotTMFWDMless36mer36bp Yale NB4 NgF RNA bedGraph 4 Yale NB4 RNA Trans Map, MAS Array, Forward Direction, NimbleGen Protocol 0 1 200 50 50 50 50 200 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 0 altColor 50,50,200\ color 200,50,50\ longLabel Yale NB4 RNA Trans Map, MAS Array, Forward Direction, NimbleGen Protocol\ parent encodeYaleMASPlacRNATransMap\ priority 1\ shortLabel Yale NB4 NgF RNA\ track encodeYaleMASNB4RNANprotTMFWDMless36mer36bp\ encodeYaleMASNB4RNANProtTarsFWDMless36mer36bp Yale NB4 NgF TAR bed 6 . Yale NB4 RNA TARs, MAS array, Forward Direction, NimbleGen Protocol 0 1 200 50 50 50 50 200 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 1 altColor 50,50,200\ color 200,50,50\ longLabel Yale NB4 RNA TARs, MAS array, Forward Direction, NimbleGen Protocol\ parent encodeYaleMASPlacRNATars\ priority 1\ shortLabel Yale NB4 NgF TAR\ track encodeYaleMASNB4RNANProtTarsFWDMless36mer36bp\ encodeYaleAffyNeutRNATransMapAll Yale RNA Neu Sum wig -2730 3394 Yale Neutrophil RNA Transcript Map, Summary 0 1 50 70 50 152 162 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,70,50\ longLabel Yale Neutrophil RNA Transcript Map, Summary\ parent encodeYaleAffyRNATransMap\ priority 1\ shortLabel Yale RNA Neu Sum\ subGroups celltype=neutro samples=summary\ track encodeYaleAffyNeutRNATransMapAll\ encodeYaleAffyNeutRNATarsAll Yale TAR Neu Sum bed 3 . Yale Neutrophil RNA Transcriptionally Active Region (TAR), Summary 0 1 50 70 50 152 162 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,70,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region (TAR), Summary\ parent encodeYaleAffyRNATars\ priority 1\ shortLabel Yale TAR Neu Sum\ subGroups celltype=neutro samples=summary\ track encodeYaleAffyNeutRNATarsAll\ hapmapLdHotspotYRI YRI bedGraph 4 Hotspots of Linkage Disequilibrium in the Yoruban HapMap (YRI) 0 1 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Hotspots of Linkage Disequilibrium in the Yoruban HapMap (YRI)\ parent hapmapLdHotspot\ priority 1\ shortLabel YRI\ track hapmapLdHotspotYRI\ netRBestCalJac1 Marmoset RBest Net netAlign calJac1 chainCalJac1 Marmoset (June 2007 (WUGSC 2.0.2/calJac1)) Reciprocal Best Alignment Net 0 2 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Reciprocal Best Alignment Net\ otherDb calJac1\ parent rBestNet\ priority 2\ shortLabel $o_Organism RBest Net\ spectrum on\ track netRBestCalJac1\ type netAlign calJac1 chainCalJac1\ visibility hide\ netSyntenyPonAbe2 Orangutan Syn Net netAlign ponAbe2 chainPonAbe2 Orangutan (July 2007 (WUGSC 2.0.2/ponAbe2)) Syntenic Alignment Net 0 2 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb ponAbe2\ parent syntenicNet\ priority 2\ shortLabel $o_Organism Syn Net\ spectrum on\ track netSyntenyPonAbe2\ type netAlign ponAbe2 chainPonAbe2\ visibility hide\ encodeEgaspPartAceOther ACEScan Other genePred ACEScan Unconserved Alternative and Constitutive Exon Predictions 0 2 66 12 133 160 133 194 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 66,12,133\ longLabel ACEScan Unconserved Alternative and Constitutive Exon Predictions\ parent encodeEgaspPartial\ priority 2\ shortLabel ACEScan Other\ track encodeEgaspPartAceOther\ encodeAffyChIpHl60SitesBrg1Hr00 Affy Brg1 RA 0h bed 3 . Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 0hrs) Sites 0 2 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 2\ shortLabel Affy Brg1 RA 0h\ subGroups factor=Brg1 time=0h\ track encodeAffyChIpHl60SitesBrg1Hr00\ affyAllExonProbes Affy Exon Probes bed 6 . Affymetrix Exon Array 1.0: Probesets 3 2 0 0 0 127 127 127 1 0 0\ The Exon GeneChip contains over one million probe \ sets\ designed to interrogate individual exons rather than the 3' ends of transcripts\ as in traditional GeneChips. Exons were derived from a variety of\ annotations that have been divided into the classes Core, Extended\ and Full. \
\ Probe sets are colored by class with the Core probe sets being\ the darkest and the Full being the lightest color. Additionally, probe\ sets that do not overlap the exons of a transcript cluster, but fall\ inside of its introns, are considered bounded by that transcript\ cluster and are colored slightly lighter. Probe sets that overlap the\ coding portion of the Core class are colored slightly darker.
\\ The microarray track using this probe set can be displayed by turning\ on the Affy Exon Tissue track.
\ \\ The exons interrogated by the probe sets displayed in this track are\ from the Affymetrix Exon 1.0 GeneChip and were derived from a\ number of sources. In addition to the millions of cDNA sequences\ contributed to the \ GenBank, \ dbEst and \ RefSeq \ databases by\ individual labs and scientists, the following annotations were used:\
\ Ensembl: \ Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J,\ Curwen V, Down T et al..\ The Ensembl genome database project.\ Nucleic Acids Research. 2002 Jan 1;30(1):38-41.
\\ Exoniphy: Siepel, A., Haussler, D. \ Computational identification of evolutionarily conserved \ exons.\ Proc. 8th Int'l Conf. on Research in Computational Molecular Biology, \ 177-186 (2004).
\\ Geneid Genes:\ Parra, G., Blanco, E., Guigo, R. \ Geneid in Drosophila.\ Genome Res. 10(4), 511-515 (2000).
\\ Genscan Genes:\ Burge, C., Karlin, S. \ Prediction of Complete Gene Structures in Human Genomic DNA.\ J. Mol. Biol. 268(1), 78-94 (1997).
\\ microRNA:\ Griffiths-Jones, S. \ The microRNA Registry. \ Nucl. Acids Res. 32, D109-D111 (2004).
\\ MITOMAP:\ Brandon, M. C., Lott, M. T., Nguyen, K. C., Spolim, S., Navathe, S. B., \ Baldi, P. & Wallace, D. C.\ MITOMAP: a human mitochondrial genome database--2004 update\ Nucl. Acids Res. 33(Database Issue):D611-613 (2005).
\\ RNA Genes:\ Lowe, T. M., Eddy, S. R. \ tRNAscan-SE: A Program for Improved Detection of Transfer RNA \ Genes in Genomic Sequence.\ Nucleic Acids Res., 25(5), 955-964 (1997).
\\ SGP Genes: \ Wiehe, T., Gebauer-Jung, S., Mitchell-Olds, T., Guigo, R. \ SGP-1: prediction and validation of homologous genes based on \ sequence alignments.\ Genome Res., 11(9), 1574-83 (2001).
\\ Twinscan Genes:\ Korf, I., Flicek, P., Duan, D., Brent, M.R. \ Integrating genomic homology into gene structure prediction.\ Bioinformatics 17, S140-148 (2001).\
\ Vega Genes \ and Pseudogenes: The HAVANA group, \ Wellcome Trust Sanger \ Institute.
\ expression 1 exonArrows on\ group expression\ longLabel Affymetrix Exon Array 1.0: Probesets\ priority 2.0\ shortLabel Affy Exon Probes\ spectrum on\ superTrack affyAllExonSuper dense\ track affyAllExonProbes\ type bed 6 .\ useScore 1\ visibility pack\ encodeAffyChIpHl60SignalStrictH3K9K14DHr02 Affy H3K9ac2 2h wig -2.78 3.97 Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 2hrs) Strict Signal 0 2 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 2hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 2\ shortLabel Affy H3K9ac2 2h\ subGroups factor=H3K9K14ac2 time=2h\ track encodeAffyChIpHl60SignalStrictH3K9K14DHr02\ encodeAffyChIpHl60SitesStrictH3K9K14DHr02 Affy H3K9ac2 2h bed 3 . Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 2hrs) Strict Sites 0 2 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 2hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 2\ shortLabel Affy H3K9ac2 2h\ subGroups factor=H3K9K14ac2 time=2h\ track encodeAffyChIpHl60SitesStrictH3K9K14DHr02\ encodeAffyChIpHl60PvalStrictH3K9K14DHr02 Affy H3K9ac2 2h wig 0 696.62 Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 2hrs) Strict P-Value 0 2 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 2hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 2\ shortLabel Affy H3K9ac2 2h\ subGroups factor=H3K9K14ac2 time=2h\ track encodeAffyChIpHl60PvalStrictH3K9K14DHr02\ encodeAffyRnaHeLaSitesIntronsProximal Affy In Prx HeLa bed 4 . Affy Intronic Proximal HeLa Transfrags 0 2 236 0 20 245 127 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 236,0,20\ longLabel Affy Intronic Proximal HeLa Transfrags\ parent encodeNoncodingTransFrags\ priority 2\ shortLabel Affy In Prx HeLa\ subGroups region=intronicProximal celltype=hela source=affy\ track encodeAffyRnaHeLaSitesIntronsProximal\ encodeTransFragsAffyIntronicDistal Affy Intron Dist bed 4 Affy Intronic Distal 0 2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Affy Intronic Distal\ parent encodeTransFrags\ priority 2\ shortLabel Affy Intron Dist\ track encodeTransFragsAffyIntronicDistal\ encodeAffyRnaHeLaSignal Affy RNA HeLa wig -1168.00 1686.5 Affymetrix PolyA+ RNA (HeLaS3) Signal 0 2 220 132 12 237 193 133 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 220,132,12\ longLabel Affymetrix PolyA+ RNA (HeLaS3) Signal\ parent encodeAffyRnaSignal\ priority 2\ shortLabel Affy RNA HeLa\ track encodeAffyRnaHeLaSignal\ encodeAffyRnaHeLaSites Affy RNA HeLa bed 3 . Affymetrix PolyA+ RNA (HeLaS3) Sites 0 2 220 132 12 237 193 133 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 220,132,12\ longLabel Affymetrix PolyA+ RNA (HeLaS3) Sites\ parent encodeAffyRnaTransfrags\ priority 2\ shortLabel Affy RNA HeLa\ track encodeAffyRnaHeLaSites\ snpArrayAffy6SV Affy SNP 6.0 SV bed 6 + Affymetrix SNP 6.0 Structural Variation 0 2 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Affymetrix SNP 6.0 Structural Variation\ parent snpArray\ priority 2\ shortLabel Affy SNP 6.0 SV\ track snpArrayAffy6SV\ type bed 6 +\ encodeHapMapAlleleFreqCHB Allele Freq CHB bed 6 + HapMap Minor Allele Frequencies Chinese (CHB) 0 2 0 0 0 127 127 127 1 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 1 longLabel HapMap Minor Allele Frequencies Chinese (CHB)\ parent encodeHapMapAlleleFreq\ priority 2\ shortLabel Allele Freq CHB\ track encodeHapMapAlleleFreqCHB\ encodeRegulomeAmpliconEven Amplicon (Even) bed 5 . Amplicon (Even) 0 2 0 0 0 127 127 127 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 longLabel Amplicon (Even)\ parent encodeRegulomeAmplicon\ priority 2\ shortLabel Amplicon (Even)\ track encodeRegulomeAmpliconEven\ encodeEgaspUpdAugustusEst Augustus/EST Upd genePred Augustus + EST/Protein Evidence Gene Predictions 0 2 12 65 165 133 160 210 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,65,165\ longLabel Augustus + EST/Protein Evidence Gene Predictions\ parent encodeEgaspUpdate\ priority 2\ shortLabel Augustus/EST Upd\ track encodeEgaspUpdAugustusEst\ encodeBuFirstExonColon BU Colon bed 12 + Boston University First Exon Activity in Colon 0 2 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Colon\ parent encodeBuFirstExon\ priority 2\ shortLabel BU Colon\ track encodeBuFirstExonColon\ cccTrendPvalCad CCC Coronary Art chromGraph Case Control Consortium coronary artery disease trend -log10 P-value 0 2 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel Case Control Consortium coronary artery disease trend -log10 P-value\ parent caseControl\ priority 2\ shortLabel CCC Coronary Art\ track cccTrendPvalCad\ hapmapLdHotspotCEU CEU bedGraph 4 Hotspots of Linkage Disequilibrium in the CEPH HapMap (CEU) 0 2 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Hotspots of Linkage Disequilibrium in the CEPH HapMap (CEU)\ parent hapmapLdHotspot\ priority 2\ shortLabel CEU\ track hapmapLdHotspotCEU\ fox2ClipSeqDensityForwardStrand Density Forward wig 0 2401 FOX2 adaptor-trimmed CLIP-seq Density Forward Strand 2 2 0 0 0 127 127 127 0 0 0 regulation 0 configurable on\ graphTypeDefault Bar\ longLabel FOX2 adaptor-trimmed CLIP-seq Density Forward Strand\ maxHeightPixels 128:36:16\ noInherit on\ parent fox2ClipSeqCompViewdensity\ priority 2\ shortLabel Density Forward\ spanList 1\ subGroups view=density\ track fox2ClipSeqDensityForwardStrand\ type wig 0 2401\ windowingFunction mean\ kiddEichlerDiscAbc13 Discordant ABC13 bed 12 HGSV Individual ABC13 (Yoruba) Discordant Clone End Alignments 0 2 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC13 (Yoruba) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 2\ shortLabel Discordant ABC13\ track kiddEichlerDiscAbc13\ encodeDNDSmedium dN/dS 0.2 to 0.5 bed 4 + ENCODE Exons dN/dS 0.2 to 0.5 0 2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel ENCODE Exons dN/dS 0.2 to 0.5\ parent encodeDNDS\ priority 2\ shortLabel dN/dS 0.2 to 0.5\ track encodeDNDSmedium\ encodeNhgriDnaseHsAct DNase CD4 Activ. bed 5 . NHGRI DNaseI-Hypersensitive Sites (CD4+ T-Cells Activated) 0 2 0 0 0 127 127 127 1 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 longLabel NHGRI DNaseI-Hypersensitive Sites (CD4+ T-Cells Activated)\ parent encodeNhgriDnaseHs\ priority 2\ shortLabel DNase CD4 Activ.\ track encodeNhgriDnaseHsAct\ encodeEgaspFullDogfish DOGFISH-C genePred DOGFISH-C Gene Predictions 0 2 12 20 150 133 137 202 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,20,150\ longLabel DOGFISH-C Gene Predictions\ parent encodeEgaspFull\ priority 2\ shortLabel DOGFISH-C\ track encodeEgaspFullDogfish\ encodeAffyEc51BrainCerebellumSignal EC51 Sgnl BrainC wig 0 62385 Affy Ext Trans Signal (51-base window) (Brain Cerebellum) 0 2 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (51-base window) (Brain Cerebellum)\ parent encodeAffyEcSignal\ priority 2\ shortLabel EC51 Sgnl BrainC\ track encodeAffyEc51BrainCerebellumSignal\ encodeAffyEc51BrainCerebellumSites EC51 Sites BrainC bed 3 . Affy Ext Trans Sites (51-base window) (Brain Cerebellum) 0 2 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (51-base window) (Brain Cerebellum)\ parent encodeAffyEcSites\ priority 2\ shortLabel EC51 Sites BrainC\ track encodeAffyEc51BrainCerebellumSites\ eioJcviNASNeg EIO/JCVI CD34- NAS bed 3 . CD34- cells Nuclease Accessible sites 0 2 100 30 250 177 142 252 0 0 0 regulation 1 color 100,30,250\ longLabel CD34- cells Nuclease Accessible sites\ parent eioJcviNAS\ priority 2\ shortLabel EIO/JCVI CD34- NAS\ track eioJcviNASNeg\ encodeUncFairePeaks FAIRE PeakFinder bedGraph 4 University of North Carolina FAIRE Peaks (PeakFinder) 0 2 20 150 20 50 100 50 0 0 21 chr1,chr4,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chr10,chrX, encodeChrom 0 autoScale off\ longLabel University of North Carolina FAIRE Peaks (PeakFinder)\ maxHeightPixels 128:24:16\ noInherit on\ parent encodeUncFaire\ priority 2\ shortLabel FAIRE PeakFinder\ spanList 38\ track encodeUncFairePeaks\ type bedGraph 4\ viewLimits 0.4:3.7\ windowingFunction mean\ encodeGencodeGenePutativeMar07 Gencode Putative genePred Gencode Putative Genes 0 2 84 188 0 169 221 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 84,188,0\ longLabel Gencode Putative Genes\ parent encodeGencodeGeneMar07\ priority 2\ shortLabel Gencode Putative\ track encodeGencodeGenePutativeMar07\ encodeGencodeGenePutativeOct05 Gencode Putative genePred Gencode Putative Genes 0 2 84 188 0 169 221 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 84,188,0\ longLabel Gencode Putative Genes\ parent encodeGencodeGeneOct05\ priority 2\ shortLabel Gencode Putative\ track encodeGencodeGenePutativeOct05\ encodeGisRnaPetHCT116 GIS RNA HCT116 bed 12 Gene Identification Signature Paired-End Tags of PolyA+ RNA (5FU-stim HCT116) 0 2 58 119 40 156 187 147 0 0 23 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 58,119,40\ longLabel Gene Identification Signature Paired-End Tags of PolyA+ RNA (5FU-stim HCT116)\ parent encodeGisRnaPet\ priority 2\ shortLabel GIS RNA HCT116\ track encodeGisRnaPetHCT116\ encodeRegulomeQualityGM06990 GM06990 bed 5 . GM06990 Quality 0 2 90 50 210 172 152 232 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 color 90,50,210\ longLabel GM06990 Quality\ parent encodeRegulomeQuality\ priority 2\ shortLabel GM06990\ track encodeRegulomeQualityGM06990\ encodeRegulomeProbGM06990 GM06990 bedGraph 4 GM06990 DNaseI HSs 0 2 90 50 210 172 152 232 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 90,50,210\ longLabel GM06990 DNaseI HSs\ parent encodeRegulomeProb\ priority 2\ shortLabel GM06990\ track encodeRegulomeProbGM06990\ encodeRegulomeBaseGM06990 GM06990 wig 0.0 3.0 GM06990 DNaseI Sensitivity 0 2 90 50 210 172 152 232 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 90,50,210\ longLabel GM06990 DNaseI Sensitivity\ parent encodeRegulomeBase\ priority 2\ shortLabel GM06990\ track encodeRegulomeBaseGM06990\ encodeHapMapCovCHB HapMap Cov CHB wig 0.0 100.0 HapMap Resequencing Coverage Chinese (CHB) 0 2 0 0 0 127 127 127 0 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 0 longLabel HapMap Resequencing Coverage Chinese (CHB)\ parent encodeHapMapCov\ priority 2\ shortLabel HapMap Cov CHB\ track encodeHapMapCovCHB\ snpRecombRateHapmapPhase2 HapMap Phase II bedGraph 4 Oxford Recombination Rates from HapMap Phase II Release 21 0 2 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Oxford Recombination Rates from HapMap Phase II Release 21\ parent snpRecombRate\ priority 2\ shortLabel HapMap Phase II\ track snpRecombRateHapmapPhase2\ hapmapSnpsCEU HapMap SNPs CEU bed 6 + HapMap SNPs from the CEU Population (Northern and Western European Ancestry in Utah, US - CEPH) 0 2 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the CEU Population (Northern and Western European Ancestry in Utah, US - CEPH)\ parent hapmapSnps\ priority 2\ shortLabel HapMap SNPs CEU\ track hapmapSnpsCEU\ encodePseudogeneHavana Havana-Gencode Pseudogenes genePred Havana-Gencode Annotated Pseudogenes and Immunoglobulin Segments 0 2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, http://vega.sanger.ac.uk/Homo_sapiens/geneview?transcript=$$ encodeGenes 1 longLabel Havana-Gencode Annotated Pseudogenes and Immunoglobulin Segments\ parent encodePseudogene\ priority 2\ shortLabel Havana-Gencode Pseudogenes\ track encodePseudogeneHavana\ url http://vega.sanger.ac.uk/Homo_sapiens/geneview?transcript=$$\ urlLabel Vega Genes Link:\ hgdpHzyBantu Hetzgty Bantu bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (Bantu pops. in Africa) 0 2 224 0 0 239 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 224, 0, 0\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (Bantu pops. in Africa)\ parent hgdpHzy\ priority 2\ shortLabel Hetzgty Bantu\ track hgdpHzyBantu\ cnpIafrate Iafrate CNPs bed 4 + Copy Number Polymorphisms from BAC Microarray Analysis (Iafrate) 0 2 0 0 0 127 127 127 0 0 0\ This track shows 255 regions detected as putative copy number polymorphisms by BAC microarray analysis \ in a population of 55 individuals, 16 of which had previously characterized chromosome abnormalities.\ \
\ Hybridizations were all performed in duplicate incorporating a dye-reversal using proprietary 1 Mb \ GenomeChip V1.2 Human BAC Arrays consisting of 2,632 BAC clones (Spectral Genomics, Houston, TX). \ The false positive rate was estimated as ~1 clone per 5,264 tested. \
\ Further information is available at \ http://projects.tcag.ca/variation.\ \
\ Iafrate JA, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, Lee C (2004) \ Detection of large-scale variation in the human genome. Nature Genet 36:949-951\ varRep 1 longLabel Copy Number Polymorphisms from BAC Microarray Analysis (Iafrate)\ noInherit on\ parent cnp\ priority 2\ shortLabel Iafrate CNPs\ track cnpIafrate\ type bed 4 +\ hgdpIhsMideast iHS Mideast bedGraph 4 Human Genome Diversity Project iHS (Mideast) 0 2 0 0 200 127 127 227 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 color 0,0,200\ longLabel Human Genome Diversity Project iHS (Mideast)\ parent hgdpIhs\ priority 2\ shortLabel iHS Mideast\ track hgdpIhsMideast\ encodeGencodeIntronsDistal Intronic Dist bed 4 . Gencode Intronic Distal Regions 0 2 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Gencode Intronic Distal Regions\ parent encodeGencodeRegions\ priority 2\ shortLabel Intronic Dist\ track encodeGencodeIntronsDistal\ encodeAllIntronsProximal Intronic Prox bed 4 Consensus Intronic Proximal 0 2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Consensus Intronic Proximal\ parent encodeWorkshopSelections\ priority 2\ shortLabel Intronic Prox\ track encodeAllIntronsProximal\ iscaRetrospectiveLikelyBenign ISCA Ret Lik.Ben gvf Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Likely Benign) 0 2 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Likely Benign)\ parent iscaRetrospectiveComposite\ priority 2\ shortLabel ISCA Ret Lik.Ben\ track iscaRetrospectiveLikelyBenign\ L2_LINE L2_LINE bed 5 L2 LINEs for Intersection 0 2 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel L2 LINEs for Intersection\ parent encodeWorkshopIntersections\ priority 2\ shortLabel L2_LINE\ track L2_LINE\ hapmapLdCeu LD CEU bed 4 + Linkage Disequilibrium for the CEPH (CEU) 0 2 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Linkage Disequilibrium for the CEPH (CEU)\ parent hapmapLd\ priority 2\ shortLabel LD CEU\ track hapmapLdCeu\ encodeUcsdChipHeLaH3H4dmH3K4_p30 LI H3K4me2 +gIF bedGraph 4 Ludwig Institute ChIP-chip: H3K4me2 ab, HeLa cells, 30 min. after gamma interferon 0 2 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3K4me2 ab, HeLa cells, 30 min. after gamma interferon\ parent encodeLIChIPgIF\ priority 2\ shortLabel LI H3K4me2 +gIF\ track encodeUcsdChipHeLaH3H4dmH3K4_p30\ encodeUcsdNgHeLaH3K4me3_p30 LI Ng H3K4m3 +gIF bedGraph 4 Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, H3K4me3, 30 min after gamma interferon 0 2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, H3K4me3, 30 min after gamma interferon\ parent encodeUcsdNgGif\ priority 2\ shortLabel LI Ng H3K4m3 +gIF\ track encodeUcsdNgHeLaH3K4me3_p30\ encodeUcsdChipRnapThp1_f LI Pol2 THP1 bedGraph 4 Ludwig Institute ChIP-chip: Pol2 8WG16 ab, THP1 cells 0 2 0 63 135 127 159 195 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,63,135\ longLabel Ludwig Institute ChIP-chip: Pol2 8WG16 ab, THP1 cells\ parent encodeLIChIP\ priority 2\ shortLabel LI Pol2 THP1\ track encodeUcsdChipRnapThp1_f\ jaxQtlPadded MGI Mouse QTL Padded bed 4 . MGI Mouse QTL Peak-Score Markers Padded to 100k and Coarsely Mapped to Human 0 2 200 100 0 227 177 127 0 0 0 http://www.informatics.jax.org/searches/accession_report.cgi?id=$$ phenDis 1 color 200,100,0\ longLabel MGI Mouse QTL Peak-Score Markers Padded to 100k and Coarsely Mapped to $Organism\ parent jaxQtlMapped\ priority 2\ shortLabel MGI Mouse QTL Padded\ track jaxQtlPadded\ snp131Misc Misc SNPs (131) bed 6 + Simple Nucleotide Polymorphisms (dbSNP build 131) not from HapMap and/or 1000Genomes 1 2 0 0 0 127 127 127 0 0 0 http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?type=rs&rs=$$ varRep 1 longLabel Simple Nucleotide Polymorphisms (dbSNP build 131) not from HapMap and/or 1000Genomes\ parent snp131Composite\ priority 2\ shortLabel Misc SNPs (131)\ subGroups view=misc\ track snp131Misc\ visibility dense\ encodeMlaganBinConsEl MLAGAN BinCons bed 5 . MLAGAN BinCons Conserved Elements 0 2 170 5 10 212 130 132 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 170,5,10\ longLabel MLAGAN BinCons Conserved Elements\ parent encodeMlaganElements\ priority 2\ shortLabel MLAGAN BinCons\ track encodeMlaganBinConsEl\ encodeMlaganBinCons MLAGAN BinCons wig 0.0 1.0 MLAGAN BinCons Conservation 0 2 190 70 80 222 162 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 0 color 190,70,80\ longLabel MLAGAN BinCons Conservation\ parent encodeMlaganCons\ priority 2\ shortLabel MLAGAN BinCons\ track encodeMlaganBinCons\ encodeAllNcUnionEl NC Union bed 5 . TBA and MLAGAN PhastCons/BinCons/GERP Union NonCoding Conserved Elements 0 2 80 105 145 167 180 200 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,105,145\ longLabel TBA and MLAGAN PhastCons/BinCons/GERP Union NonCoding Conserved Elements\ parent encodeAllElements\ priority 2\ shortLabel NC Union\ track encodeAllNcUnionEl\ nimhBipolarDe NIMH Bipolar De chromGraph NIMH Bipolar disorder (German) -log10 P-value 0 2 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel NIMH Bipolar disorder (German) -log10 P-value\ parent nimhBipolar\ priority 2\ shortLabel NIMH Bipolar De\ track nimhBipolarDe\ numtSAssembled NumtS assembled bed 12 . Human NumtS assembled 0 2 0 60 120 127 157 187 1 0 0
\ NumtS (Nuclear mitochondrial sequences) are mitochondrial fragments inserted in nuclear genomic sequences. The most credited hypothesis concerning their generation suggests that in presence of mutagenic agents or under stress conditions fragments of mtDNA escape from mitochondria, reach the nucleus and insert into chromosomes during break repair, although NumtS can derive from duplication of genomic fragments. NumtS may be cause of contamination during human mtDNA sequencing and hence frequent false low heteroplasmic evidences have been reported.\ The Bioinformatics group chaired by M.Attimonelli (Bari, Italy) has produced the RHNumtS compilation annotating more than 500 Human NumtS. To allow the scientific community to access to the compilation and to perform genomics comparative analyses inclusive of the NumtS data, the group has designed the Human NumtS tracks below described.\
\ \\ The NumtS tracks show the High Score Pairs (HSPs) obtained by aligning the mitochondrial reference genome (NC_012920) with the hg18 release of the human genome.\
\\ The "NumtS mitochondrial sequences" track shows the mapping of the HSPs returned by BlastN on the nuclear genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the mitochondrial mapping is provided, thus allowing a fast cross among the NumtS genomic contexts.\
\\ The "NumtS assembled" track shows items obtained by assembling HSPs annotated in the "NumtS" track fulfilling the following conditions:\
\ Exceptions for the second condition arise when a long repetitive element is present between two HSPs.\
\\ The "NumtS on mitochondrion" track shows the mapping of the HSPs on the mitochondrial genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the nuclear mapping is provided.\
\ \\ The "NumtS on mitochondrion with chromosome placement" shows the mapping of the HSPs on the mitochondrial genome, but the items are coloured according to the colours assigned to each human chromosome on the UCSC genome browser. No shading is here provided. For every item, a link pointing to the nuclear mapping is provided.\
\\ NumtS mappings were obtained by running Blast2seq (program: BlastN) between each chromosome of of the Human Genome hg18 build and the human mitochondrial reference sequence (rCRS, AC: NC_012920), fixing the e-value threshold to 1e-03. The assembling of the HSPs was performed with spreadsheet interpolation and manual inspection.\
\ \\ NumtS predicted in silico were validated by carrying out PCR amplification and sequencing on blood-extracted DNA of a healthy individual of European origin. PCR amplification was successful for 275 NumtS and provided amplicons of the expected length. All PCR fragments were sequenced on both strands, and submitted to the EMBL databank.\
\\ Furthermore, 541 NumtS were validated by merging NumtS nuclear coordinates with HapMap annotations. Our analysis has been carried on eight HapMap individuals (NA18517, NA18507, NA18956, NA19240, NA18555, NA12878, NA19129, NA12156). For each sample, clones with a single best concordant placement (according to the fosmid end-sequence-pair analysis described in Kidd et al., 2008), have been considered. The analysis showed that 541 NumtS (at least 30bp for each one) had been sequenced in such samples.\
\ \\ These data were provided by Domenico Simone and Marcella Attimonelli at Department of Biochemistry and Molecular Biology "Ernesto Quagliariello" (University of Bari, Italy). Primer designing was carried out by Francesco Calabrese and Giuseppe Mineccia. PCR validation was carried out by Martin Lang, Domenico Simone and Giuseppe Gasparre. Merging with HapMap annotations has been performed by Domenico Simone.\
\ \\ Simone D, Calabrese FM, Lang M, Gasparre G, Attimonelli M: Validation and UCSC tracks of the extended RHNumtS compilation (submitted). \
\ \\ Lascaro D, Castellana S, Gasparre G, Romeo G, Saccone S, Attimonelli M. The RHNumtS compilation: features and bioinformatics approaches to locate and quantify Human NumtS. BMC\ Genomics. 2008 June 3;9:267.\
\ \\ Kidd JM, Cooper GM, Donahue WF, et al.\ \ Mapping and sequencing of structural variation from eight human genomes.\ Nature, 2008, 453(7191):56-64.\
\ \ \ \ varRep 1 color 0,60,120\ html numtSeq\ longLabel Human NumtS assembled\ parent numtSeq\ priority 2\ shortLabel NumtS assembled\ track numtSAssembled\ type bed 12 .\ useScore 1\ snpRecombHotspotPerlegen Perlegen bed 3 . Oxford Recombination Hotspots from Perlegen Data 0 2 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Oxford Recombination Hotspots from Perlegen Data\ parent snpRecombHotspot\ priority 2\ shortLabel Perlegen\ track snpRecombHotspotPerlegen\ hapmapLdPhCeu Phased CEU ld2 Linkage Disequilibrium for the CEPH (CEU) from phased genotypes 0 2 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 longLabel Linkage Disequilibrium for the CEPH (CEU) from phased genotypes\ parent hapmapLdPh\ priority 2\ shortLabel Phased CEU\ track hapmapLdPhCeu\ encodeGencodeRaceFragsBrain RACEfrags Brain genePred Gencode RACEfrags from Brain 0 2 248 0 8 251 127 131 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 248,0,8\ longLabel Gencode RACEfrags from Brain\ parent encodeGencodeRaceFrags\ priority 2\ shortLabel RACEfrags Brain\ track encodeGencodeRaceFragsBrain\ encodeRegulomeDnaseHs Regulome DNAseI HS bed 4 Regulome DNAseI HS 0 2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Regulome DNAseI HS\ parent encodeDnase\ priority 2\ shortLabel Regulome DNAseI HS\ track encodeRegulomeDnaseHs\ encodeRikenCageMinus Riken CAGE - bedGraph 4 Riken CAGE Minus Strand - Predicted Gene Start Sites 0 2 43 51 109 149 153 182 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 43,51,109\ longLabel Riken CAGE Minus Strand - Predicted Gene Start Sites\ parent encodeRikenCage\ priority 2\ shortLabel Riken CAGE -\ track encodeRikenCageMinus\ encodeRikenCageMappedTagsNegative Riken CAGE MT - bedGraph 4 Riken CAGE Mapped Tags overlap count, Minus strand - TEST TRACK ONLY 0 2 43 51 109 149 153 182 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 43,51,109\ longLabel Riken CAGE Mapped Tags overlap count, Minus strand - TEST TRACK ONLY\ parent encodeRikenCageMappedTagsScore\ priority 2\ shortLabel Riken CAGE MT -\ track encodeRikenCageMappedTagsNegative\ decodeSexAveragedCarrier Sex Average Carrier bigWig 0.0 76.046 deCODE recombination map, sex-average carrier 2 2 209 45 51 232 150 153 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 209,45,51\ configurable on\ longLabel deCODE recombination map, sex-average carrier\ parent avgView off\ priority 2\ shortLabel Sex Average Carrier\ subGroups view=avg\ track decodeSexAveragedCarrier\ type bigWig 0.0 76.046\ encodeSangerChipH3K4me2 SI H3K4m2 GM06990 bedGraph 4 Sanger Institute ChIP/Chip (H3K4me2 ab, GM06990 cells) 0 2 10 10 130 132 132 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,130\ longLabel Sanger Institute ChIP/Chip (H3K4me2 ab, GM06990 cells)\ parent encodeSangerChipH3H4\ priority 2\ shortLabel SI H3K4m2 GM06990\ track encodeSangerChipH3K4me2\ stanfordChipGMO6990SRF Stan GMO6690 SRF bedGraph 4 Stanford ChIP-chip (GMO6990 cells, SRF ChIP) 0 2 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (GMO6990 cells, SRF ChIP)\ parent stanfordChip\ priority 2\ shortLabel Stan GMO6690 SRF\ track stanfordChipGMO6990SRF\ encodeStanfordChipGMO6990SRF Stan GMO6690 SRF bedGraph 4 Stanford ChIP-chip (GMO6990 cells, SRF ChIP) 0 2 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (GMO6990 cells, SRF ChIP)\ parent encodeStanfordChipJohnson\ priority 2\ shortLabel Stan GMO6690 SRF\ track encodeStanfordChipGMO6990SRF\ encodeStanfordChipHCT116Sp3 Stan HCT116 Sp3 bedGraph 4 Stanford ChIP-chip (HCT116 cells, Sp3 ChIP) 0 2 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HCT116 cells, Sp3 ChIP)\ parent encodeStanfordChip\ priority 2\ shortLabel Stan HCT116 Sp3\ track encodeStanfordChipHCT116Sp3\ encodeStanfordMethCRL1690 Stan Meth CRL1690 bedGraph 4 Stanford Methylation Digest (CRL1690 cells) 0 2 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (CRL1690 cells)\ parent encodeStanfordMeth\ priority 2\ shortLabel Stan Meth CRL1690\ track encodeStanfordMethCRL1690\ encodeStanfordMethSmoothedCRL1690 Stan Meth Sc CRL1690 bedGraph 4 Stanford Methylation Digest Smoothed Score (CRL1690 cells) 0 2 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (CRL1690 cells)\ parent encodeStanfordMethSmoothed\ priority 2\ shortLabel Stan Meth Sc CRL1690\ track encodeStanfordMethSmoothedCRL1690\ encodeStanfordPromotersBe2C Stan Pro Be2c bed 9 + Stanford Promoter Activity (Be2c cells) 0 2 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (Be2c cells)\ parent encodeStanfordPromoters\ priority 2\ shortLabel Stan Pro Be2c\ track encodeStanfordPromotersBe2C\ encodeStanfordChipSmoothedHCT116Sp3 Stan Sc HCT116 Sp3 bedGraph 4 Stanford ChIP-chip Smoothed Score (HCT116 cells, Sp3 ChIP) 0 2 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip Smoothed Score (HCT116 cells, Sp3 ChIP)\ parent encodeStanfordChipSmoothed\ priority 2\ shortLabel Stan Sc HCT116 Sp3\ track encodeStanfordChipSmoothedHCT116Sp3\ encodeStanfordNRSFControl Stanf NRSF Control bed 6 . Stanford NRSF/REST Control 0 2 0 128 0 127 191 127 0 0 23 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,chrY,chrM, encodeChip 1 color 0,128,0\ longLabel Stanford NRSF/REST Control\ parent encodeStanfordNRSF\ priority 2\ shortLabel Stanf NRSF Control\ track encodeStanfordNRSFControl\ mapGenethon STS Markers bed 5 + Various STS Markers 0 2 0 0 0 127 127 127 0 0 0 map 1 group map\ longLabel Various STS Markers\ priority 2\ shortLabel STS Markers\ track mapGenethon\ type bed 5 +\ visibility hide\ tajdAd Tajima's D AD bedGraph 4 Tajima's D from African Descent 0 2 200 100 0 0 100 200 0 0 0 varRep 0 altColor 0,100,200\ color 200,100,0\ longLabel Tajima's D from African Descent\ parent tajD\ priority 2\ shortLabel Tajima's D AD\ track tajdAd\ encodeTbaBinCons TBA BinCons wig 0.0 1.0 TBA BinCons Conservation 0 2 190 70 80 222 162 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 0 color 190,70,80\ longLabel TBA BinCons Conservation\ parent encodeTbaCons\ priority 2\ shortLabel TBA BinCons\ track encodeTbaBinCons\ encodeTbaBinConsEl TBA BinCons bed 5 . TBA BinCons Conserved Elements 0 2 190 70 80 222 162 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 190,70,80\ longLabel TBA BinCons Conserved Elements\ parent encodeTbaElements\ priority 2\ shortLabel TBA BinCons\ track encodeTbaBinConsEl\ hiSeqDepthTopPt5Pct Top 0.005 Depth bed 3 Top 0.005 of Read Depth Distribution 0 2 139 69 19 197 162 137 0 0 0 map 1 longLabel Top 0.005 of Read Depth Distribution\ parent hiSeqDepth\ priority 2\ shortLabel Top 0.005 Depth\ track hiSeqDepthTopPt5Pct\ encodeUtexChip2091fibMycRaw UT Myc Fb bedGraph 4 University of Texas, Austin ChIP-chip (c-Myc, 2091 fibroblasts) 0 2 120 30 50 187 142 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 120,30,50\ longLabel University of Texas, Austin ChIP-chip (c-Myc, 2091 fibroblasts)\ parent encodeUtexChip\ priority 2\ shortLabel UT Myc Fb\ subGroups dataType=raw\ track encodeUtexChip2091fibMycRaw\ encodeUvaDnaRep2 UVa DNA Rep 2h bed 3 . University of Virginia Temporal Profiling of DNA Replication (2-4 hrs) 0 2 60 75 60 10 130 10 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 longLabel University of Virginia Temporal Profiling of DNA Replication (2-4 hrs)\ parent encodeUvaDnaRep\ priority 2\ shortLabel UVa DNA Rep 2h\ track encodeUvaDnaRep2\ encodeUvaDnaRepMid UVa DNA Rep Mid bed 3 . University of Virginia Temporal Profiling of DNA Replication (Mid) 0 2 30 80 130 142 167 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 color 30,80,130\ longLabel University of Virginia Temporal Profiling of DNA Replication (Mid)\ parent encodeUvaDnaRepSeg\ priority 2\ shortLabel UVa DNA Rep Mid\ track encodeUvaDnaRepMid\ encodeUvaDnaRepOriginsNSHela UVa Ori-NS HeLa bed 3 . University of Virginia DNA Replication Origins, Ori-NS, HeLa 0 2 250 0 0 252 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 color 250,0,0\ dataVersion May 2007\ longLabel University of Virginia DNA Replication Origins, Ori-NS, HeLa\ origAssembly hg17\ parent encodeUvaDnaRepOrigins\ priority 2\ shortLabel UVa Ori-NS HeLa\ track encodeUvaDnaRepOriginsNSHela\ kiddEichlerValidAbc13 Validated ABC13 bed 9 HGSV Individual ABC13 (Yoruba) Validated Sites of Structural Variation 0 2 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual ABC13 (Yoruba) Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 2\ shortLabel Validated ABC13\ track kiddEichlerValidAbc13\ hgdpXpehhMideast XP-EHH Mideast bedGraph 4 Human Genome Diversity Project XP-EHH (Mideast) 0 2 0 0 200 127 127 227 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 0,0,200\ longLabel Human Genome Diversity Project XP-EHH (Mideast)\ parent hgdpXpehh\ priority 2\ shortLabel XP-EHH Mideast\ track hgdpXpehhMideast\ encodeYaleChIPSTAT1HeLaMaskLess50mer38bpPval Yale 50-38 PVal bedGraph 4 Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 50-mer, 38bp Win, P-Values 0 2 50 50 200 152 152 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,50,200\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 50-mer, 38bp Win, P-Values\ parent encodeYaleChIPSTAT1Pval\ priority 2\ shortLabel Yale 50-38 PVal\ track encodeYaleChIPSTAT1HeLaMaskLess50mer38bpPval\ encodeYaleChIPSTAT1HeLaMaskLess50mer38bpSig Yale 50-38 Sig bedGraph 4 Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 50-mer, 38bp Win, Signal 0 2 112 63 175 224 66 81 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 altColor 224,66,81\ color 112,63,175\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 50-mer, 38bp Win, Signal\ parent encodeYaleChIPSTAT1Sig\ priority 2\ shortLabel Yale 50-38 Sig\ track encodeYaleChIPSTAT1HeLaMaskLess50mer38bpSig\ encodeYaleChIPSTAT1HeLaMaskLess50mer38bpSite Yale 50-38 Sites bed . Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 50-mer, 38bp Win, Binding Sites 0 2 200 50 50 50 50 200 0 0 18 chr1,chr10,chr11,chr13,chr14,chr15,chr16,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 altColor 50,50,200\ color 200,50,50\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells) Maskless 50-mer, 38bp Win, Binding Sites\ parent encodeYaleChIPSTAT1Sites\ priority 2\ shortLabel Yale 50-38 Sites\ track encodeYaleChIPSTAT1HeLaMaskLess50mer38bpSite\ encodeYaleMASNB4RNANprotTMREVMless36mer36bp Yale NB4 NgR RNA bedGraph 4 Yale NB4 RNA Trans Map, MAS Array, Reverse Direction, NimbleGen Protocol 0 2 50 50 200 200 50 50 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 0 altColor 200,50,50\ color 50,50,200\ longLabel Yale NB4 RNA Trans Map, MAS Array, Reverse Direction, NimbleGen Protocol\ parent encodeYaleMASPlacRNATransMap\ priority 2\ shortLabel Yale NB4 NgR RNA\ track encodeYaleMASNB4RNANprotTMREVMless36mer36bp\ encodeYaleMASNB4RNANProtTarsREVMless36mer36bp Yale NB4 NgR TAR bed 6 . Yale NB4 RNA TARs, MAS array, Reverse Direction, NimbleGen Protocol 0 2 50 50 200 200 50 50 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 1 altColor 200,50,50\ color 50,50,200\ longLabel Yale NB4 RNA TARs, MAS array, Reverse Direction, NimbleGen Protocol\ parent encodeYaleMASPlacRNATars\ priority 2\ shortLabel Yale NB4 NgR TAR\ track encodeYaleMASNB4RNANProtTarsREVMless36mer36bp\ encodeYaleAffyNeutRNATransMap01 Yale RNA Neu 1 wig -2730 3394 Yale Neutrophil RNA Transcript Map, Sample 1 0 2 50 205 50 152 230 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,205,50\ longLabel Yale Neutrophil RNA Transcript Map, Sample 1\ parent encodeYaleAffyRNATransMap\ priority 2\ shortLabel Yale RNA Neu 1\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATransMap01\ encodeYaleAffyNeutRNATars01 Yale TAR Neu 1 bed 3 . Yale Neutrophil RNA Transcriptionally Active Region, Sample 1 0 2 50 205 50 152 230 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,205,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region, Sample 1\ parent encodeYaleAffyRNATars\ priority 2\ shortLabel Yale TAR Neu 1\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATars01\ netRBestTarSyr1 Tarsier RBest Net netAlign tarSyr1 chainTarSyr1 Tarsier (Aug. 2008 (Broad/tarSyr1)) Reciprocal Best Alignment Net 0 3 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Reciprocal Best Alignment Net\ otherDb tarSyr1\ parent rBestNet\ priority 3\ shortLabel $o_Organism RBest Net\ spectrum on\ track netRBestTarSyr1\ type netAlign tarSyr1 chainTarSyr1\ visibility hide\ netSyntenyRheMac2 Rhesus Syn Net netAlign rheMac2 chainRheMac2 Rhesus (Jan. 2006 (MGSC Merged 1.0/rheMac2)) Syntenic Alignment Net 0 3 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb rheMac2\ parent syntenicNet\ priority 3\ shortLabel $o_Organism Syn Net\ spectrum on\ track netSyntenyRheMac2\ type netAlign rheMac2 chainRheMac2\ visibility hide\ encodeAffyChIpHl60PvalBrg1Hr02 Affy Brg1 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 2hrs) P-Value 0 3 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 3\ shortLabel Affy Brg1 RA 2h\ subGroups factor=Brg1 time=2h\ track encodeAffyChIpHl60PvalBrg1Hr02\ encodeAffyChIpHl60SignalStrictH3K9K14DHr08 Affy H3K9ac2 8h wig -2.78 3.97 Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 8hrs) Strict Signal 0 3 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 8hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 3\ shortLabel Affy H3K9ac2 8h\ subGroups factor=H3K9K14ac2 time=8h\ track encodeAffyChIpHl60SignalStrictH3K9K14DHr08\ encodeAffyChIpHl60SitesStrictH3K9K14DHr08 Affy H3K9ac2 8h bed 3 . Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 8hrs) Strict Sites 0 3 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 8hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 3\ shortLabel Affy H3K9ac2 8h\ subGroups factor=H3K9K14ac2 time=8h\ track encodeAffyChIpHl60SitesStrictH3K9K14DHr08\ encodeAffyChIpHl60PvalStrictH3K9K14DHr08 Affy H3K9ac2 8h wig 0 696.62 Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 8hrs) Strict P-Value 0 3 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 8hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 3\ shortLabel Affy H3K9ac2 8h\ subGroups factor=H3K9K14ac2 time=8h\ track encodeAffyChIpHl60PvalStrictH3K9K14DHr08\ encodeAffyRnaHl60SitesHr00IntronsProximal Affy In Prx HL60 bed 4 . Affy Intronic Proximal HL60 Transfrags 0 3 224 0 32 239 127 143 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 224,0,32\ longLabel Affy Intronic Proximal HL60 Transfrags\ parent encodeNoncodingTransFrags\ priority 3\ shortLabel Affy In Prx HL60\ subGroups region=intronicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr00IntronsProximal\ encodeTransFragsAffyProximal Affy Prox bed 4 Affy Proximal 0 3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Affy Proximal\ parent encodeTransFrags\ priority 3\ shortLabel Affy Prox\ track encodeTransFragsAffyProximal\ encodeAffyRnaHl60SignalHr00 Affy RNA RA 0h wig -1168.00 1686.5 Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 0hrs) Signal 0 3 50 50 150 152 152 202 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,50,150\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 0hrs) Signal\ parent encodeAffyRnaSignal\ priority 3\ shortLabel Affy RNA RA 0h\ track encodeAffyRnaHl60SignalHr00\ encodeAffyRnaHl60SitesHr00 Affy RNA RA 0h bed 3 . Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 0hrs) Sites 0 3 50 50 150 152 152 202 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,50,150\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyRnaTransfrags\ priority 3\ shortLabel Affy RNA RA 0h\ track encodeAffyRnaHl60SitesHr00\ snpArrayAffy5 Affy SNP 5.0 bed 6 + Affymetrix SNP 5.0 0 3 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Affymetrix SNP 5.0\ parent snpArray off\ priority 3\ shortLabel Affy SNP 5.0\ track snpArrayAffy5\ type bed 6 +\ encodeHapMapAlleleFreqJPT Allele Freq JPT bed 6 + HapMap Minor Allele Frequencies Japanese (JPT) 0 3 0 0 0 127 127 127 1 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 1 longLabel HapMap Minor Allele Frequencies Japanese (JPT)\ parent encodeHapMapAlleleFreq\ priority 3\ shortLabel Allele Freq JPT\ track encodeHapMapAlleleFreqJPT\ Alu_SINE Alu_SINE bed 5 Alu SINEs for Intersection 0 3 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Alu SINEs for Intersection\ parent encodeWorkshopIntersections\ priority 3\ shortLabel Alu_SINE\ track Alu_SINE\ encodeEgaspUpdAugustusDual August/Mouse Upd genePred Augustus + Mouse Homology Gene Predictions 0 3 12 85 135 133 170 195 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,85,135\ longLabel Augustus + Mouse Homology Gene Predictions\ parent encodeEgaspUpdate\ priority 3\ shortLabel August/Mouse Upd\ track encodeEgaspUpdAugustusDual\ encodeEgaspPartAugustusAbinitio Augustus genePred Augustus Ab Initio Gene Predictions 0 3 12 50 200 133 152 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,50,200\ longLabel Augustus Ab Initio Gene Predictions\ parent encodeEgaspPartial\ priority 3\ shortLabel Augustus\ track encodeEgaspPartAugustusAbinitio\ encodeBuFirstExonHeart BU Heart bed 12 + Boston University First Exon Activity in Heart 0 3 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Heart\ parent encodeBuFirstExon\ priority 3\ shortLabel BU Heart\ track encodeBuFirstExonHeart\ cccTrendPvalCd CCC Crohns Dis chromGraph Case Control Consortium Crohn's disease trend -log10 P-value 0 3 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel Case Control Consortium Crohn's disease trend -log10 P-value\ parent caseControl\ priority 3\ shortLabel CCC Crohns Dis\ track cccTrendPvalCd\ hapmapLdHotspotCJ CHB + JPT bedGraph 4 Hotspots of Linkage Disequilibrium in the Chinese/Japanese HapMap (CHB and JPT) 0 3 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Hotspots of Linkage Disequilibrium in the Chinese/Japanese HapMap (CHB and JPT)\ parent hapmapLdHotspot\ priority 3\ shortLabel CHB + JPT\ track hapmapLdHotspotCJ\ encodeDnaseHs Combined DNAseI HS bed 4 Combined DNAseI HS 0 3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Combined DNAseI HS\ parent encodeDnase\ priority 3\ shortLabel Combined DNAseI HS\ track encodeDnaseHs\ fox2ClipSeqDensityReverseStrand Density Reverse wig 0 1406 FOX2 adaptor-trimmed CLIP-seq Density Reverse Strand 2 3 0 0 0 127 127 127 0 0 0 regulation 0 configurable on\ graphTypeDefault Bar\ longLabel FOX2 adaptor-trimmed CLIP-seq Density Reverse Strand\ maxHeightPixels 128:36:16\ noInherit on\ parent fox2ClipSeqCompViewdensity\ priority 3\ shortLabel Density Reverse\ spanList 1\ subGroups view=density\ track fox2ClipSeqDensityReverseStrand\ type wig 0 1406\ windowingFunction mean\ kiddEichlerDiscAbc12 Discordant ABC12 bed 12 HGSV Individual ABC12 (CEPH) Discordant Clone End Alignments 0 3 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC12 (CEPH) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 3\ shortLabel Discordant ABC12\ track kiddEichlerDiscAbc12\ encodeDNDSlarge dN/dS 0.5 to 1.5 bed 4 + ENCODE Exons dN/dS 0.5 to 1.5 0 3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel ENCODE Exons dN/dS 0.5 to 1.5\ parent encodeDNDS\ priority 3\ shortLabel dN/dS 0.5 to 1.5\ track encodeDNDSlarge\ encodeAffyEc1BrainFrontalLobeSignal EC1 Sgnl BrainF wig 0 62385 Affy Ext Trans Signal (1-base window) (Brain Frontal Lobe) 0 3 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (1-base window) (Brain Frontal Lobe)\ parent encodeAffyEcSignal\ priority 3\ shortLabel EC1 Sgnl BrainF\ track encodeAffyEc1BrainFrontalLobeSignal\ encodeAffyEc1BrainFrontalLobeSites EC1 Sites BrainF bed 3 . Affy Ext Trans Sites (1-base window) (Brain Frontal Lobe) 0 3 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (1-base window) (Brain Frontal Lobe)\ parent encodeAffyEcSites\ priority 3\ shortLabel EC1 Sites BrainF\ track encodeAffyEc1BrainFrontalLobeSites\ encodeEgaspFullEnsembl Ensembl genePred Ensembl Gene Predictions 0 3 22 150 20 138 202 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 22,150,20\ longLabel Ensembl Gene Predictions\ parent encodeEgaspFull\ priority 3\ shortLabel Ensembl\ track encodeEgaspFullEnsembl\ encodeUncFairePeaksChipotle FAIRE ChIPOTle bedGraph 4 University of North Carolina FAIRE Peaks (ChIPOTle) 0 3 0 0 255 50 100 50 0 0 21 chr1,chr4,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chr10,chrX, encodeChrom 0 autoScale off\ color 0,0,255\ longLabel University of North Carolina FAIRE Peaks (ChIPOTle)\ maxHeightPixels 128:24:16\ noInherit on\ parent encodeUncFaire\ priority 3\ shortLabel FAIRE ChIPOTle\ spanList 38\ track encodeUncFairePeaksChipotle\ type bedGraph 4\ viewLimits 0.4:3.7\ windowingFunction mean\ encodeGencodeGenePolymorphicMar07 Gencode Polymorph genePred Gencode Polymorphic 0 3 160 32 240 207 143 247 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 160,32,240\ longLabel Gencode Polymorphic\ parent encodeGencodeGeneMar07\ priority 3\ shortLabel Gencode Polymorph\ track encodeGencodeGenePolymorphicMar07\ encodeGencodeGenePseudoOct05 Gencode Pseudo genePred Gencode Pseudogenes 0 3 0 91 191 127 173 223 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 0,91,191\ longLabel Gencode Pseudogenes\ parent encodeGencodeGeneOct05\ priority 3\ shortLabel Gencode Pseudo\ track encodeGencodeGenePseudoOct05\ ntHumChimpCodingDiff H-C Coding Diffs bed 9 . Neandertal Alleles in Human/Chimp Coding Non-synonymous Differences in Human Lineage 0 3 0 0 0 127 127 127 0 0 24 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX,chrY,\ This track displays Neandertal alleles for human-chimp protein-coding\ differences on the human lineage using orangutan as the outgroup to \ determine which allele is more likely to be ancestral. \
\ \\ Neandertal ancestral alleles are colored blue; derived \ (human) alleles are colored green. \
\\ The item names show the number of Neandertal reads for the ancestral\ and derived alleles, followed by the ancestral and derived codons enclosed in\ parentheses.\ For example, if no Neandertal reads matched the ancestral base G and\ three Neandertal reads matched the derived base A, and the ancestral and\ derived codons were GTA and ATA respectively, then the item name would\ be "0G>3A(GTA>ATA)".\ If N Neandertal reads match neither ancestral nor derived\ base, then a "+N?" is added before the codons\ (i.e. "0G>3A+N?(GTA>ATA)").\
\ \\ Neandertal DNA was extracted from a ~49,000-year-old bone\ (Sidrón 1253), which was excavated in El Sidrón cave,\ Asturias, Spain. Non-synonymous changes that occurred on the human\ lineage since the ancestral split with chimpanzee were identified by\ aligning human, chimpanzee and orangutan protein sequences for all\ orthologous proteins in\ HomoloGene\ (Build 58) . Comparison of these three species allowed the assignment\ of human/chimpanzee differences to their respective evolutionary\ lineages. An Agilent custom oligonucleotide array covering the 13,841\ non-synonymous changes inferred to have occurred in the human lineage\ was designed and used to capture Neandertal sequences. \
\ \\ Burbano HA, Hodges E, Green RE, Briggs AW, Krause J, Meyer M, Good JM, \ Maricic T, Johnson PLF, Xuan Z et al.\ Targeted Investigation of the Neandertal Genome by Array-Based \ Sequence Capture.\ Science. 2010 7 May;328(5979):723-5.\
\ neandertal 1 chromosomes chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX,chrY\ group neandertal\ itemRgb on\ longLabel Neandertal Alleles in Human/Chimp Coding Non-synonymous Differences in Human Lineage\ noScoreFilter .\ priority 3\ shortLabel H-C Coding Diffs\ track ntHumChimpCodingDiff\ type bed 9 .\ visibility hide\ encodeHapMapCovJPT HapMap Cov JPT wig 0.0 100.0 HapMap Resequencing Coverage Japanese (JPT) 0 3 0 0 0 127 127 127 0 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 0 longLabel HapMap Resequencing Coverage Japanese (JPT)\ parent encodeHapMapCov\ priority 3\ shortLabel HapMap Cov JPT\ track encodeHapMapCovJPT\ hapmapSnpsCHB HapMap SNPs CHB bed 6 + HapMap SNPs from the CHB Population (Han Chinese in Beijing, China) 0 3 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the CHB Population (Han Chinese in Beijing, China)\ parent hapmapSnps\ priority 3\ shortLabel HapMap SNPs CHB\ track hapmapSnpsCHB\ hgdpHzyMideast Hetzgty Mideast bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (Mideast) 0 3 0 0 200 127 127 227 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 0,0,200\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (Mideast)\ parent hgdpHzy\ priority 3\ shortLabel Hetzgty Mideast\ track hgdpHzyMideast\ hgdpIhsEurope iHS Europe bedGraph 4 Human Genome Diversity Project iHS (Europe) 0 3 240 144 0 247 199 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 color 240,144,0\ longLabel Human Genome Diversity Project iHS (Europe)\ parent hgdpIhs\ priority 3\ shortLabel iHS Europe\ track hgdpIhsEurope\ encodeGencodeIntergenicProximal Intergenic Prox bed 4 . Gencode Intergenic Proximal Regions 0 3 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Gencode Intergenic Proximal Regions\ parent encodeGencodeRegions\ priority 3\ shortLabel Intergenic Prox\ track encodeGencodeIntergenicProximal\ encodeAllIntersectEl Intersect bed 5 . TBA and MLAGAN PhastCons/BinCons/GERP Intersection Conserved Elements 0 3 80 145 105 167 200 180 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,145,105\ longLabel TBA and MLAGAN PhastCons/BinCons/GERP Intersection Conserved Elements\ parent encodeAllElements\ priority 3\ shortLabel Intersect\ track encodeAllIntersectEl\ encodeAllIntronsDistal Intronic Dist bed 4 Consensus Intronic Distal 0 3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Consensus Intronic Distal\ parent encodeWorkshopSelections\ priority 3\ shortLabel Intronic Dist\ track encodeAllIntronsDistal\ iscaRetrospectivePathogenic ISCA Ret Pathog. gvf Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Pathogenic) 0 3 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Pathogenic)\ parent iscaRetrospectiveComposite\ priority 3\ shortLabel ISCA Ret Pathog.\ track iscaRetrospectivePathogenic\ hapmapLdChb LD CHB bed 4 + Linkage Disequilibrium for the Han Chinese (CHB) 0 3 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Linkage Disequilibrium for the Han Chinese (CHB)\ parent hapmapLd\ priority 3\ shortLabel LD CHB\ track hapmapLdChb\ encodeUcsdChipHeLaH3H4tmH3K4_p0 LI H3K4me3 -gIF bedGraph 4 Ludwig Institute ChIP-chip: H3K4me3 ab, HeLa cells, no gamma interferon 0 3 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3K4me3 ab, HeLa cells, no gamma interferon\ parent encodeLIChIPgIF\ priority 3\ shortLabel LI H3K4me3 -gIF\ track encodeUcsdChipHeLaH3H4tmH3K4_p0\ encodeUcsdNgHeLaRnap_p0 LI Ng Pol2 -gIF bedGraph 4 Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, Pol2, no gamma interferon 0 3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, Pol2, no gamma interferon\ parent encodeUcsdNgGif\ priority 3\ shortLabel LI Ng Pol2 -gIF\ track encodeUcsdNgHeLaRnap_p0\ encodeUcsdChipRnapImr90_f LI Pol2 IMR90 bedGraph 4 Ludwig Institute ChIP-chip: Pol2 8WG16 ab, IMR90 cells 0 3 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: Pol2 8WG16 ab, IMR90 cells\ parent encodeLIChIP\ priority 3\ shortLabel LI Pol2 IMR90\ track encodeUcsdChipRnapImr90_f\ encodeMlaganGerpEl MLAGAN GERP bed 5 . MLAGAN GERP Conserved Elements 0 3 120 80 120 187 167 187 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 120,80,120\ longLabel MLAGAN GERP Conserved Elements\ parent encodeMlaganElements\ priority 3\ shortLabel MLAGAN GERP\ track encodeMlaganGerpEl\ encodeMlaganGerpCons MLAGAN GERP Cons wig 0.0 3.0 MLAGAN GERP Conservation 0 3 120 80 120 187 167 187 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 0 autoScale Off\ color 120,80,120\ longLabel MLAGAN GERP Conservation\ maxHeightPixels 100:25:11\ noInherit on\ parent encodeMlaganCons\ priority 3\ shortLabel MLAGAN GERP Cons\ track encodeMlaganGerpCons\ type wig 0.0 3.0\ windowingFunction mean\ snp131NonUnique Non-Unique SNPs bed 6 + Non-uniquely Mapped Simple Nucleotide Polymorphisms (dbSNP build 131) 0 3 0 0 0 127 127 127 0 0 0 http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?type=rs&rs=$$ varRep 1 longLabel Non-uniquely Mapped Simple Nucleotide Polymorphisms (dbSNP build 131)\ parent snp131Composite\ priority 3\ shortLabel Non-Unique SNPs\ subGroups view=nonu\ track snp131NonUnique\ visibility hide\ numtSMitochondrion NumtS on mitochon bed 6 . Human NumtS on mitochondrion 0 3 0 60 120 127 157 187 1 0 0\ NumtS (Nuclear mitochondrial sequences) are mitochondrial fragments inserted in nuclear genomic sequences. The most credited hypothesis concerning their generation suggests that in presence of mutagenic agents or under stress conditions fragments of mtDNA escape from mitochondria, reach the nucleus and insert into chromosomes during break repair, although NumtS can derive from duplication of genomic fragments. NumtS may be cause of contamination during human mtDNA sequencing and hence frequent false low heteroplasmic evidences have been reported.\ The Bioinformatics group chaired by M.Attimonelli (Bari, Italy) has produced the RHNumtS compilation annotating more than 500 Human NumtS. To allow the scientific community to access to the compilation and to perform genomics comparative analyses inclusive of the NumtS data, the group has designed the Human NumtS tracks below described.\
\ \\ The NumtS tracks show the High Score Pairs (HSPs) obtained by aligning the mitochondrial reference genome (NC_012920) with the hg18 release of the human genome.\
\\ The "NumtS mitochondrial sequences" track shows the mapping of the HSPs returned by BlastN on the nuclear genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the mitochondrial mapping is provided, thus allowing a fast cross among the NumtS genomic contexts.\
\\ The "NumtS assembled" track shows items obtained by assembling HSPs annotated in the "NumtS" track fulfilling the following conditions:\
\ Exceptions for the second condition arise when a long repetitive element is present between two HSPs.\
\\ The "NumtS on mitochondrion" track shows the mapping of the HSPs on the mitochondrial genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the nuclear mapping is provided.\
\ \\ The "NumtS on mitochondrion with chromosome placement" shows the mapping of the HSPs on the mitochondrial genome, but the items are coloured according to the colours assigned to each human chromosome on the UCSC genome browser. No shading is here provided. For every item, a link pointing to the nuclear mapping is provided.\
\\ NumtS mappings were obtained by running Blast2seq (program: BlastN) between each chromosome of of the Human Genome hg18 build and the human mitochondrial reference sequence (rCRS, AC: NC_012920), fixing the e-value threshold to 1e-03. The assembling of the HSPs was performed with spreadsheet interpolation and manual inspection.\
\ \\ NumtS predicted in silico were validated by carrying out PCR amplification and sequencing on blood-extracted DNA of a healthy individual of European origin. PCR amplification was successful for 275 NumtS and provided amplicons of the expected length. All PCR fragments were sequenced on both strands, and submitted to the EMBL databank.\
\\ Furthermore, 541 NumtS were validated by merging NumtS nuclear coordinates with HapMap annotations. Our analysis has been carried on eight HapMap individuals (NA18517, NA18507, NA18956, NA19240, NA18555, NA12878, NA19129, NA12156). For each sample, clones with a single best concordant placement (according to the fosmid end-sequence-pair analysis described in Kidd et al., 2008), have been considered. The analysis showed that 541 NumtS (at least 30bp for each one) had been sequenced in such samples.\
\ \\ These data were provided by Domenico Simone and Marcella Attimonelli at Department of Biochemistry and Molecular Biology "Ernesto Quagliariello" (University of Bari, Italy). Primer designing was carried out by Francesco Calabrese and Giuseppe Mineccia. PCR validation was carried out by Martin Lang, Domenico Simone and Giuseppe Gasparre. Merging with HapMap annotations has been performed by Domenico Simone.\
\ \\ Simone D, Calabrese FM, Lang M, Gasparre G, Attimonelli M: Validation and UCSC tracks of the extended RHNumtS compilation (submitted). \
\ \\ Lascaro D, Castellana S, Gasparre G, Romeo G, Saccone S, Attimonelli M. The RHNumtS compilation: features and bioinformatics approaches to locate and quantify Human NumtS. BMC\ Genomics. 2008 June 3;9:267.\
\ \\ Kidd JM, Cooper GM, Donahue WF, et al.\ \ Mapping and sequencing of structural variation from eight human genomes.\ Nature, 2008, 453(7191):56-64.\
\ \ \ \ varRep 1 color 0,60,120\ html numtSeq\ longLabel Human NumtS on mitochondrion\ parent numtSeq\ priority 3\ shortLabel NumtS on mitochon\ track numtSMitochondrion\ type bed 6 .\ useScore 1\ snpRecombRatePerlegen Perlegen bedGraph 4 Oxford Recombination Rates from Perlegen Data 0 3 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Oxford Recombination Rates from Perlegen Data\ parent snpRecombRate\ priority 3\ shortLabel Perlegen\ track snpRecombRatePerlegen\ encodeGencodeRaceFragsColon RACEfrags Colon genePred Gencode RACEfrags from Colon 0 3 250 5 255 252 130 255 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 250,5,255\ longLabel Gencode RACEfrags from Colon\ parent encodeGencodeRaceFrags\ priority 3\ shortLabel RACEfrags Colon\ track encodeGencodeRaceFragsColon\ cnpSebat Sebat CNPs bed 4 + Copy Number Polymorphisms from ROMA (Sebat) 0 3 0 0 0 127 127 127 0 0 0\ This track shows 81 regions detected as putative copy number polymorphisms by representational \ oligonucleotide microarray analysis (ROMA) in a population of 20 normal individuals. \ \
\
\
Following digestion of with BglII or HindIII, genomic DNA was hybridized to a \
custom array consisting of 85,000 oligonucleotide probes, \
probes were selected to be free of common repeats and have unique homology within the human genome. \
\
The average resolution of the array is ~35 kb, however only intervals in which 3 consecutive probes \
showed concordant signal were scored as CNPs. \
\
All hybridizations were performed in duplicate incorporating a dye-reversal, \
with the false positive rate estimated to be ~6%.\
\
Note that CNP intervals as detailed by Sebat et al. (2004) were converted from the \
April 2003 (build33) into the July 2003 (build34) assembly using liftover.\
\
\
\ Sebat J, Lakshmi B, Troge J, Alexander J,\ Young J, Lundin P, Maner S, Massa H, Walker M, Chi M, Navin N, Lucito R, Healy\ J, Hicks J, Ye K, Reiner A, Gilliam TC, Trask B, Patterson N, Zetterberg A, Wigler\ M (2004) Large-Scale\ Copy Number Polymorphism in the Human Genome. Science 305:525-528\ varRep 1 longLabel Copy Number Polymorphisms from ROMA (Sebat)\ noInherit on\ parent cnp\ priority 3\ shortLabel Sebat CNPs\ track cnpSebat\ type bed 4 +\ decodeSexAveragedNonCarrier Sex Average Non-carrier bigWig 0.0 113.023 deCODE recombination map, sex-average non-carrier 2 3 252 79 89 253 167 172 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 252,79,89\ configurable on\ longLabel deCODE recombination map, sex-average non-carrier\ parent avgView off\ priority 3\ shortLabel Sex Average Non-carrier\ subGroups view=avg\ track decodeSexAveragedNonCarrier\ type bigWig 0.0 113.023\ encodeSangerChipH3K4me3 SI H3K4m3 GM06990 bedGraph 4 Sanger Institute ChIP/Chip (H3K4me3 ab, GM06990 cells) 0 3 10 10 130 132 132 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,130\ longLabel Sanger Institute ChIP/Chip (H3K4me3 ab, GM06990 cells)\ parent encodeSangerChipH3H4\ priority 3\ shortLabel SI H3K4m3 GM06990\ track encodeSangerChipH3K4me3\ encodeRegulomeQualitySKNSH SKNSH bed 5 . SKNSH Quality 0 3 120 50 180 187 152 217 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 color 120,50,180\ longLabel SKNSH Quality\ parent encodeRegulomeQuality\ priority 3\ shortLabel SKNSH\ track encodeRegulomeQualitySKNSH\ encodeRegulomeProbSKNSH SKNSH bedGraph 4 SKNSH DNaseI HSs 0 3 120 50 180 187 152 217 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 120,50,180\ longLabel SKNSH DNaseI HSs\ parent encodeRegulomeProb\ priority 3\ shortLabel SKNSH\ track encodeRegulomeProbSKNSH\ encodeRegulomeBaseSKNSH SKNSH wig 0.0 3.0 SKNSH DNaseI Sensitivity 0 3 120 50 180 187 152 217 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 120,50,180\ longLabel SKNSH DNaseI Sensitivity\ parent encodeRegulomeBase\ priority 3\ shortLabel SKNSH\ track encodeRegulomeBaseSKNSH\ tajdSnpEd SNPs ED bed 4 . SNPs from European Descent 0 3 200 100 0 0 100 200 0 0 0 varRep 1 altColor 0,100,200\ color 200,100,0\ longLabel SNPs from European Descent\ parent tajdSnp\ priority 3\ shortLabel SNPs ED\ track tajdSnpEd\ stanfordChipHepG2GABP Stan HepG2 GABP bedGraph 4 Stanford ChIP-chip (HepG2 cells, GABP ChIP) 0 3 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (HepG2 cells, GABP ChIP)\ parent stanfordChip\ priority 3\ shortLabel Stan HepG2 GABP\ track stanfordChipHepG2GABP\ encodeStanfordChipHepG2GABP Stan HepG2 GABP bedGraph 4 Stanford ChIP-chip (HepG2 cells, GABP ChIP) 0 3 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HepG2 cells, GABP ChIP)\ parent encodeStanfordChipJohnson\ priority 3\ shortLabel Stan HepG2 GABP\ track encodeStanfordChipHepG2GABP\ encodeStanfordChipJurkatSp1 Stan Jurkat Sp1 bedGraph 4 Stanford ChIP-chip (Jurkat cells, Sp1 ChIP) 0 3 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (Jurkat cells, Sp1 ChIP)\ parent encodeStanfordChip\ priority 3\ shortLabel Stan Jurkat Sp1\ track encodeStanfordChipJurkatSp1\ encodeStanfordMethHCT116 Stan Meth HCT116 bedGraph 4 Stanford Methylation Digest (HCT116 cells) 0 3 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (HCT116 cells)\ parent encodeStanfordMeth\ priority 3\ shortLabel Stan Meth HCT116\ track encodeStanfordMethHCT116\ encodeStanfordMethSmoothedHCT116 Stan Meth Sc HCT116 bedGraph 4 Stanford Methylation Digest Smoothed Score (HCT116 cells) 0 3 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (HCT116 cells)\ parent encodeStanfordMethSmoothed\ priority 3\ shortLabel Stan Meth Sc HCT116\ track encodeStanfordMethSmoothedHCT116\ encodeStanfordPromotersCRL1690 Stan Pro CRL1690 bed 9 + Stanford Promoter Activity (CRL1690 cells) 0 3 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (CRL1690 cells)\ parent encodeStanfordPromoters\ priority 3\ shortLabel Stan Pro CRL1690\ track encodeStanfordPromotersCRL1690\ encodeStanfordChipSmoothedJurkatSp1 Stan Sc Jurkat Sp1 bedGraph 4 Stanford ChIP-chip Smoothed Score (Jurkat cells, Sp1 ChIP) 0 3 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip Smoothed Score (Jurkat cells, Sp1 ChIP)\ parent encodeStanfordChipSmoothed\ priority 3\ shortLabel Stan Sc Jurkat Sp1\ track encodeStanfordChipSmoothedJurkatSp1\ stsMarker STS Markers bed 5 + STS Markers on Genetic (blue), FISH (green) and RH (black) Maps 1 3 0 0 0 128 128 255 0 0 0
This track shows locations of Sequence Tagged Site (STS) markers\ along the draft assembly.
\ \The track filter can be used to change the color or include/exclude a map data set \ within the track. This is helpful when many items are shown in the track\ display, especially when only some are relevant to the current task. To use the\ filter:\
Many thanks to the researchers who worked on these\ maps, and to Greg Schuler, Arek Kasprzyk, Wonhee Jang,\ Terry Furey and Sanja Rogic for helping\ process the data. Additional data on the individual maps can be\ found at the following links:
\\ This track shows the S score (Z-score +- variance) for positive selection in humans within a 100 kb\ window surrounding each polymorphic position in the five modern human\ sequences and the human reference genome as described in Green et al.,\ Supplemental Online Material Text 13, Burbano et al.. \ A positive score indicates more derived alleles in Neandertal than\ expected, given the frequency of derived alleles in human. A negative\ score indicates fewer derived alleles in Neandertal, and may indicate an \ episode of positive selection in early humans.\
\\ To view the polymorphic sites on which the S score was computed, open\ the S\ SNPs track.
\ \\ Green et al. identified single-base sites that are\ polymorphic among five modern human genomes of diverse ancestry\ (in the \ Modern Human Seq \ track) plus the human reference\ genome. CpG sites were excluded because of the higher mutation rate\ at CpG sites.\ The ancestral or derived state of each single nucleotide polymorphism\ (SNP) was determined by comparison with the chimpanzee genome.\ The SNPs are displayed in the \ S SNPs track.\ The fact that SNPs with higher frequencies of the derived\ allele in modern humans were more likely to show the derived allele in\ Neandertals was used to calculate the expected number of derived alleles in\ Neandertal within a given region of the human genome.\ The observed numbers of derived alleles were compared to the expected\ numbers to identify regions where the Neandertals carry fewer\ derived alleles than expected given the human allelic states. The\ score assigned to each SNP is\ the z-score of the observed and expected counts relative\ to the variance in the number of the expected counts of derived\ alleles within the 100,000-base window around the SNP.\
\\ Note: In order to display both the score and the variance within\ the same track in the UCSC Genome Browser, the scores were modified as\ follows: at the SNP position, the value displayed is the score plus\ the variance. At the position following the SNP position, the score\ minus the variance is displayed. When viewing large regions (at least\ 100,000 bases), the default mean+whiskers condensation of the scores\ provides an indication of the range covered by the variance.\
\ \\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\ neandertal 0 autoScale off\ chromosomes chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX\ group neandertal\ longLabel Selective Sweep Scan (S) on Neandertal vs. Human Polymorphisms (Z-Score +- Variance)\ maxHeightPixels 128:32:11\ priority 3.1\ shortLabel Sel Swp Scan (S)\ track ntSssZScorePMVar\ type bigWig -8.8332 33.637199\ viewLimits -5:1\ visibility hide\ windowingFunction mean\ yLineMark -2\ yLineOnOff on\ ntSssTop5p 5% Lowest S bed 5 Selective Sweep Scan (S): 5% Smallest S scores 0 3.2 0 0 0 127 127 127 1 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX,\ This track shows regions of the human genome with a strong signal for depletion\ of Neandertal-derived alleles (regions from the \ Sel Swp Scan \ (S) track with S scores in the lowest 5%),\ which may indicate an episode of positive selection in early\ humans.
\ \\ Grayscale shading is used as a rough indicator of the strength of the\ score; the darker the item, the stronger its negative score. The\ strongest negative score (-8.7011) is shaded black, and the shading\ lightens from dark to light gray as the negative score weakens (weakest\ score is -4.3202).\
\ \\ Green et al. identified single-base sites that are\ polymorphic among five modern human genomes of diverse ancestry\ (in the \ Modern Human \ Seq track) plus the human reference\ genome, and determined ancestral or derived state of each\ single nucleotide polymorphism (SNP) by comparison with the chimpanzee\ genome. The SNPs are displayed in the \ S SNPs track.\ The human allele states were used to estimate an expected number\ of derived alleles in Neandertal in the 100,000-base window around \ each SNP, and a measure called the S score was developed, displayed in the \ Sel Swp Scan \ (S) track, to compare the observed\ number of Neandertal alleles in each window to the expected number. \ An S score significantly less than zero indicates a reduction of \ Neandertal-derived alleles (or an increase of human-derived alleles not found in\ Neandertal), consistent with the scenario of positive selection in\ the human lineage since divergence from Neandertals.\
\\ Genomic regions of 25,000 or more bases in which all polymorphic sites\ were at least 2 standard deviations below the expected value were\ identified, and S was recomputed on each such region. Regions with S\ scores in the lowest 5% (strongest negative scores) were prioritized\ for further analysis as described in Green et al..\
\ \\ This track was produced at UCSC using data generated by\ Ed Green.\
\ \\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\ \ neandertal 1 bedNameLabel Score\ chromosomes chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX\ group neandertal\ longLabel Selective Sweep Scan (S): 5% Smallest S scores\ noScoreFilter .\ priority 3.2\ shortLabel 5% Lowest S\ track ntSssTop5p\ type bed 5\ useScore .\ visibility hide\ ntSssSnps S SNPs bed 9 SNPS Used for Selective Sweep Scan (S) 0 3.3 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX,\ This track shows single nucleotide polymorphisms (SNPs) used in a\ genome-wide scan for signals of positive selection in the human\ lineage since divergence from the Neandertal lineage.
\\ SNP labels represent the ancestral (A) or derived (D) status,\ determined by comparison with the chimpanzee reference genome,\ of alleles in the human reference assembly, five modern human\ genomes of diverse ancestry (see the \ Modern Human Seq \ track), and Neandertals. \ The first six characters of an item name show the status of the allele\ (A, D or _ if not known)\ in six genomes: human reference, San, Yoruba, Han, Papuan, and French, in that\ order. These characters are followed by a colon, the number of derived alleles\ found in Neandertals, a comma and the number of ancestral alleles found in\ Neandertals. \ For example, a SNP labeled AAADAA:0D,2A has the ancestral allele in\ the reference human genome and in all of the modern human genomes\ except Han. Among Neandertals, two instances of the ancestral allele \ were found, but no instances of the derived allele. \
\\ SNPs are colored red when at least four of the six modern human\ genomes are derived while all observed Neandertal alleles are\ ancestral. An overrepresentation of such SNPs in a region would \ imply that the region had undergone positive selection in the \ modern human lineage since divergence from Neandertals; the \ Sel Swp Scan \ (S) track displays a signal calculated from these SNPs, and the \ 5% Lowest S\ track contains the regions in which the signal most strongly indicates selective\ pressure on the modern human lineage.\
\ \\ Red\ SNPs are those where at least four of the six modern human\ genomes are derived while all observed Neandertal alleles are\ ancestral. All other SNPs are black.\
\ \\ For the purposes of this analysis, SNPs were defined as single-base\ sites that are polymorphic among 5 modern human genomes of diverse\ ancestry (see the \ Modern Human Seq \ track) plus the human reference\ genome. SNPs at CpG sites were excluded because of the higher\ mutation rate at CpG sites. \ Ancestral or derived state was determined by comparison with the \ chimpanzee genome. \
\ \\ This track was produced at UCSC using data generated by\ Ed Green.\
\ \\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\ neandertal 1 chromosomes chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX\ exonArrows off\ group neandertal\ itemRgb on\ longLabel SNPS Used for Selective Sweep Scan (S)\ noScoreFilter .\ priority 3.3\ shortLabel S SNPs\ track ntSssSnps\ type bed 9\ visibility hide\ ntOoaHaplo Cand. Gene Flow bed 9 + Candidate Regions for Gene Flow from Neandertal to Non-African Modern Humans 0 3.8 0 0 0 127 127 127 0 0 10 chr1,chr4,chr5,chr6,chr9,chr10,chr15,chr17,chr20,chr22,\ This track shows 13 regions of the human genome in which there is\ considerably more haplotype diversity among non-African genomes than\ within African genomes. A prediction of Neandertal-to-modern human\ gene flow is that these deeply divergent haplotypes which exist only in\ non-African populations entered the human gene pool from Neandertals.\ Of the 12 candidate gene flow regions with tag SNP data, there are 10\ regions in which Neandertals match the deep haplotype clade unique to\ non-Africans (out of Africa, OOA) instead of the cosmopolitan\ haplotype clade shared by Africans and non-Africans (cosmopolitan,\ COS).\
\\ The table below was copied from Table 5, "Non-African haplotypes \ match Neandertal at an unexpected rate", from Green et al.:\
\ \\
\\ Region Genomic Size ST \Average
Frequency
in OOAAM DM AN DN \Qualitative
Assessment\ chr1:168,110,001-168,220,000 110,000 2.9 6.3% 5 10 1 0 OOA \ chr1:223,760,001-223,910,000 150,000 2.8 6.3% 1 4 0 0 OOA \ chr4:171,180,001-171,280,000 100,000 1.9 5.2% 1 2 0 0 OOA \ chr5:28,950,001-29,070,000 120,000 3.8 3.1% 16 16 6 0 OOA \ chr6:66,160,001-66,260,000 100,000 5.7 28.1% 6 6 0 0 OOA \ chr9:32,940,001-33,040,000 100,000 2.8 4.2% 7 14 0 0 OOA \ chr10:4,820,001-4,920,000 100,000 2.6 9.4% 9 5 0 0 OOA \ chr10:38,000,001-38,160,000 160,000 3.5 8.3% 5 9 2 0 OOA \ chr10:69,630,001-69,740,000 110,000 4.2 19.8% 2 2 0 1 OOA \ chr15:45,250,001-45,350,000 100,000 2.5 1.1% 5 6 1 0 OOA \ chr17:35,500,001-35,600,000 100,000 2.9 (no tags) n/a n/a n/a n/a n/a \ chr20:20,030,001-20,140,000 110,000 5.1 64.6% 0 0 10 5 COS \ chr22:30,690,001-30,820,000 130,000 3.5 4.2% 0 2 5 2 COS
\ ST = estimated ratio of OOA/African gene tree depth.
\ Average Frequency in OOA = average (across tag SNPs in the region) of the population frequency in the 48 OOA individuals of the OOA-only allele for each tag SNP.
\ AM = Neandertal has ancestral allele and matches OOA-specific clade.
\ DM = Neandertal has derived allele and matches OOA-specific clade.
\ AN = Neandertal has ancestral allele and does not match OOA-specific clade.
\ DN = Neandertal has derived allele and does not match OOA-specific clade.\
\ A region is colored green if its qualitative assessment is OOA, blue\ if COS, and gray if unknown (no tag SNPs in region).\
\ \\ Green et al. used 1,263,750 Perlegen Class A SNPs, identified\ in 71 individuals of diverse ancestry (see Hinds et al.), to\ identify 13 candidate gene flow regions (Supplemental Online Materials\ Text 17).\ 24 individuals of European ancestry and 24 individuals of Han Chinese\ ancestry were used to represent the non-African population, and the\ remaining 23 individuals, of African American ancestry, were used to\ represent the African population.\
\\ From the 1,263,750 Perlegen Class A SNPs, they identified 166 tag\ SNPs that separate (see below) 12 of the haplotype clades in\ non-Africans (OOA) from the cosmopolitan haplotype clades shared between\ Africans and non-Africans (COS) and for which they had data from the\ Neandertals. Of the 13 regions, one had no tag SNPs so could not be\ assessed, two were COS, and 10 were OOA (see final column Table 1).\
\\ Overall, the Neandertals match the deep clade unique to non-Africans\ (OOA) at 133 of the 166 tag SNPs. They assessed the rate at which\ Neandertal matches each of these clades by further subdividing the 133\ tag SNPs based on their ancestral or derived status in\ Neandertal and whether they matched the OOA-specific clade or not.\ Candidate regions were qualitatively assessed to be OOA matches for \ Neandertal when the proportion of tag SNPs matching the OOA-specific \ clade is much more than 50%.\
\ \\ This track was produced at UCSC using data generated by\ Ed Green.\
\ \\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\\ Hinds DA, Stuve LL, Nilsen GB, Halperin E, Eskin E, Ballinger DG, Frazer KA, \ Cox DR.\ Whole-genome patterns of common DNA variation in three human populations.\ Science. 2005 Feb 18;307(5712):1072-9.
\ neandertal 1 chromosomes chr1,chr4,chr5,chr6,chr9,chr10,chr15,chr17,chr20,chr22\ exonArrows off\ group neandertal\ itemRgb on\ longLabel Candidate Regions for Gene Flow from Neandertal to Non-African Modern Humans\ noScoreFilter .\ priority 3.8\ shortLabel Cand. Gene Flow\ track ntOoaHaplo\ type bed 9 +\ visibility hide\ netRBestOtoGar1 Bushbaby RBest Net netAlign otoGar1 chainOtoGar1 Bushbaby (Dec. 2006 (Broad/otoGar1)) Reciprocal Best Alignment Net 0 4 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Reciprocal Best Alignment Net\ otherDb otoGar1\ parent rBestNet\ priority 4\ shortLabel $o_Organism RBest Net\ spectrum on\ track netRBestOtoGar1\ type netAlign otoGar1 chainOtoGar1\ visibility hide\ snpArrayAffy250Nsp Affy 250KNsp bed 6 + Affymetrix GeneChip Human Mapping 250K Nsp 0 4 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Affymetrix GeneChip Human Mapping 250K Nsp\ parent snpArray off\ priority 4\ shortLabel Affy 250KNsp\ track snpArrayAffy250Nsp\ type bed 6 +\ encodeAffyChIpHl60SitesBrg1Hr02 Affy Brg1 RA 2h bed 3 . Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 2hrs) Sites 0 4 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 4\ shortLabel Affy Brg1 RA 2h\ subGroups factor=Brg1 time=2h\ track encodeAffyChIpHl60SitesBrg1Hr02\ encodeAffyChIpHl60SignalStrictH3K9K14DHr32 Affy H3K9ac2 32h wig -2.78 3.97 Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 32hrs) Strict Signal 0 4 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 32hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 4\ shortLabel Affy H3K9ac2 32h\ subGroups factor=H3K9K14ac2 time=32h\ track encodeAffyChIpHl60SignalStrictH3K9K14DHr32\ encodeAffyChIpHl60SitesStrictH3K9K14DHr32 Affy H3K9ac2 32h bed 3 . Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 32hrs) Strict Sites 0 4 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 32hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 4\ shortLabel Affy H3K9ac2 32h\ subGroups factor=H3K9K14ac2 time=32h\ track encodeAffyChIpHl60SitesStrictH3K9K14DHr32\ encodeAffyChIpHl60PvalStrictH3K9K14DHr32 Affy H3K9ac2 32h wig 0 696.62 Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 32hrs) Strict P-Value 0 4 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP-chip (H3K9K14ac2, retinoic acid-treated HL-60, 32hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 4\ shortLabel Affy H3K9ac2 32h\ subGroups factor=H3K9K14ac2 time=32h\ track encodeAffyChIpHl60PvalStrictH3K9K14DHr32\ encodeAffyRnaHl60SitesHr02IntronsProximal Affy In PrxHL60 2h bed 4 . Affy Intronic Proximal HL60 Retinoic 2hr Transfrags 0 4 212 0 44 233 127 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 212,0,44\ longLabel Affy Intronic Proximal HL60 Retinoic 2hr Transfrags\ parent encodeNoncodingTransFrags\ priority 4\ shortLabel Affy In PrxHL60 2h\ subGroups region=intronicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr02IntronsProximal\ encodeAffyRnaHl60SignalHr02 Affy RNA RA 2h wig -1168.00 1686.5 Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 2hrs) Signal 0 4 50 50 180 152 152 217 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,50,180\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 2hrs) Signal\ parent encodeAffyRnaSignal\ priority 4\ shortLabel Affy RNA RA 2h\ track encodeAffyRnaHl60SignalHr02\ encodeAffyRnaHl60SitesHr02 Affy RNA RA 2h bed 3 . Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 2hrs) Sites 0 4 50 50 180 152 152 217 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,50,180\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyRnaTransfrags\ priority 4\ shortLabel Affy RNA RA 2h\ track encodeAffyRnaHl60SitesHr02\ encodeHapMapAlleleFreqYRI Allele Freq YRI bed 6 + HapMap Minor Allele Frequencies Yoruban (YRI) 0 4 0 0 0 127 127 127 1 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 1 longLabel HapMap Minor Allele Frequencies Yoruban (YRI)\ parent encodeHapMapAlleleFreq\ priority 4\ shortLabel Allele Freq YRI\ track encodeHapMapAlleleFreqYRI\ encodeEgaspUpdAugustusAny August/EST/Ms Upd genePred Augustus + EST/Protein Evidence + Mouse Homology Gene Predictions 0 4 12 100 100 133 177 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,100,100\ longLabel Augustus + EST/Protein Evidence + Mouse Homology Gene Predictions\ parent encodeEgaspUpdate\ priority 4\ shortLabel August/EST/Ms Upd\ track encodeEgaspUpdAugustusAny\ encodeEgaspPartAugustusEst Augustus/EST genePred Augustus + EST/Protein Evidence Gene Predictions 0 4 12 65 165 133 160 210 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,65,165\ longLabel Augustus + EST/Protein Evidence Gene Predictions\ parent encodeEgaspPartial\ priority 4\ shortLabel Augustus/EST\ track encodeEgaspPartAugustusEst\ encodeBuFirstExonKidney BU Kidney bed 12 + Boston University First Exon Activity in Kidney 0 4 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Kidney\ parent encodeBuFirstExon\ priority 4\ shortLabel BU Kidney\ track encodeBuFirstExonKidney\ cccTrendPvalHt CCC Hypertension chromGraph Case Control Consortium hypertension trend -log10 P-value 0 4 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel Case Control Consortium hypertension trend -log10 P-value\ parent caseControl\ priority 4\ shortLabel CCC Hypertension\ track cccTrendPvalHt\ kiddEichlerDiscAbc11 Discordant ABC11 bed 12 HGSV Individual ABC11 (China) Discordant Clone End Alignments 0 4 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC11 (China) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 4\ shortLabel Discordant ABC11\ track kiddEichlerDiscAbc11\ encodeDNDShuge dN/dS 1.5 to inf bed 4 + ENCODE Exons dN/dS 1.5 to inf 0 4 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel ENCODE Exons dN/dS 1.5 to inf\ parent encodeDNDS\ priority 4\ shortLabel dN/dS 1.5 to inf\ track encodeDNDShuge\ encodeAffyEc51BrainFrontalLobeSignal EC51 Sgnl BrainF wig 0 62385 Affy Ext Trans Signal (51-base window) (Brain Frontal Lobe) 0 4 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (51-base window) (Brain Frontal Lobe)\ parent encodeAffyEcSignal\ priority 4\ shortLabel EC51 Sgnl BrainF\ track encodeAffyEc51BrainFrontalLobeSignal\ encodeAffyEc51BrainFrontalLobeSites EC51 Site BrainF bed 3 . Affy Ext Trans Sites (51-base window) (Brain Frontal Lobe) 0 4 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (51-base window) (Brain Frontal Lobe)\ parent encodeAffyEcSites\ priority 4\ shortLabel EC51 Site BrainF\ track encodeAffyEc51BrainFrontalLobeSites\ encodeEgaspFullExogean Exogean genePred Exogean Gene Predictions 0 4 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel Exogean Gene Predictions\ parent encodeEgaspFull\ priority 4\ shortLabel Exogean\ track encodeEgaspFullExogean\ decodeFemale Female bigWig 0.0 90.808 deCODE recombination map, female 0 4 200 60 200 227 157 227 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 200,60,200\ configurable on\ longLabel deCODE recombination map, female\ parent femaleView\ priority 4\ shortLabel Female\ subGroups view=female\ track decodeFemale\ type bigWig 0.0 90.808\ fox2ClipClusters FOX2 clusters bed 4 . FOX2 binding site clusters 3 4 0 0 0 127 127 127 0 0 0 regulation 1 longLabel FOX2 binding site clusters\ noInherit on\ noScoreFilter .\ parent fox2ClipSeqCompViewclusters\ priority 4\ shortLabel FOX2 clusters\ subGroups view=clusters\ track fox2ClipClusters\ type bed 4 .\ encodeGencodeGenePseudoMar07 Gencode Pseudo genePred Gencode Pseudogenes 0 4 0 91 191 127 173 223 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 0,91,191\ longLabel Gencode Pseudogenes\ parent encodeGencodeGeneMar07\ priority 4\ shortLabel Gencode Pseudo\ track encodeGencodeGenePseudoMar07\ encodeHapMapCovYRI HapMap Cov YRI wig 0.0 100.0 HapMap Resequencing Coverage Yoruban (YRI) 0 4 0 0 0 127 127 127 0 0 7 chr2,chr4,chr7,chr8,chr9,chr12,chr18, encodeVariation 0 longLabel HapMap Resequencing Coverage Yoruban (YRI)\ parent encodeHapMapCov\ priority 4\ shortLabel HapMap Cov YRI\ track encodeHapMapCovYRI\ hapmapSnpsCHD HapMap SNPs CHD bed 6 + HapMap SNPs from the CHD Population (Chinese Ancestry in Metropolitan Denver, CO, US) 0 4 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the CHD Population (Chinese Ancestry in Metropolitan Denver, CO, US)\ parent hapmapSnps\ priority 4\ shortLabel HapMap SNPs CHD\ track hapmapSnpsCHD\ hgdpHzyEurope Hetzgty Europe bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (Europe) 0 4 240 144 0 247 199 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 240,144,0\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (Europe)\ parent hgdpHzy\ priority 4\ shortLabel Hetzgty Europe\ track hgdpHzyEurope\ encodeRegulomeQualityHuh7 Huh7 bed 5 . Huh7 Quality 0 4 150 50 150 202 152 202 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 color 150,50,150\ longLabel Huh7 Quality\ parent encodeRegulomeQuality\ priority 4\ shortLabel Huh7\ track encodeRegulomeQualityHuh7\ encodeRegulomeProbHuh7 Huh7 bedGraph 4 Huh7 DNaseI HSs 0 4 150 50 150 202 152 202 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 150,50,150\ longLabel Huh7 DNaseI HSs\ parent encodeRegulomeProb\ priority 4\ shortLabel Huh7\ track encodeRegulomeProbHuh7\ encodeRegulomeBaseHuh7 Huh7 wig 0.0 3.0 Huh7 DNaseI Sensitivity 0 4 150 50 150 202 152 202 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 150,50,150\ longLabel Huh7 DNaseI Sensitivity\ parent encodeRegulomeBase\ priority 4\ shortLabel Huh7\ track encodeRegulomeBaseHuh7\ hgdpIhsSAsia iHS S. Asia bedGraph 4 Human Genome Diversity Project iHS (South Asia) 0 4 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 color 0,0,0\ longLabel Human Genome Diversity Project iHS (South Asia)\ parent hgdpIhs\ priority 4\ shortLabel iHS S. Asia\ track hgdpIhsSAsia\ encodeGencodeIntergenicDistal Intergenic Dist bed 4 . Gencode Intergenic Distal Regions 0 4 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Gencode Intergenic Distal Regions\ parent encodeGencodeRegions\ priority 4\ shortLabel Intergenic Dist\ track encodeGencodeIntergenicDistal\ encodeAllIntergenicProximal Intergenic Prox bed 4 Consensus Intergenic Proximal 0 4 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Consensus Intergenic Proximal\ parent encodeWorkshopSelections\ priority 4\ shortLabel Intergenic Prox\ track encodeAllIntergenicProximal\ iscaRetrospectiveLikelyPathogenic ISCA Ret Lik.Path. gvf Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Likely Pathogenic) 0 4 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Likely Pathogenic)\ parent iscaRetrospectiveComposite\ priority 4\ shortLabel ISCA Ret Lik.Path.\ track iscaRetrospectiveLikelyPathogenic\ hapmapLdJpt LD JPT bed 4 + Linkage Disequilibrium for the Japanese from Tokyo (JPT) 0 4 0 0 0 127 127 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 longLabel Linkage Disequilibrium for the Japanese from Tokyo (JPT)\ parent hapmapLd\ priority 4\ shortLabel LD JPT\ track hapmapLdJpt\ encodeUcsdChipHeLaH3H4tmH3K4_p30 LI H3K4me3 +gIF bedGraph 4 Ludwig Institute ChIP-chip: H3K4me3 ab, HeLa cells, 30 min. after gamma interferon 0 4 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3K4me3 ab, HeLa cells, 30 min. after gamma interferon\ parent encodeLIChIPgIF\ priority 4\ shortLabel LI H3K4me3 +gIF\ track encodeUcsdChipHeLaH3H4tmH3K4_p30\ encodeUcsdNgHeLaRnap_p30 LI Ng Pol2 +gIF bedGraph 4 Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, Pol2, 30 min after gamma interferon 0 4 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Ludwig Institute/UCSD ChIP/Chip Ng: HeLa, Pol2, 30 min after gamma interferon\ parent encodeUcsdNgGif\ priority 4\ shortLabel LI Ng Pol2 +gIF\ track encodeUcsdNgHeLaRnap_p30\ encodeUcsdChipRnapHct116_f LI Pol2 HCT116 bedGraph 4 Ludwig Institute ChIP-chip: Pol2 8WG16 ab, HCT116 cells 0 4 58 119 40 156 187 147 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 58,119,40\ longLabel Ludwig Institute ChIP-chip: Pol2 8WG16 ab, HCT116 cells\ parent encodeLIChIP\ priority 4\ shortLabel LI Pol2 HCT116\ track encodeUcsdChipRnapHct116_f\ encodeMlaganUnionEl MLAGAN Union bed 5 . MLAGAN PhastCons/BinCons/GERP Union Conserved Elements 0 4 80 70 180 167 162 217 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,70,180\ longLabel MLAGAN PhastCons/BinCons/GERP Union Conserved Elements\ parent encodeMlaganElements\ priority 4\ shortLabel MLAGAN Union\ track encodeMlaganUnionEl\ netSyntenyMm8 Mouse Syn Net netAlign mm8 chainMm8 Mouse (Feb. 2006 (NCBI36/mm8)) Syntenic Alignment Net 0 4 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb mm8\ parent syntenicNet\ priority 4\ shortLabel Mouse Syn Net\ spectrum on\ track netSyntenyMm8\ type netAlign mm8 chainMm8\ visibility hide\ encodeAllNcIntersectEl NC Intersect bed 5 . TBA and MLAGAN PhastCons/BinCons/GERP Intersection NonCoding Conserved Elements 0 4 80 180 80 167 217 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,180,80\ longLabel TBA and MLAGAN PhastCons/BinCons/GERP Intersection NonCoding Conserved Elements\ parent encodeAllElements\ priority 4\ shortLabel NC Intersect\ track encodeAllNcIntersectEl\ ntSeqContigs Neandertal Cntgs bam Neandertal Sequence Contigs Generated by Genotype Caller 0 4 0 0 0 127 127 127 0 0 0\ The Neandertal Sequence Contigs track shows consensus contigs \ called (after duplicate reads from each library were merged) from \ overlapping, non-redundant reads that passed mapping and base\ quality criteria. \
\ \\ The contigs (query sequences) from each of the six samples are contained in separate\ subtracks. Use the checkboxes to select which samples will be\ displayed in the browser. Click and drag the sample name to reorder the\ subtracks. The order in which the subtracks appear in the subtrack list will\ be the order in which they display in the browser.\
\ The query sequences in the SAM/BAM alignment representation\ are normalized to the + strand of the reference genome\ (see the SAM Format Specification\ for more information on the SAM/BAM file format). If a query sequence was\ originally the reverse of what has been stored and aligned, it will have the\ following\ flag:\
\ (0x10) Read is on '-' strand.\\
\ BAM/SAM alignment representations also have tags. Some tags are predefined and others (those beginning\ with X, Y or Z) are defined by the aligner or data submitter. \ The following tag is associated with this track:\
\ The item labels and display\ colors of features within this track can be configured through the controls at\ the top of the track description page.\
\\ All Neandertal sequence reads from each of the six samples were aligned\ to the human (hg16) genome using the short read aligner/mapper \ ANFO.\
\\ To reduce the effects of sequencing error, the alignments of Neandertal reads to\ the human and chimpanzee reference genomes were used to construct human-based \ and chimpanzee-based consensus "minicontigs". To generate the consensus, \ uniquely placed, overlapping alignments were selected (ANFO MAPQ ≥ 90) and \ these were merged into a single multi-sequence alignment using the common \ reference genome sequence.\
\\ At each position in the resulting alignment, for each observed base, and for \ each possible original base: i) The likelihood of the observation was \ calculated, ii) the likely length of single-stranded overhangs was estimated, \ and iii) the potential for ancient DNA damage using the Briggs-Johnson model was\ considered (Briggs et al. 2007). \ If most observations in a given position showed a gap, the consensus became a \ gap; otherwise the base with the highest quality score (calculated by dividing \ each likelihood by the total likelihood) was used as the consensus.\
\\ At the current coverage, heterozygous sites will appear as low quality bases \ with the second base (not shown) having a similar likelihood to the consensus \ base. \ Likewise, heterozygous indels are included only by chance or may show up as \ stretches of low quality bases. \
\ \\ This track was produced at UCSC using data generated by\ Ed Green.\
\ \\ Briggs AW, Good JM, Green RE, Krause J, Maricic T, Stenzel U, Lalueza-Fox C, \ Rudan P, Brajkovic D, Kucan Z et al.\ Patterns of damage in genomic DNA sequences from a Neandertal.\ Proc Natl Acad Sci USA. 2007. Sep 11;104(37):14616-21.\
\\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\ neandertal 1 aliQualRange 0:254\ allButtonPair on\ baseColorDefault diffBases\ baseColorUseSequence lfExtra\ compositeTrack on\ dimensions dimensionX=sample\ dragAndDrop subTracks\ group neandertal\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Neandertal Sequence Contigs Generated by Genotype Caller\ noColorTag .\ priority 4\ shortLabel Neandertal Cntgs\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ showNames off\ sortOrder sample=+\ subGroup1 sample Sample All=All Feld1=Feld1 Mez1=Mez1 Sid1253=Sid1253 Vi33dot16=Vi33.16 Vi33dot25=Vi33.25 Vi33dot26=Vi33.26\ track ntSeqContigs\ type bam\ visibility hide\ numtSMitochondrionChrPlacement NumtS chr colored bed 9 . Human NumtS on mitochondrion with chromosome placement 0 4 0 0 0 127 127 127 0 0 0\ NumtS (Nuclear mitochondrial sequences) are mitochondrial fragments inserted in nuclear genomic sequences. The most credited hypothesis concerning their generation suggests that in presence of mutagenic agents or under stress conditions fragments of mtDNA escape from mitochondria, reach the nucleus and insert into chromosomes during break repair, although NumtS can derive from duplication of genomic fragments. NumtS may be cause of contamination during human mtDNA sequencing and hence frequent false low heteroplasmic evidences have been reported.\ The Bioinformatics group chaired by M.Attimonelli (Bari, Italy) has produced the RHNumtS compilation annotating more than 500 Human NumtS. To allow the scientific community to access to the compilation and to perform genomics comparative analyses inclusive of the NumtS data, the group has designed the Human NumtS tracks below described.\
\ \\ The NumtS tracks show the High Score Pairs (HSPs) obtained by aligning the mitochondrial reference genome (NC_012920) with the hg18 release of the human genome.\
\\ The "NumtS mitochondrial sequences" track shows the mapping of the HSPs returned by BlastN on the nuclear genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the mitochondrial mapping is provided, thus allowing a fast cross among the NumtS genomic contexts.\
\\ The "NumtS assembled" track shows items obtained by assembling HSPs annotated in the "NumtS" track fulfilling the following conditions:\
\ Exceptions for the second condition arise when a long repetitive element is present between two HSPs.\
\\ The "NumtS on mitochondrion" track shows the mapping of the HSPs on the mitochondrial genome. The shading of the items reflects the similarity returned by BlastN, and the direction of the arrows is concordant with the strand of the alignment. For every item, a link pointing to the nuclear mapping is provided.\
\ \\ The "NumtS on mitochondrion with chromosome placement" shows the mapping of the HSPs on the mitochondrial genome, but the items are coloured according to the colours assigned to each human chromosome on the UCSC genome browser. No shading is here provided. For every item, a link pointing to the nuclear mapping is provided.\
\\ NumtS mappings were obtained by running Blast2seq (program: BlastN) between each chromosome of of the Human Genome hg18 build and the human mitochondrial reference sequence (rCRS, AC: NC_012920), fixing the e-value threshold to 1e-03. The assembling of the HSPs was performed with spreadsheet interpolation and manual inspection.\
\ \\ NumtS predicted in silico were validated by carrying out PCR amplification and sequencing on blood-extracted DNA of a healthy individual of European origin. PCR amplification was successful for 275 NumtS and provided amplicons of the expected length. All PCR fragments were sequenced on both strands, and submitted to the EMBL databank.\
\\ Furthermore, 541 NumtS were validated by merging NumtS nuclear coordinates with HapMap annotations. Our analysis has been carried on eight HapMap individuals (NA18517, NA18507, NA18956, NA19240, NA18555, NA12878, NA19129, NA12156). For each sample, clones with a single best concordant placement (according to the fosmid end-sequence-pair analysis described in Kidd et al., 2008), have been considered. The analysis showed that 541 NumtS (at least 30bp for each one) had been sequenced in such samples.\
\ \\ These data were provided by Domenico Simone and Marcella Attimonelli at Department of Biochemistry and Molecular Biology "Ernesto Quagliariello" (University of Bari, Italy). Primer designing was carried out by Francesco Calabrese and Giuseppe Mineccia. PCR validation was carried out by Martin Lang, Domenico Simone and Giuseppe Gasparre. Merging with HapMap annotations has been performed by Domenico Simone.\
\ \\ Simone D, Calabrese FM, Lang M, Gasparre G, Attimonelli M: Validation and UCSC tracks of the extended RHNumtS compilation (submitted). \
\ \\ Lascaro D, Castellana S, Gasparre G, Romeo G, Saccone S, Attimonelli M. The RHNumtS compilation: features and bioinformatics approaches to locate and quantify Human NumtS. BMC\ Genomics. 2008 June 3;9:267.\
\ \\ Kidd JM, Cooper GM, Donahue WF, et al.\ \ Mapping and sequencing of structural variation from eight human genomes.\ Nature, 2008, 453(7191):56-64.\
\ \ \ \ varRep 1 html numtSeq\ itemRgb on\ longLabel Human NumtS on mitochondrion with chromosome placement\ parent numtSeq\ priority 4\ shortLabel NumtS chr colored\ track numtSMitochondrionChrPlacement\ type bed 9 .\ encodeGencodeRaceFragsHeart RACEfrags Heart genePred Gencode RACEfrags from Heart 0 4 225 30 255 240 142 255 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 225,30,255\ longLabel Gencode RACEfrags from Heart\ parent encodeGencodeRaceFrags\ priority 4\ shortLabel RACEfrags Heart\ track encodeGencodeRaceFragsHeart\ encodeSangerChipH3ac SI H3ac GM06990 bedGraph 4 Sanger Institute ChIP/Chip (H3ac ab, GM06990 cells) 0 4 10 10 130 132 132 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,130\ longLabel Sanger Institute ChIP/Chip (H3ac ab, GM06990 cells)\ parent encodeSangerChipH3H4\ priority 4\ shortLabel SI H3ac GM06990\ track encodeSangerChipH3ac\ stanfordChipHepG2SRF Stan HepG2 SRF bedGraph 4 Stanford ChIP-chip (HepG2 cells, SRF ChIP) 0 4 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (HepG2 cells, SRF ChIP)\ parent stanfordChip\ priority 4\ shortLabel Stan HepG2 SRF\ track stanfordChipHepG2SRF\ encodeStanfordChipHepG2SRF Stan HepG2 SRF bedGraph 4 Stanford ChIP-chip (HepG2 cells, SRF ChIP) 0 4 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HepG2 cells, SRF ChIP)\ parent encodeStanfordChipJohnson\ priority 4\ shortLabel Stan HepG2 SRF\ track encodeStanfordChipHepG2SRF\ encodeStanfordChipJurkatSp3 Stan Jurkat Sp3 bedGraph 4 Stanford ChIP-chip (Jurkat cells, Sp3 ChIP) 0 4 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (Jurkat cells, Sp3 ChIP)\ parent encodeStanfordChip\ priority 4\ shortLabel Stan Jurkat Sp3\ track encodeStanfordChipJurkatSp3\ encodeStanfordMethHT1080 Stan Meth HT1080 bedGraph 4 Stanford Methylation Digest (HT1080 cells) 0 4 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (HT1080 cells)\ parent encodeStanfordMeth\ priority 4\ shortLabel Stan Meth HT1080\ track encodeStanfordMethHT1080\ encodeStanfordMethSmoothedHT1080 Stan Meth Sc HT1080 bedGraph 4 Stanford Methylation Digest Smoothed Score (HT1080 cells) 0 4 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (HT1080 cells)\ parent encodeStanfordMethSmoothed\ priority 4\ shortLabel Stan Meth Sc HT1080\ track encodeStanfordMethSmoothedHT1080\ encodeStanfordPromotersG402 Stan Pro G402 bed 9 + Stanford Promoter Activity (G402 cells) 0 4 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (G402 cells)\ parent encodeStanfordPromoters\ priority 4\ shortLabel Stan Pro G402\ track encodeStanfordPromotersG402\ encodeStanfordChipSmoothedJurkatSp3 Stan Sc Jurkat Sp3 bedGraph 4 Stanford ChIP-chip Smoothed Score (Jurkat cells, Sp3 ChIP) 0 4 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip Smoothed Score (Jurkat cells, Sp3 ChIP)\ parent encodeStanfordChipSmoothed\ priority 4\ shortLabel Stan Sc Jurkat Sp3\ track encodeStanfordChipSmoothedJurkatSp3\ stsMap STS Markers bed 5 + STS Markers on Genetic (blue) and Radiation Hybrid (black) Maps 1 4 0 0 0 128 128 255 0 0 0This track shows locations of Sequence Tagged Site (STS) markers\ along the draft assembly. These markers have been mapped using either\ genetic mapping (Genethon, Marshfield, and deCODE maps), radiation\ hybridization mapping (Stanford, Whitehead RH, and GeneMap99 maps) or\ YAC mapping (the Whitehead YAC map) techniques. Since August 2001,\ this track no longer displays fluorescent in situ hybridization (FISH)\ clones, which are now displayed in a separate track.
\ \Genetic map markers are shown in blue; radiation hybrid map markers\ are shown in black. When a marker maps to multiple positions in the\ genome, it is shown in a lighter color.
\ \Positions of STS markers are determined using both full sequences\ and primer information. Full sequences are aligned using blat,\ while isPCR (Jim Kent) and ePCR are used to find\ locations using primer information. Both sets of placements are\ combined to give final positions. In nearly all cases, full sequence\ and primer-based locations are in agreement, but in cases of\ disagreement, full sequence positions are used. Sequence and primer\ information for the markers were obtained from the primary sites for\ each of the maps, and from UniSTS.\ \
The track filter can be used to change the color or include/exclude\ a set of map data within the track. This is helpful when many items\ are shown in the track display, especially when only some are relevant\ to the current task. To use the filter: \
When you have finished configuring the filter, click the\ Submit button.
\ \This track was designed and implemented by Terry Furey. Many\ thanks to the researchers who worked on these maps, and to Greg\ Schuler, Arek Kasprzyk, Wonhee Jang, and Sanja Rogic for helping\ process the data. Additional data on the individual maps can be found\ at the following links:\
\ The Neandertal Seq track shows Neandertal sequence reads mapped to the human\ genome. The Neandertal sequence was generated from six Neandertal fossils found\ in Croatia, Germany, Spain and Russia.\
\ \\ The sequence reads (query sequences) from each of the six samples are contained\ in separate subtracks. Use the checkboxes to select which samples \ will be displayed in the browser. Click and drag the sample name to\ reorder the subtracks. The order in which the subtracks appear in the subtrack\ list will be the order in which they display in the browser.\
\ The query sequences in the SAM/BAM alignment representation\ are normalized to the + strand of the reference genome\ (see the SAM Format Specification\ for more information on the SAM/BAM file format). If a query sequence was\ originally the reverse of what has been stored and aligned, it will have the\ following\ flag:\
\ (0x10) Read is on '-' strand.\\
\
\ BAM/SAM alignment representations also have tags. Some tags are predefined and others (those beginning\ with X, Y or Z) are defined by the aligner or data submitter. \ The following tag is associated with this track: \
\ The item labels and display colors of features within this track can be\ configured through the controls at the top of the track description page.\
\\ The Neandertal sequence was genereated from six Neandertal fossils. Vi33.16\ (54.1% genome coverage), Vi33.25 (46.6%) and Vi33.26 (45.2%) were discovered in\ the Vindija cave in Croatia. Feld1 (0.1%) is from the Neandertal type specimen\ from the Neander Valley in Germany, Sid1253 (0.1%) is from El Sidron cave in\ Asturias, Spain, and Mez1 (2%) is from Mezmaiskaya in the Altai Mountains,\ Russia.
\\ To increase the fraction of endogenous Neandertal DNA in the sequencing\ libraries, restriction enzymes were used to deplete libraries of microbial DNA.\ This was done by identifying Neandertal sequencing reads whose best alignment\ was to a primate sequence, and selecting enzymes that would differentially cut\ non-primate fragments. These enzymes all contained CpG dinucleotides in their\ recognition sequences, reflecting the particularly low abundance of this\ dinucleotide in mammalian DNA. Sequencing was carried out on the 454 FLX and\ Titanium platforms and the Illumina GA. Neandertal reads were mapped to the\ human genome (hg16) using a custom mapper called\ ANFO. This custom\ alignment program was developed to take into account the characteristics of \ ancient DNA. Following the observation and implementation by Briggs \ et al., ANFO\ uses different substitution matrices for DNA thought to be double-stranded\ versus single-stranded and changes between them if doing so affords a better\ score.\
\ \\ This track was produced at UCSC using data generated by\ Ed Green.\
\ \\ Briggs AW, Good JM, Green RE, Krause J, Maricic T, Stenzel U, Lalueza-Fox C, \ Rudan P, Brajkovic D, Kucan Z et al.\ Targeted retrieval and analysis of five Neandertal mtDNA \ genomes. Science. 2009 Jul 17;325(5938):318-21.
\\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\ neandertal 1 aliQualRange 0:254\ allButtonPair on\ bamColorMode gray\ bamGrayMode aliQual\ baseColorDefault diffBases\ baseColorUseSequence lfExtra\ compositeTrack on\ dimensions dimensionX=sample\ dragAndDrop subTracks\ group neandertal\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Neandertal Sequence Reads\ noColorTag .\ priority 4.1\ shortLabel Neandertal Seq\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ showNames off\ sortOrder sample=+\ subGroup1 sample Sample Feld1=Feld1 Mez1=Mez1 Sid1253=Sid1253 Vi33dot16=Vi33.16 Vi33dot25=Vi33.25 Vi33dot26=Vi33.26\ track ntSeqReads\ type bam\ visibility hide\ netRBestMicMur1 Mouse lemur RBest Net netAlign micMur1 chainMicMur1 Mouse lemur (Jun. 2003 (Broad/micMur1)) Reciprocal Best Alignment Net 0 5 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Reciprocal Best Alignment Net\ otherDb micMur1\ parent rBestNet\ priority 5\ shortLabel $o_Organism RBest Net\ spectrum on\ track netRBestMicMur1\ type netAlign micMur1 chainMicMur1\ visibility hide\ snpArrayAffy250Sty Affy 250KSty bed 6 + Affymetrix GeneChip Human Mapping 250K Sty 0 5 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Affymetrix GeneChip Human Mapping 250K Sty\ parent snpArray off\ priority 5\ shortLabel Affy 250KSty\ track snpArrayAffy250Sty\ type bed 6 +\ encodeAffyChIpHl60PvalBrg1Hr08 Affy Brg1 RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 8hrs) P-Value 0 5 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 5\ shortLabel Affy Brg1 RA 8h\ subGroups factor=Brg1 time=8h\ track encodeAffyChIpHl60PvalBrg1Hr08\ encodeAffyChIpHl60SignalStrictHisH4Hr00 Affy H4Kac4 0h wig -2.78 3.97 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 0hrs) Strict Signal 0 5 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 0hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 5\ shortLabel Affy H4Kac4 0h\ subGroups factor=H4Kac4 time=0h\ track encodeAffyChIpHl60SignalStrictHisH4Hr00\ encodeAffyChIpHl60SitesStrictHisH4Hr00 Affy H4Kac4 0h bed 3 . Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 0hrs) Strict Sites 0 5 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 0hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 5\ shortLabel Affy H4Kac4 0h\ subGroups factor=H4Kac4 time=0h\ track encodeAffyChIpHl60SitesStrictHisH4Hr00\ encodeAffyChIpHl60PvalStrictHisH4Hr00 Affy H4Kac4 0h wig 0 696.62 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 0hrs) Strict P-Value 0 5 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 0hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 5\ shortLabel Affy H4Kac4 0h\ subGroups factor=H4Kac4 time=0h\ track encodeAffyChIpHl60PvalStrictHisH4Hr00\ encodeAffyRnaHl60SitesHr08IntronsProximal Affy In Prx HL60 8h bed 4 . Affy Intronic Proximal HL60 Retinoic 8h Transfrags 0 5 200 0 56 227 127 155 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 200,0,56\ longLabel Affy Intronic Proximal HL60 Retinoic 8h Transfrags\ parent encodeNoncodingTransFrags\ priority 5\ shortLabel Affy In Prx HL60 8h\ subGroups region=intronicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr08IntronsProximal\ encodeAffyRnaHl60SignalHr08 Affy RNA RA 8h wig -1168.00 1686.5 Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 8hrs) Signal 0 5 50 50 210 152 152 232 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,50,210\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 8hrs) Signal\ parent encodeAffyRnaSignal\ priority 5\ shortLabel Affy RNA RA 8h\ track encodeAffyRnaHl60SignalHr08\ encodeAffyRnaHl60SitesHr08 Affy RNA RA 8h bed 3 . Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 8hrs) Sites 0 5 50 50 210 152 152 232 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,50,210\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyRnaTransfrags\ priority 5\ shortLabel Affy RNA RA 8h\ track encodeAffyRnaHl60SitesHr08\ encodeEgaspPartAugustusDual Augustus/Mouse genePred Augustus + Mouse Homology Gene Predictions 0 5 12 85 135 133 170 195 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,85,135\ longLabel Augustus + Mouse Homology Gene Predictions\ parent encodeEgaspPartial\ priority 5\ shortLabel Augustus/Mouse\ track encodeEgaspPartAugustusDual\ encodeBuFirstExonLiver BU Liver bed 12 + Boston University First Exon Activity in Liver 0 5 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Liver\ parent encodeBuFirstExon\ priority 5\ shortLabel BU Liver\ track encodeBuFirstExonLiver\ cccTrendPvalRa CCC Rheum Arth chromGraph Case Control Consortium rheumatoid arthritis trend -log10 P-value 0 5 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel Case Control Consortium rheumatoid arthritis trend -log10 P-value\ parent caseControl\ priority 5\ shortLabel CCC Rheum Arth\ track cccTrendPvalRa\ bamSLDenisova Denisova bam Denisova Sequence Reads 0 5 0 0 0 127 127 127 0 0 0 \\
![]() |
\ Denisova cave entrance in the Altai Mountains\ \ of Siberia, Russia where the bones were found from which\ \ DNA was sequenced\ \ (Copyright (C) 2010, Johannes Krause)\ \ |
\ The Denisova track shows Denisova sequence reads mapped to the\ human genome. The Denisova sequence was generated from a phalanx bone\ excavated from Denisova Cave in the Altai Mountains in southern\ Siberia.\
\ \\ Denisova sequence libraries were prepared by treating DNA extracted\ from a single phalanx bone with two enzymes: uracil-DNA-glycosylase,\ which removes uracil residues from DNA to leave abasic sites, and\ endonuclease VIII, which cuts DNA at the 59 and 39 sides of abasic\ sites. Subsequent incubation with T4 polynucleotide kinase and T4 DNA\ polymerase was used to generate phosphorylated blunt ends that are\ amenable to adaptor ligation. Because the great majority of uracil\ residues occur close to the ends of ancient DNA molecules, this\ procedure leads to only a moderate reduction in average length of the\ molecules in the library, but a several-fold reduction in\ uracil-derived nucleotide misincorporation. Reads were aligned\ to human sequence July 2003 (NCBI34/hg16) using the\ Burrows-Wheeler Aligner.\
\\ Download the \ Denisova track data sets from the Genome Browser downloads server.\
\ \\ Briggs A.W., Stenzel U., Meyer M., Krause J., Kircher M., Pääbo S. \ Removal of deaminated cytosines and detection of in vivo\ methylation in ancient DNA.\ Nucleic Acids Res. 2009 Dec 22:38(6) e87.\
\\ Reich D., Green R.E., Kircher M., Krause J., Patterson N., Durand E.Y., Viola B., \ Briggs A.W., Stenzel U., Johnson P.L.F. et al.\ \ Genetic history of an archaic hominin group from Denisova Cave in Siberia.\ Nature. 2010 Dec 23;468:1053-1060.\
\ \\ This track was produced at UCSC using data generated by the Max Planck\ Institute for Evolutionary Anthropology.\
\ denisova 1 aliQualRange 0:60\ bamColorMode gray\ bamGrayMode aliQual\ baseColorDefault diffBases\ baseColorUseSequence lfExtra\ group denisova\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Denisova Sequence Reads\ maxWindowToDraw 1000000\ noColorTag .\ pairEndsByName on\ priority 5\ shortLabel Denisova\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ showNames off\ track bamSLDenisova\ type bam\ visibility hide\ kiddEichlerDiscAbc10 Discordant ABC10 bed 12 HGSV Individual ABC10 (Yoruba) Discordant Clone End Alignments 0 5 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC10 (Yoruba) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 5\ shortLabel Discordant ABC10\ track kiddEichlerDiscAbc10\ encodeAffyEc1BrainHippocampusSignal EC1 Sgnl Hippoc wig 0 62385 Affy Ext Trans Signal (1-base window) (Brain Hippocampus) 0 5 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (1-base window) (Brain Hippocampus)\ parent encodeAffyEcSignal\ priority 5\ shortLabel EC1 Sgnl Hippoc\ track encodeAffyEc1BrainHippocampusSignal\ encodeAffyEc1BrainHippocampusSites EC1 Sites Hippoc bed 3 . Affy Ext Trans Sites (1-base window) (Brain Hippocampus) 0 5 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (1-base window) (Brain Hippocampus)\ parent encodeAffyEcSites\ priority 5\ shortLabel EC1 Sites Hippoc\ track encodeAffyEc1BrainHippocampusSites\ encodeEgaspUpdExogean Exogean Update genePred Exogean Gene Predictions 0 5 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel Exogean Gene Predictions\ parent encodeEgaspUpdate\ priority 5\ shortLabel Exogean Update\ track encodeEgaspUpdExogean\ encodeEgaspFullExonhunter ExonHunter genePred ExonHunter Gene Predictions 0 5 12 20 150 133 137 202 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,20,150\ longLabel ExonHunter Gene Predictions\ parent encodeEgaspFull\ priority 5\ shortLabel ExonHunter\ track encodeEgaspFullExonhunter\ decodeFemaleCarrier Female Carrier bigWig 0.0 77.704 deCODE recombination map, female carrier 0 5 187 102 255 221 178 255 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 187,102,255\ configurable on\ longLabel deCODE recombination map, female carrier\ parent femaleView\ priority 5\ shortLabel Female Carrier\ subGroups view=female\ track decodeFemaleCarrier\ type bigWig 0.0 77.704\ encodeGencodeGenePolyAMar07 Gencode PolyA bed 9 . Gencode polyA Features 0 5 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 itemRgb on\ longLabel Gencode polyA Features\ noInherit on\ parent encodeGencodeGeneMar07\ priority 5\ shortLabel Gencode PolyA\ track encodeGencodeGenePolyAMar07\ type bed 9 .\ hapmapSnpsGIH HapMap SNPs GIH bed 6 + HapMap SNPs from the GIH Population (Gujarati Indians in Houston, TX, US) 0 5 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the GIH Population (Gujarati Indians in Houston, TX, US)\ parent hapmapSnps\ priority 5\ shortLabel HapMap SNPs GIH\ track hapmapSnpsGIH\ encodeRegulomeQualityHepG2 HepG2 bed 5 . HepG2 Quality 0 5 180 50 120 217 152 187 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 color 180,50,120\ longLabel HepG2 Quality\ parent encodeRegulomeQuality\ priority 5\ shortLabel HepG2\ track encodeRegulomeQualityHepG2\ encodeRegulomeProbHepG2 HepG2 bedGraph 4 HepG2 DNaseI HSs 0 5 180 50 120 217 152 187 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 180,50,120\ longLabel HepG2 DNaseI HSs\ parent encodeRegulomeProb\ priority 5\ shortLabel HepG2\ track encodeRegulomeProbHepG2\ encodeRegulomeBaseHepG2 HepG2 wig 0.0 3.0 HepG2 DNaseI Sensitivity 0 5 180 50 120 217 152 187 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 180,50,120\ longLabel HepG2 DNaseI Sensitivity\ parent encodeRegulomeBase\ priority 5\ shortLabel HepG2\ track encodeRegulomeBaseHepG2\ hgdpHzySAsia Hetzgty S. Asia bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (S. Asia) 0 5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 0,0,0\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (S. Asia)\ parent hgdpHzy\ priority 5\ shortLabel Hetzgty S. Asia\ track hgdpHzySAsia\ hgdpIhsEAsia iHS E. Asia bedGraph 4 Human Genome Diversity Project iHS (East Asia) 0 5 0 200 0 127 227 127 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 color 0,200,0\ longLabel Human Genome Diversity Project iHS (East Asia)\ parent hgdpIhs\ priority 5\ shortLabel iHS E. Asia\ track hgdpIhsEAsia\ encodeAllIntergenicDistal Intergenic Dist bed 4 Consensus Intergenic Distal 0 5 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Consensus Intergenic Distal\ parent encodeWorkshopSelections\ priority 5\ shortLabel Intergenic Dist\ track encodeAllIntergenicDistal\ iscaRetrospectiveUncertain ISCA Ret Uncert. gvf Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Uncertain) 0 5 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Internat. Stds. for Cytogen. Arrays Consort. (ISCA) - Retrospective (Uncertain)\ parent iscaRetrospectiveComposite\ priority 5\ shortLabel ISCA Ret Uncert.\ track iscaRetrospectiveUncertain\ encodeUcsdChipHeLaH3H4acH3_p0 LI H3ac -gIF bedGraph 4 Ludwig Institute ChIP-chip: H3ac ab, HeLa cells, no gamma interferon 0 5 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3ac ab, HeLa cells, no gamma interferon\ parent encodeLIChIPgIF\ priority 5\ shortLabel LI H3ac -gIF\ track encodeUcsdChipHeLaH3H4acH3_p0\ encodeUcsdChipTaf250Hela_f LI TAF1 HeLa bedGraph 4 Ludwig Institute ChIP-chip: TAF1 ab, HeLa cells 0 5 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Ludwig Institute ChIP-chip: TAF1 ab, HeLa cells\ parent encodeLIChIP\ priority 5\ shortLabel LI TAF1 HeLa\ track encodeUcsdChipTaf250Hela_f\ delMccarroll McCarroll Dels bed 4 . Deletions from Genotype Analysis (McCarroll) 0 5 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Deletions from Genotype Analysis (McCarroll)\ noInherit on\ parent cnp\ priority 5\ shortLabel McCarroll Dels\ track delMccarroll\ type bed 4 .\ encodeMlaganNcUnionEl MLAGAN NC Union bed 5 . MLAGAN PhastCons/BinCons/GERP Union NonCoding Conserved Elements 0 5 80 105 145 167 180 200 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,105,145\ longLabel MLAGAN PhastCons/BinCons/GERP Union NonCoding Conserved Elements\ parent encodeMlaganElements\ priority 5\ shortLabel MLAGAN NC Union\ track encodeMlaganNcUnionEl\ bamAllNumtSSorted NumtS SNPs bam Human NumtS on mitochondrion SNPs 3 5 0 0 0 127 127 127 0 0 1 chrM, varRep 1 aliQualRange 0:255\ bamColorMode strand\ bamGrayMode aliQual\ bamSkipPrintQualScore .\ baseColorDefault diffBases\ baseColorUseSequence lfExtra\ chromosomes chrM\ configurable on\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Human NumtS on mitochondrion SNPs\ maxWindowToDraw 1000000\ noColorTag .\ pairEndsByName on\ parent numtSeq\ priority 5\ shortLabel NumtS SNPs\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ showNames on\ track bamAllNumtSSorted\ type bam\ visibility pack\ encodeGencodeOtherESTs Other ESTs bed 4 . Gencode Other ESTs 0 5 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Gencode Other ESTs\ parent encodeGencodeRegions\ priority 5\ shortLabel Other ESTs\ track encodeGencodeOtherESTs\ hapmapLdPhChbJpt Ph JPT+CHB ld2 LD for the Han Chinese + Japanese from Tokyo (JPT+CHB) from phased genotypes 0 5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 longLabel LD for the Han Chinese + Japanese from Tokyo (JPT+CHB) from phased genotypes\ parent hapmapLdPh\ priority 5\ shortLabel Ph JPT+CHB\ track hapmapLdPhChbJpt\ encodeGencodeRaceFragsKidney RACEfrags Kidney genePred Gencode RACEfrags from Kidney 0 5 212 0 44 233 127 149 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 212,0,44\ longLabel Gencode RACEfrags from Kidney\ parent encodeGencodeRaceFrags\ priority 5\ shortLabel RACEfrags Kidney\ track encodeGencodeRaceFragsKidney\ netSyntenyRn4 Rat Syn Net netAlign rn4 chainRn4 Rat (Nov. 2004 (Baylor 3.4/rn4)) Syntenic Alignment Net 0 5 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb rn4\ parent syntenicNet\ priority 5\ shortLabel Rat Syn Net\ spectrum on\ track netSyntenyRn4\ type netAlign rn4 chainRn4\ visibility hide\ encodeSangerChipH4ac SI H4ac GM06990 bedGraph 4 Sanger Institute ChIP/Chip (H4ac ab, GM06990 cells) 0 5 10 10 130 132 132 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,130\ longLabel Sanger Institute ChIP/Chip (H4ac ab, GM06990 cells)\ parent encodeSangerChipH3H4\ priority 5\ shortLabel SI H4ac GM06990\ track encodeSangerChipH4ac\ tajdSnpXd SNPs XD bed 4 . SNPs from Chinese Descent 0 5 200 100 0 0 100 200 0 0 0 varRep 1 altColor 0,100,200\ color 200,100,0\ longLabel SNPs from Chinese Descent\ parent tajdSnp\ priority 5\ shortLabel SNPs XD\ track tajdSnpXd\ stanfordChipHeLaGABP Stan HeLa GABP bedGraph 4 Stanford ChIP-chip (HeLa cells, GABP ChIP) 0 5 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (HeLa cells, GABP ChIP)\ parent stanfordChip\ priority 5\ shortLabel Stan HeLa GABP\ track stanfordChipHeLaGABP\ encodeStanfordChipHeLaGABP Stan HeLa GABP bedGraph 4 Stanford ChIP-chip (HeLa cells, GABP ChIP) 0 5 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HeLa cells, GABP ChIP)\ parent encodeStanfordChipJohnson\ priority 5\ shortLabel Stan HeLa GABP\ track encodeStanfordChipHeLaGABP\ encodeStanfordChipK562Sp1 Stan K562 Sp1 bedGraph 4 Stanford ChIP-chip (K562 cells, Sp1 ChIP) 0 5 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (K562 cells, Sp1 ChIP)\ parent encodeStanfordChip\ priority 5\ shortLabel Stan K562 Sp1\ track encodeStanfordChipK562Sp1\ encodeStanfordMethHepG2 Stan Meth HepG2 bedGraph 4 Stanford Methylation Digest (HepG2 cells) 0 5 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (HepG2 cells)\ parent encodeStanfordMeth\ priority 5\ shortLabel Stan Meth HepG2\ track encodeStanfordMethHepG2\ encodeStanfordMethSmoothedHepG2 Stan Meth Sc HepG2 bedGraph 4 Stanford Methylation Digest Smoothed Score (HepG2 cells) 0 5 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (HepG2 cells)\ parent encodeStanfordMethSmoothed\ priority 5\ shortLabel Stan Meth Sc HepG2\ track encodeStanfordMethSmoothedHepG2\ encodeStanfordPromotersHCT116 Stan Pro HCT116 bed 9 + Stanford Promoter Activity (HCT116 cells) 0 5 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (HCT116 cells)\ parent encodeStanfordPromoters\ priority 5\ shortLabel Stan Pro HCT116\ track encodeStanfordPromotersHCT116\ encodeStanfordChipSmoothedK562Sp1 Stan Sc K562 Sp1 bedGraph 4 Stanford ChIP-chip Smoothed Score (K562 cells, Sp1 ChIP) 0 5 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip Smoothed Score (K562 cells, Sp1 ChIP)\ parent encodeStanfordChipSmoothed\ priority 5\ shortLabel Stan Sc K562 Sp1\ track encodeStanfordChipSmoothedK562Sp1\ stsMapMouse STS Markers bed 5 + STS Markers on Genetic Maps 1 5 0 0 0 128 128 255 0 0 0This track shows locations of Sequence-Tagged Site (STS) markers along \ the mouse draft assembly. These markers appear on the Mouse Genome Informatics (MGI) consensus mouse genetic \ map. Information about the genetic map and STS marker primer sequences are \ provided by the Mouse Genome Informatics database group at The Jackson \ Laboratory.
\ map 1 altColor 128,128,255,\ group map\ longLabel STS Markers on Genetic Maps\ priority 5\ shortLabel STS Markers\ track stsMapMouse\ type bed 5 +\ visibility dense\ encodeTbaNcUnionEl TBA NC Union bed 5 . TBA PhastCons/BinCons/GERP Union NonCoding Conserved Elements 0 5 80 105 145 167 180 200 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,105,145\ longLabel TBA PhastCons/BinCons/GERP Union NonCoding Conserved Elements\ parent encodeTbaElements\ priority 5\ shortLabel TBA NC Union\ track encodeTbaNcUnionEl\ hiSeqDepthTop10Pct Top 0.10 Depth bed 3 Top 0.10 of Read Depth Distribution 0 5 139 69 19 197 162 137 0 0 0 map 1 longLabel Top 0.10 of Read Depth Distribution\ parent hiSeqDepth\ priority 5\ shortLabel Top 0.10 Depth\ track hiSeqDepthTop10Pct\ encodePseudogeneUcsc2 UCSC Pseudogenes genePred UCSC Pseudogene Predictions 0 5 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 longLabel UCSC Pseudogene Predictions\ parent encodePseudogene\ priority 5\ shortLabel UCSC Pseudogenes\ track encodePseudogeneUcsc2\ encodeUtexChipHeLaMycPeaks UT Myc HeLa Pk bedGraph 4 University of Texas, Austin ChIP-chip (c-Myc, HeLa) Peaks 0 5 50 0 0 152 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,0,0\ longLabel University of Texas, Austin ChIP-chip (c-Myc, HeLa) Peaks\ parent encodeUtexChip\ priority 5\ shortLabel UT Myc HeLa Pk\ subGroups dataType=peaks\ track encodeUtexChipHeLaMycPeaks\ encodeUvaDnaRep8 UVa DNA Rep 8h bed 3 . University of Virginia Temporal Profiling of DNA Replication (8-10 hrs) 0 5 60 75 60 10 130 10 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 1 longLabel University of Virginia Temporal Profiling of DNA Replication (8-10 hrs)\ parent encodeUvaDnaRep\ priority 5\ shortLabel UVa DNA Rep 8h\ track encodeUvaDnaRep8\ kiddEichlerValidAbc10 Validated ABC10 bed 9 HGSV Individual ABC10 (Yoruba) Validated Sites of Structural Variation 0 5 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual ABC10 (Yoruba) Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 5\ shortLabel Validated ABC10\ track kiddEichlerValidAbc10\ hgdpXpehhEAsia XP-EHH E. Asia bedGraph 4 Human Genome Diversity Project XP-EHH (East Asia) 0 5 0 200 0 127 227 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 0,200,0\ longLabel Human Genome Diversity Project XP-EHH (East Asia)\ parent hgdpXpehh\ priority 5\ shortLabel XP-EHH E. Asia\ track hgdpXpehhEAsia\ encodeTransFragsYaleIntergenicProximal Yale Intergen Prox bed 4 Yale Intergenic Proximal 0 5 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Yale Intergenic Proximal\ parent encodeTransFrags\ priority 5\ shortLabel Yale Intergen Prox\ track encodeTransFragsYaleIntergenicProximal\ encodeYaleMASPlacRNATransMapFwdMless36mer36bp Yale Plc BtF RNA bedGraph 4 Yale Placenta RNA TransMap, MAS array, Forward Direction, Bertone Protocol 0 5 200 50 50 50 50 200 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 0 altColor 50,50,200\ color 200,50,50\ longLabel Yale Placenta RNA TransMap, MAS array, Forward Direction, Bertone Protocol\ parent encodeYaleMASPlacRNATransMap\ priority 5\ shortLabel Yale Plc BtF RNA\ track encodeYaleMASPlacRNATransMapFwdMless36mer36bp\ encodeYaleMASPlacRNATarsFwdMless36mer36bp Yale Plc BtF TAR bed 6 . Yale Placenta RNA TARs, MAS array, Forward Direction, Bertone Protocol 0 5 200 50 50 50 50 200 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 1 altColor 50,50,200\ color 200,50,50\ longLabel Yale Placenta RNA TARs, MAS array, Forward Direction, Bertone Protocol\ parent encodeYaleMASPlacRNATars\ priority 5\ shortLabel Yale Plc BtF TAR\ track encodeYaleMASPlacRNATarsFwdMless36mer36bp\ encodeYaleAffyNeutRNATransMap04 Yale RNA Neu 4 wig -2730 3394 Yale Neutrophil RNA Transcript Map, Sample 4 0 5 50 160 50 152 207 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,160,50\ longLabel Yale Neutrophil RNA Transcript Map, Sample 4\ parent encodeYaleAffyRNATransMap\ priority 5\ shortLabel Yale RNA Neu 4\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATransMap04\ encodeYaleAffyNeutRNATars04 Yale TAR Neu 4 bed 3 . Yale Neutrophil RNA Transcriptionally Active Region, Sample 4 0 5 50 160 50 152 207 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,160,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region, Sample 4\ parent encodeYaleAffyRNATars\ priority 5\ shortLabel Yale TAR Neu 4\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATars04\ netRBestTupBel1 Tree shrew RBest Net netAlign tupBel1 chainTupBel1 Tree shrew (Dec. 2006 (Broad/tupBel1)) Reciprocal Best Alignment Net 0 6 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Reciprocal Best Alignment Net\ otherDb tupBel1\ parent rBestNet\ priority 6\ shortLabel $o_Organism RBest Net\ spectrum on\ track netRBestTupBel1\ type netAlign tupBel1 chainTupBel1\ visibility hide\ encodeAffyChIpHl60SitesBrg1Hr08 Affy Brg1 RA 8h bed 3 . Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 8hrs) Sites 0 6 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 6\ shortLabel Affy Brg1 RA 8h\ subGroups factor=Brg1 time=8h\ track encodeAffyChIpHl60SitesBrg1Hr08\ encodeAffyChIpHl60SignalStrictHisH4Hr02 Affy H4Kac4 2h wig -2.78 3.97 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 2hrs) Strict Signal 0 6 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 2hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 6\ shortLabel Affy H4Kac4 2h\ subGroups factor=H4Kac4 time=2h\ track encodeAffyChIpHl60SignalStrictHisH4Hr02\ encodeAffyChIpHl60SitesStrictHisH4Hr02 Affy H4Kac4 2h bed 3 . Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 2hrs) Strict Sites 0 6 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 2hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 6\ shortLabel Affy H4Kac4 2h\ subGroups factor=H4Kac4 time=2h\ track encodeAffyChIpHl60SitesStrictHisH4Hr02\ encodeAffyChIpHl60PvalStrictHisH4Hr02 Affy H4Kac4 2h wig 0 696.62 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 2hrs) Strict P-Value 0 6 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 2hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 6\ shortLabel Affy H4Kac4 2h\ subGroups factor=H4Kac4 time=2h\ track encodeAffyChIpHl60PvalStrictHisH4Hr02\ encodeAffyRnaHl60SitesHr32IntronsProximal Affy In Prx HL60 32h bed 4 . Affy Intronic Proximal HL60 Retinoic 32h Transfrags 0 6 188 0 68 221 127 161 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 188,0,68\ longLabel Affy Intronic Proximal HL60 Retinoic 32h Transfrags\ parent encodeNoncodingTransFrags\ priority 6\ shortLabel Affy In Prx HL60 32h\ subGroups region=intronicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr32IntronsProximal\ encodeAffyRnaHl60SignalHr32 Affy RNA RA 32h wig -1168.00 1686.5 Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 32hrs) Signal 0 6 50 50 240 152 152 247 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,50,240\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 32hrs) Signal\ parent encodeAffyRnaSignal\ priority 6\ shortLabel Affy RNA RA 32h\ track encodeAffyRnaHl60SignalHr32\ encodeAffyRnaHl60SitesHr32 Affy RNA RA 32h bed 3 . Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 32hrs) Sites 0 6 50 50 240 152 152 247 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,50,240\ longLabel Affymetrix PolyA+ RNA (retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyRnaTransfrags\ priority 6\ shortLabel Affy RNA RA 32h\ track encodeAffyRnaHl60SitesHr32\ encodeEgaspPartAugustusAny Augustus/EST/Mouse genePred Augustus + EST/Protein Evidence + Mouse Homology Gene Predictions 0 6 12 100 100 133 177 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,100,100\ longLabel Augustus + EST/Protein Evidence + Mouse Homology Gene Predictions\ parent encodeEgaspPartial\ priority 6\ shortLabel Augustus/EST/Mouse\ track encodeEgaspPartAugustusAny\ encodeBuFirstExonLung BU Lung bed 12 + Boston University First Exon Activity in Lung 0 6 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Lung\ parent encodeBuFirstExon\ priority 6\ shortLabel BU Lung\ track encodeBuFirstExonLung\ cccTrendPvalT1d CCC T1 Diabetes chromGraph Case Control Consortium type 1 diabetes trend -log10 P-value 0 6 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel Case Control Consortium type 1 diabetes trend -log10 P-value\ parent caseControl\ priority 6\ shortLabel CCC T1 Diabetes\ track cccTrendPvalT1d\ delConrad Conrad Dels bed 8 . Deletions from Genotype Analysis (Conrad) 0 6 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Deletions from Genotype Analysis (Conrad)\ noInherit on\ parent cnp\ priority 6\ shortLabel Conrad Dels\ track delConrad\ type bed 8 .\ kiddEichlerDiscAbc9 Discordant ABC9 bed 12 HGSV Individual ABC9 (Japan) Discordant Clone End Alignments 0 6 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC9 (Japan) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 6\ shortLabel Discordant ABC9\ track kiddEichlerDiscAbc9\ netSyntenyCanFam2 Dog Syn Net netAlign canFam2 chainCanFam2 Dog (May 2005 (Broad/canFam2)) Syntenic Alignment Net 0 6 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb canFam2\ parent syntenicNet\ priority 6\ shortLabel Dog Syn Net\ spectrum on\ track netSyntenyCanFam2\ type netAlign canFam2 chainCanFam2\ visibility hide\ encodeAffyEc51BrainHippocampusSignal EC51 Sgnl Hippoc wig 0 62385 Affy Ext Trans Signal (51-base window) (Brain Hippocampus) 0 6 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (51-base window) (Brain Hippocampus)\ parent encodeAffyEcSignal\ priority 6\ shortLabel EC51 Sgnl Hippoc\ track encodeAffyEc51BrainHippocampusSignal\ encodeAffyEc51BrainHippocampusSites EC51 Site Hippoc bed 3 . Affy Ext Trans Sites (51-base window) (Brain Hippocampus) 0 6 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (51-base window) (Brain Hippocampus)\ parent encodeAffyEcSites\ priority 6\ shortLabel EC51 Site Hippoc\ track encodeAffyEc51BrainHippocampusSites\ encodeGencodeExonic Exonic bed 4 . Gencode Exonic Regions 0 6 0 0 0 127 127 127 0 0 0 encodeAnalysis 1 longLabel Gencode Exonic Regions\ parent encodeGencodeRegions\ priority 6\ shortLabel Exonic\ track encodeGencodeExonic\ decodeFemaleNonCarrier Female Non-carrier bigWig 0.0 93.929 deCODE recombination map, female non-carrier 0 6 148 128 200 201 191 227 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 148,128,200\ configurable on\ longLabel deCODE recombination map, female non-carrier\ parent femaleView\ priority 6\ shortLabel Female Non-carrier\ subGroups view=female\ track decodeFemaleNonCarrier\ type bigWig 0.0 93.929\ encodeEgaspFullFgenesh Fgenesh++ genePred Fgenesh++ Gene Predictions 0 6 22 150 20 138 202 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 22,150,20\ longLabel Fgenesh++ Gene Predictions\ parent encodeEgaspFull\ priority 6\ shortLabel Fgenesh++\ track encodeEgaspFullFgenesh\ encodeEgaspUpdFgenesh FGenesh++ Upd genePred Fgenesh++ Gene Predictions 0 6 22 150 20 138 202 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 22,150,20\ longLabel Fgenesh++ Gene Predictions\ parent encodeEgaspUpdate\ priority 6\ shortLabel FGenesh++ Upd\ track encodeEgaspUpdFgenesh\ fishClones FISH Clones bed 5 + Clones Placed on Cytogenetic Map Using FISH 0 6 0 150 0 127 202 127 0 0 0\ This track shows the location of fluorescent in situ hybridization \ (FISH)-mapped clones along the draft assembly sequence. The locations of \ these clones were contributed as a part of the BAC Consortium paper \ Cheung, V.G. et al. (2001) in the References section below.
\\ More information about the BAC clones, including how they may be obtained, \ can be found at the \ Human BAC Resource and the \ Clone Registry web sites hosted by \ NCBI.\ To view Clone Registry information for a clone, click on the clone name at \ the top of the details page for that item.
\ \\ This track has a filter that can be used to change the color or \ include/exclude the display of a dataset from an individual lab. This is \ helpful when many items are shown in the track display, especially when only \ some are relevant to the current task. The filter is located at the top of \ the track description page, which is accessed via the small button to the \ left of the track's graphical display or through the link on the track's \ control menu. To use the filter:\
\ When you have finished configuring the filter, click the Submit \ button.
\ \\ We would like to thank all of the labs that have contributed to this resource:\
\ Cheung VG, Nowak N, Jang W, Kirsch IR, Zhao S, Chen X-N, Furey TS, Kim U-J, Kuo W-L, Olivier M \ et al. \ Integration of cytogenetic landmarks into the draft sequence of \ the human genome. Nature. 2001 Feb 15;409(6822):953-958.
\ map 1 color 0,150,0,\ group map\ longLabel Clones Placed on Cytogenetic Map Using FISH\ priority 6\ shortLabel FISH Clones\ track fishClones\ type bed 5 +\ visibility hide\ encodePseudogeneGIS GIS Pseudogenes genePred Genome Institute of Singapore (GIS) Pseudogenes 0 6 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 longLabel Genome Institute of Singapore (GIS) Pseudogenes\ parent encodePseudogene\ priority 6\ shortLabel GIS Pseudogenes\ track encodePseudogeneGIS\ hapmapSnpsJPT HapMap SNPs JPT bed 6 + HapMap SNPs from the JPT Population (Japanese in Tokyo, Japan) 0 6 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the JPT Population (Japanese in Tokyo, Japan)\ parent hapmapSnps\ priority 6\ shortLabel HapMap SNPs JPT\ track hapmapSnpsJPT\ hgdpHzyEAsia Hetzgty E. Asia bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (E. Asia) 0 6 0 200 0 127 227 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 0,200,0\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (E. Asia)\ parent hgdpHzy\ priority 6\ shortLabel Hetzgty E. Asia\ track hgdpHzyEAsia\ hgdpIhsOceania iHS Oceania bedGraph 4 Human Genome Diversity Project iHS (Oceania) 0 6 0 200 200 127 227 227 0 0 23 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22,chrX, varRep 0 color 0,200,200\ longLabel Human Genome Diversity Project iHS (Oceania)\ parent hgdpIhs\ priority 6\ shortLabel iHS Oceania\ track hgdpIhsOceania\ snpArrayIllumina650 Illumina 650 bed 6 + Illumina Human Hap 650v3 0 6 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Illumina Human Hap 650v3\ parent snpArray off\ priority 6\ shortLabel Illumina 650\ track snpArrayIllumina650\ type bed 6 +\ encodeRegulomeQualityK562 K562 bed 5 . K562 Quality 0 6 210 50 90 232 152 172 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 color 210,50,90\ longLabel K562 Quality\ parent encodeRegulomeQuality\ priority 6\ shortLabel K562\ track encodeRegulomeQualityK562\ encodeRegulomeProbK562 K562 bedGraph 4 K562 DNaseI HSs 0 6 210 50 90 232 152 172 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 210,50,90\ longLabel K562 DNaseI HSs\ parent encodeRegulomeProb\ priority 6\ shortLabel K562\ track encodeRegulomeProbK562\ encodeRegulomeBaseK562 K562 wig 0.0 3.0 K562 DNaseI Sensitivity 0 6 210 50 90 232 152 172 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 210,50,90\ longLabel K562 DNaseI Sensitivity\ parent encodeRegulomeBase\ priority 6\ shortLabel K562\ track encodeRegulomeBaseK562\ encodeUcsdChipHeLaH3H4acH3_p30 LI H3ac +gIF bedGraph 4 Ludwig Institute ChIP-chip: H3ac ab, HeLa cells, 30 min. after gamma interferon 0 6 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3ac ab, HeLa cells, 30 min. after gamma interferon\ parent encodeLIChIPgIF\ priority 6\ shortLabel LI H3ac +gIF\ track encodeUcsdChipHeLaH3H4acH3_p30\ encodeUcsdChipTaf250Thp1_f LI TAF1 THP1 bedGraph 4 Ludwig Institute ChIP-chip: TAF1 ab, THP1 cells 0 6 0 63 135 127 159 195 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,63,135\ longLabel Ludwig Institute ChIP-chip: TAF1 ab, THP1 cells\ parent encodeLIChIP\ priority 6\ shortLabel LI TAF1 THP1\ track encodeUcsdChipTaf250Thp1_f\ encodeMlaganIntersectEl MLAGAN Intersect bed 5 . MLAGAN PhastCons/BinCons/GERP Intersection Conserved Elements 0 6 80 145 105 167 200 180 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,145,105\ longLabel MLAGAN PhastCons/BinCons/GERP Intersection Conserved Elements\ parent encodeMlaganElements\ priority 6\ shortLabel MLAGAN Intersect\ track encodeMlaganIntersectEl\ denisovaModernHumans Modern Human Seq bam Alignments of Sequence Reads from 7 Humans 0 6 0 0 0 127 127 127 0 0 0 \\ The Modern Human Seq track shows human sequence reads of seven individuals mapped \ to the human genome. The purpose of this track is to put the divergence of the \ Denisova genome into perspective with regard to present-day humans.\
\ \\ DNA was obtained for each of seven individuals from the CEPH-Human\ Genome Diversity Panel (HGDP): HGDP00456 (Mbuti), HGDP00998 (Karitiana\ Native American), HGDP00665 (Sardinia), HGDP00491 (Bougainville\ Melanesian), HGDP00711 (Cambodian), HGDP01224 (Mongolian) and\ HGDP00551 (Papuan). Each library was sequenced on the Illumina Genome\ Analyzer IIx using 2x101 + 7 cycles on one flow cell according to the\ manufacturer's instructions for multiplex sequencing. The paired-end\ reads were aligned\ using the Burrows-Wheeler Aligner to the human sequence (NCBI36/hg18)\
\\ Download the \ Modern Human Seq track data sets from the Genome Browser downloads server.\
\ \\ Briggs A.W., Stenzel U., Meyer M., Krause J., Kircher M., Pääbo S. \ Removal of deaminated cytosines and detection of in vivo\ methylation in ancient DNA.\ Nucleic Acids Res. 2009 Dec 22:38(6) e87.\
\\ Reich D., Green R.E., Kircher M., Krause J., Patterson N., Durand E.Y., Viola B., \ Briggs A.W., Stenzel U., Johnson P.L.F. et al.\ \ Genetic history of an archaic hominin group from Denisova Cave in Siberia.\ Nature. 2010 Dec 23;468:1053-1060.\
\ \\ This track was produced at UCSC using data generated by the Max Planck\ Institute for Evolutionary Anthropology.\
\ denisova 1 aliQualRange 0:60\ allButtonPair on\ bamColorMode gray\ bamGrayMode aliQual\ baseColorDefault diffBases\ baseColorUseSequence lfExtra\ compositeTrack on\ dimensions dimensionX=sample\ dragAndDrop subTracks\ group denisova\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Alignments of Sequence Reads from 7 Humans\ maxWindowToDraw 1000000\ noColorTag .\ pairEndsByName on\ priority 6\ shortLabel Modern Human Seq\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ showNames off\ sortOrder sample=+\ subGroup1 sample Sample b2MPygmy=Mbuti_Pygmy c3Mel=Melanesian d4Papuan=Papuan e5Sar=Sardinian f6Cam=Cambodian g7NativeAm=Native_Americans h8Mon=Mongolian\ track denisovaModernHumans\ type bam\ visibility hide\ ntModernHumans Modern Human Seq bam Alignments of Sequence Reads from 5 Modern Humans 0 6 0 0 0 127 127 127 0 0 0\ The Modern Human Seq track shows human sequence reads of five individuals mapped\ to the human genome. The purpose of this track is to put the divergence of the\ Neandertal genomes into perspective with regard to present-day humans.
\ \\ The sequence reads (query sequences) from each of the five individuals are\ contained in separate subtracks. Use the checkboxes to select which\ individuals will be displayed in the browser. Click and drag the\ sample name to reorder the subtracks. The order in which the subtracks appear in\ the subtrack list will be the order in which they display in the browser.
\\ The query sequences in the SAM/BAM alignment representation \ are normalized to the + strand of the reference genome\ (see the SAM Format Specification\ for more information on the SAM/BAM file format). If a query sequence was\ originally the reverse of what has been stored and aligned, it will have the\ following\ flag:\
\ (0x10) Read is on '-' strand.\\
\ BAM/SAM alignment representations also have tags. Some tags are predefined and others (those beginning\ with X, Y or Z) are defined by the aligner or data submitter.\ The following is a list of the tags associated with this track. For this \ track, those starting with X are specific to the \ Burrows-Wheeler Aligner (BWA).\
\ The item labels and display colors of features within this track can be\ configured through the controls at the top of the track description page.\
\\ The genomes of a San individual from Southern Africa (HGDP01029), a Yoruba\ individual from West Africa (HGDP00927), a Han Chinese individual (HGDP00778), \ an individual from Papua New Guinea (HGDP00542), and a French individual\ (HGDP00521) from Western Europe were sequenced to 4- to 6-fold coverage on\ the Illumina GAII platform. These sequences were aligned to the human\ reference genome (NCBI36/hg18) using the Burrows-Wheeler Aligner (BWA). Reads with an alignment\ quality of less than 30 were not included in these data. Those with an alignment\ quality greater than or equal to 30 were analyzed using a similar approach to\ that used for the Neandertal data.\
\ \\ This track was produced at UCSC using data generated by\ Ed Green.\
\ \\ Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N,\ Li H, Zhai W, Fritz MH et al.\ A Draft Sequence of the Neandertal Genome.\ Science. 2010 7 May;328(5979):710-22.\
\\ Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler \ Transform. Bioinformatics. 2009 Jul 15;25(14):1754-60.
\ neandertal 1 aliQualRange 0:60\ allButtonPair on\ bamColorMode gray\ bamGrayMode aliQual\ baseColorDefault diffBases\ baseColorUseSequence lfExtra\ compositeTrack on\ dimensions dimensionX=sample\ dragAndDrop subTracks\ group neandertal\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Alignments of Sequence Reads from 5 Modern Humans\ maxWindowToDraw 1000000\ noColorTag .\ pairEndsByName on\ priority 6\ shortLabel Modern Human Seq\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ showNames off\ sortOrder sample=+\ subGroup1 sample Sample a1San=San b2Yoruba=Yoruba c3Han=Han d4Papuan=Papuan e5French=French\ track ntModernHumans\ type bam\ visibility hide\ encodeAllOtherESTs Other ESTs bed 4 Consensus Other ESTs 0 6 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Consensus Other ESTs\ parent encodeWorkshopSelections\ priority 6\ shortLabel Other ESTs\ track encodeAllOtherESTs\ encodeGencodeRaceFragsLiver RACEfrags Liver genePred Gencode RACEfrags from Liver 0 6 236 0 20 245 127 137 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 236,0,20\ longLabel Gencode RACEfrags from Liver\ parent encodeGencodeRaceFrags\ priority 6\ shortLabel RACEfrags Liver\ track encodeGencodeRaceFragsLiver\ encodeSangerChipH3K4me2K562 SI H3K4me2 K562 bedGraph 4 Sanger Institute ChIP/Chip (H3K4me2 ab, K562 cells) 0 6 10 10 10 132 132 132 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,10\ longLabel Sanger Institute ChIP/Chip (H3K4me2 ab, K562 cells)\ parent encodeSangerChipH3H4\ priority 6\ shortLabel SI H3K4me2 K562\ track encodeSangerChipH3K4me2K562\ stanfordChipHeLaSRF Stan HeLa SRF bedGraph 4 Stanford ChIP-chip (HeLa cells, SRF ChIP) 0 6 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (HeLa cells, SRF ChIP)\ parent stanfordChip\ priority 6\ shortLabel Stan HeLa SRF\ track stanfordChipHeLaSRF\ encodeStanfordChipHeLaSRF Stan HeLa SRF bedGraph 4 Stanford ChIP-chip (HeLa cells, SRF ChIP) 0 6 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HeLa cells, SRF ChIP)\ parent encodeStanfordChipJohnson\ priority 6\ shortLabel Stan HeLa SRF\ track encodeStanfordChipHeLaSRF\ encodeStanfordChipK562Sp3 Stan K562 Sp3 bedGraph 4 Stanford ChIP-chip (K562 cells, Sp3 ChIP) 0 6 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (K562 cells, Sp3 ChIP)\ parent encodeStanfordChip\ priority 6\ shortLabel Stan K562 Sp3\ track encodeStanfordChipK562Sp3\ encodeStanfordMethJEG3 Stan Meth JEG3 bedGraph 4 Stanford Methylation Digest (JEG3 cells) 0 6 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (JEG3 cells)\ parent encodeStanfordMeth\ priority 6\ shortLabel Stan Meth JEG3\ track encodeStanfordMethJEG3\ encodeStanfordMethSmoothedJEG3 Stan Meth Sc JEG3 bedGraph 4 Stanford Methylation Digest Smoothed Score (JEG3 cells) 0 6 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (JEG3 cells)\ parent encodeStanfordMethSmoothed\ priority 6\ shortLabel Stan Meth Sc JEG3\ track encodeStanfordMethSmoothedJEG3\ encodeStanfordPromotersHMCB Stan Pro HMCB bed 9 + Stanford Promoter Activity (HMCB cells) 0 6 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (HMCB cells)\ parent encodeStanfordPromoters\ priority 6\ shortLabel Stan Pro HMCB\ track encodeStanfordPromotersHMCB\ encodeStanfordChipSmoothedK562Sp3 Stan Sc K562 Sp3 bedGraph 4 Stanford ChIP-chip Smoothed Score (K562 cells, Sp3 ChIP) 0 6 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip Smoothed Score (K562 cells, Sp3 ChIP)\ parent encodeStanfordChipSmoothed\ priority 6\ shortLabel Stan Sc K562 Sp3\ track encodeStanfordChipSmoothedK562Sp3\ tajdXd Tajima's D XD bedGraph 4 Tajima's D from Chinese Descent 0 6 200 100 0 0 100 200 0 0 0 varRep 0 altColor 0,100,200\ color 200,100,0\ longLabel Tajima's D from Chinese Descent\ parent tajD\ priority 6\ shortLabel Tajima's D XD\ track tajdXd\ encodeTbaIntersectEl TBA Intersect bed 5 . TBA PhastCons/BinCons/GERP Intersection Conserved Elements 0 6 80 145 105 167 200 180 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,145,105\ longLabel TBA PhastCons/BinCons/GERP Intersection Conserved Elements\ parent encodeTbaElements\ priority 6\ shortLabel TBA Intersect\ track encodeTbaIntersectEl\ encodeUtexChip2091fibMycPeaks UT Myc Fb Pk bedGraph 4 University of Texas, Austin ChIP-chip (c-Myc, 2091 fibroblasts) Peaks 0 6 50 0 0 152 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,0,0\ longLabel University of Texas, Austin ChIP-chip (c-Myc, 2091 fibroblasts) Peaks\ parent encodeUtexChip\ priority 6\ shortLabel UT Myc Fb Pk\ subGroups dataType=peaks\ track encodeUtexChip2091fibMycPeaks\ kiddEichlerValidAbc9 Validated ABC9 bed 9 HGSV Individual ABC9 (Japan) Validated Sites of Structural Variation 0 6 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual ABC9 (Japan) Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 6\ shortLabel Validated ABC9\ track kiddEichlerValidAbc9\ hgdpXpehhOceania XP-EHH Oceania bedGraph 4 Human Genome Diversity Project XP-EHH (Oceania) 0 6 0 200 200 127 227 227 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 0,200,200\ longLabel Human Genome Diversity Project XP-EHH (Oceania)\ parent hgdpXpehh\ priority 6\ shortLabel XP-EHH Oceania\ track hgdpXpehhOceania\ encodeTransFragsYaleIntronicDistal Yale Intron Dist bed 4 Yale Intronic Distal 0 6 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Yale Intronic Distal\ parent encodeTransFrags\ priority 6\ shortLabel Yale Intron Dist\ track encodeTransFragsYaleIntronicDistal\ encodeYaleMASPlacRNATransMapRevMless36mer36bp Yale Plc BtR RNA bedGraph 4 Yale Placenta RNA Trans Map, MAS Array, Reverse Direction, Bertone Protocol 0 6 50 50 200 200 50 50 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 0 altColor 200,50,50\ color 50,50,200\ longLabel Yale Placenta RNA Trans Map, MAS Array, Reverse Direction, Bertone Protocol\ parent encodeYaleMASPlacRNATransMap\ priority 6\ shortLabel Yale Plc BtR RNA\ track encodeYaleMASPlacRNATransMapRevMless36mer36bp\ encodeYaleMASPlacRNATarsRevMless36mer36bp Yale Plc BtR TAR bed 6 . Yale Placenta RNA TARs, MAS array, Reverse Direction, Bertone Protocol 0 6 50 50 200 200 50 50 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22, encodeTxLevels 1 altColor 200,50,50\ color 50,50,200\ longLabel Yale Placenta RNA TARs, MAS array, Reverse Direction, Bertone Protocol\ parent encodeYaleMASPlacRNATars\ priority 6\ shortLabel Yale Plc BtR TAR\ track encodeYaleMASPlacRNATarsRevMless36mer36bp\ encodeYaleAffyNeutRNATransMap05 Yale RNA Neu 5 wig -2730 3394 Yale Neutrophil RNA Transcript Map, Sample 5 0 6 50 145 50 152 200 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,145,50\ longLabel Yale Neutrophil RNA Transcript Map, Sample 5\ parent encodeYaleAffyRNATransMap\ priority 6\ shortLabel Yale RNA Neu 5\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATransMap05\ encodeYaleAffyNeutRNATars05 Yale TAR Neu 5 bed 3 . Yale Neutrophil RNA Transcriptionally Active Region, Sample 5 0 6 50 145 50 152 200 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,145,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region, Sample 5\ parent encodeYaleAffyRNATars\ priority 6\ shortLabel Yale TAR Neu 5\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATars05\ encodeAffyChIpHl60PvalBrg1Hr32 Affy Brg1 RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 32hrs) P-Value 0 7 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 7\ shortLabel Affy Brg1 RA 32h\ subGroups factor=Brg1 time=32h\ track encodeAffyChIpHl60PvalBrg1Hr32\ encodeAffyChIpHl60SignalStrictHisH4Hr08 Affy H4Kac4 8h wig -2.78 3.97 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 8hrs) Strict Signal 0 7 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 8hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 7\ shortLabel Affy H4Kac4 8h\ subGroups factor=H4Kac4 time=8h\ track encodeAffyChIpHl60SignalStrictHisH4Hr08\ encodeAffyChIpHl60SitesStrictHisH4Hr08 Affy H4Kac4 8h bed 3 . Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 8hrs) Strict Sites 0 7 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 8hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 7\ shortLabel Affy H4Kac4 8h\ subGroups factor=H4Kac4 time=8h\ track encodeAffyChIpHl60SitesStrictHisH4Hr08\ encodeAffyChIpHl60PvalStrictHisH4Hr08 Affy H4Kac4 8h wig 0 696.62 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 8hrs) Strict P-Value 0 7 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 8hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 7\ shortLabel Affy H4Kac4 8h\ subGroups factor=H4Kac4 time=8h\ track encodeAffyChIpHl60PvalStrictHisH4Hr08\ encodeBuFirstExonSkMuscle BU Skel. Muscle bed 12 + Boston University First Exon Activity in Skeletal Muscle 0 7 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Skeletal Muscle\ parent encodeBuFirstExon\ priority 7\ shortLabel BU Skel. Muscle\ track encodeBuFirstExonSkMuscle\ cccTrendPvalT2d CCC T2 Diabetes chromGraph Case Control Consortium type 2 diabetes trend -log10 P-value 0 7 0 0 0 127 127 127 0 0 0 phenDis 0 longLabel Case Control Consortium type 2 diabetes trend -log10 P-value\ parent caseControl\ priority 7\ shortLabel CCC T2 Diabetes\ track cccTrendPvalT2d\ encodeTransFragsConservedDistal Cons Distal bed 4 Conserved Distal 0 7 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Conserved Distal\ parent encodeTransFrags\ priority 7\ shortLabel Cons Distal\ track encodeTransFragsConservedDistal\ kiddEichlerDiscAbc8 Discordant ABC8 bed 12 HGSV Individual ABC8 (Yoruba) Discordant Clone End Alignments 0 7 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC8 (Yoruba) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 7\ shortLabel Discordant ABC8\ track kiddEichlerDiscAbc8\ encodeAffyEc1BrainHypothalamusSignal EC1 Sgnl BrainH wig 0 62385 Affy Ext Trans Signal (1-base window) (Brain Hypothalamus) 0 7 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (1-base window) (Brain Hypothalamus)\ parent encodeAffyEcSignal\ priority 7\ shortLabel EC1 Sgnl BrainH\ track encodeAffyEc1BrainHypothalamusSignal\ encodeAffyEc1BrainHypothalamusSites EC1 Sites BrainH bed 3 . Affy Ext Trans Sites (1-base window) (Brain Hypothalamus) 0 7 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (1-base window) (Brain Hypothalamus)\ parent encodeAffyEcSites\ priority 7\ shortLabel EC1 Sites BrainH\ track encodeAffyEc1BrainHypothalamusSites\ encodeRegulomeQualityERY ERY bed 5 . Adult Erythroblast Quality 0 7 240 50 60 247 152 157 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 1 color 240,50,60\ longLabel Adult Erythroblast Quality\ parent encodeRegulomeQuality\ priority 7\ shortLabel ERY\ track encodeRegulomeQualityERY\ encodeRegulomeProbERY ERY bedGraph 4 Adult Erythroblast DNaseI HSs 0 7 240 50 60 247 152 157 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 240,50,60\ longLabel Adult Erythroblast DNaseI HSs\ parent encodeRegulomeProb\ priority 7\ shortLabel ERY\ track encodeRegulomeProbERY\ encodeRegulomeBaseERY ERY wig 0.0 3.0 Adult Erythroblast DNaseI Sensitivity 0 7 240 50 60 247 152 157 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX, encodeChrom 0 color 240,50,60\ longLabel Adult Erythroblast DNaseI Sensitivity\ parent encodeRegulomeBase\ priority 7\ shortLabel ERY\ track encodeRegulomeBaseERY\ encodeEgaspFullSoftberryPseudo Fgenesh Pseudo genePred Fgenesh Pseudogene Predictions 0 7 130 130 130 192 192 192 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 130,130,130\ longLabel Fgenesh Pseudogene Predictions\ parent encodeEgaspFull\ priority 7\ shortLabel Fgenesh Pseudo\ track encodeEgaspFullSoftberryPseudo\ encodeEgaspUpdGeneId GeneID Update genePred GeneID Gene Predictions 0 7 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel GeneID Gene Predictions\ parent encodeEgaspUpdate\ priority 7\ shortLabel GeneID Update\ track encodeEgaspUpdGeneId\ encodeEgaspPartGenezilla GeneZilla genePred GeneZilla Gene Predictions 0 7 22 150 20 138 202 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 22,150,20\ longLabel GeneZilla Gene Predictions\ parent encodeEgaspPartial\ priority 7\ shortLabel GeneZilla\ track encodeEgaspPartGenezilla\ genMapDb GenMapDB Clones bed 6 + GenMapDB BAC Clones 0 7 0 0 0 127 127 127 0 0 0BAC clones from GenMapDB\ are placed on the draft sequence using BAC end sequence information\ and confirmed using STS markers by Vivian Cheung's lab at the\ Department of Pediatrics, University of Pennsylvania. Further\ information about each clone can be obtained by clicking on the clone\ name on the track detail page.\
\ This track shows the alignment of a complete Neandertal mitochondrial\ sequence to a modern human mitochondrial sequence.\
\\ Note: the mitochondrion used as the genome browser reference sequence\ "chrM" in hg18 and hg19 is\ NC_001807, which has been deprecated. \ Future human genome browsers will use the revised Cambridge Reference\ Sequence (rCRS) NC_012920.\
\ \\ This track follows the display conventions for \ PSL alignment \ tracks.\ Mismatching bases are highlighted as described\ here.\ Several types of alignment gap may also be colored; for more information, click \ here.\
\ \\ DNA was extracted from a 38,000-year-old bone and sequenced using \ methods described in Green, et al.\ The Neandertal mitochondrial sequence \ (NC_011137) was downloaded\ from GenBank and aligned to chrM\ (NC_001807) using BLAT.\
\ \\ Green RE, Malaspinas AS, Krause J, Briggs AW, Johnson PL, Uhler C, Meyer M, \ Good JM, Maricic T, Stenzel U et al.\ A complete \ Neandertal mitochondrial genome sequence determined by high-throughput \ sequencing.\ Cell. 2008 Aug 8;134(3):416-26.\
\ neandertal 1 baseColorDefault diffBases\ baseColorUseSequence seq\ chromosomes chrM\ group neandertal\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Neandertal Mitochondrial Sequence (Vi33.16, 2008)\ priority 7\ shortLabel Neandertal Mito\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 100\ track ntMito\ type psl\ visibility hide\ encodeGencodeRaceFragsLung RACEfrags Lung genePred Gencode RACEfrags from Lung 0 7 200 0 56 227 127 155 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 200,0,56\ longLabel Gencode RACEfrags from Lung\ parent encodeGencodeRaceFrags\ priority 7\ shortLabel RACEfrags Lung\ track encodeGencodeRaceFragsLung\ encodeSangerChipH3K4me3K562 SI H3K4me3 K562 bedGraph 4 Sanger Institute ChIP/Chip (H3K4me3 ab, K562 cells) 0 7 10 10 10 132 132 132 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,10\ longLabel Sanger Institute ChIP/Chip (H3K4me3 ab, K562 cells)\ parent encodeSangerChipH3H4\ priority 7\ shortLabel SI H3K4me3 K562\ track encodeSangerChipH3K4me3K562\ stanfordChipHeLaTAF Stan HeLa TAF bedGraph 4 Stanford ChIP-chip (HeLa cells, TAF ChIP) 0 7 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (HeLa cells, TAF ChIP)\ parent stanfordChip\ priority 7\ shortLabel Stan HeLa TAF\ track stanfordChipHeLaTAF\ encodeStanfordChipHeLaTAF Stan HeLa TAF bedGraph 4 Stanford ChIP-chip (HeLa cells, TAF ChIP) 0 7 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (HeLa cells, TAF ChIP)\ parent encodeStanfordChipJohnson\ priority 7\ shortLabel Stan HeLa TAF\ track encodeStanfordChipHeLaTAF\ encodeStanfordMethSmoothedSnu182 Stan Meth Sc Snu182 bedGraph 4 Stanford Methylation Digest Smoothed Score (Snu182 cells) 0 7 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (Snu182 cells)\ parent encodeStanfordMethSmoothed\ priority 7\ shortLabel Stan Meth Sc Snu182\ track encodeStanfordMethSmoothedSnu182\ encodeStanfordMethSnu182 Stan Meth Snu182 bedGraph 4 Stanford Methylation Digest (Snu182 cells) 0 7 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (Snu182 cells)\ parent encodeStanfordMeth\ priority 7\ shortLabel Stan Meth Snu182\ track encodeStanfordMethSnu182\ encodeStanfordPromotersHT1080 Stan Pro HT1080 bed 9 + Stanford Promoter Activity (HT1080 cells) 0 7 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (HT1080 cells)\ parent encodeStanfordPromoters\ priority 7\ shortLabel Stan Pro HT1080\ track encodeStanfordPromotersHT1080\ encodeTbaNcIntersectEl TBA NC Intersect bed 5 . TBA PhastCons/BinCons/GERP Intersection NonCoding Conserved Elements 0 7 80 180 80 167 217 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 color 80,180,80\ longLabel TBA PhastCons/BinCons/GERP Intersection NonCoding Conserved Elements\ parent encodeTbaElements\ priority 7\ shortLabel TBA NC Intersect\ track encodeTbaNcIntersectEl\ encodeUtexChip2091fibMycStimPeaks UT Myc st-Fb Pk bedGraph 4 University of Texas, Austin ChIP-chip (c-Myc, FBS-stimulated 2091 fibroblasts) Peaks 0 7 50 0 0 152 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,0,0\ longLabel University of Texas, Austin ChIP-chip (c-Myc, FBS-stimulated 2091 fibroblasts) Peaks\ parent encodeUtexChip\ priority 7\ shortLabel UT Myc st-Fb Pk\ subGroups dataType=peaks\ track encodeUtexChip2091fibMycStimPeaks\ kiddEichlerValidAbc8 Validated ABC8 bed 9 HGSV Individual ABC8 (Yoruba) Validated Sites of Structural Variation 0 7 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual ABC8 (Yoruba) Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 7\ shortLabel Validated ABC8\ track kiddEichlerValidAbc8\ hgdpXpehhAmericas XP-EHH Americas bedGraph 4 Human Genome Diversity Project XP-EHH (Americas) 0 7 224 192 0 239 223 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 224,192,0\ longLabel Human Genome Diversity Project XP-EHH (Americas)\ parent hgdpXpehh\ priority 7\ shortLabel XP-EHH Americas\ track hgdpXpehhAmericas\ encodeYaleAffyNB4RARNATarsIntronsProximal Yale In Prx NB4 RA bed 4 . Yale Intronic Proximal NB4 Retinoic TARs 0 7 176 0 80 215 127 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 176,0,80\ longLabel Yale Intronic Proximal NB4 Retinoic TARs\ parent encodeNoncodingTransFrags\ priority 7\ shortLabel Yale In Prx NB4 RA\ subGroups region=intronicProximal celltype=nb4 source=yale\ track encodeYaleAffyNB4RARNATarsIntronsProximal\ encodeYaleAffyNeutRNATransMap06 Yale RNA Neu 6 wig -2730 3394 Yale Neutrophil RNA Transcript Map, Sample 6 0 7 50 130 50 152 192 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,130,50\ longLabel Yale Neutrophil RNA Transcript Map, Sample 6\ parent encodeYaleAffyRNATransMap\ priority 7\ shortLabel Yale RNA Neu 6\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATransMap06\ encodeYaleAffyNeutRNATars06 Yale TAR Neu 6 bed 3 . Yale Neutrophil RNA Transcriptionally Active Region, Sample 6 0 7 50 130 50 152 192 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,130,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region, Sample 6\ parent encodeYaleAffyRNATars\ priority 7\ shortLabel Yale TAR Neu 6\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATars06\ encodeAffyChIpHl60SitesBrg1Hr32 Affy Brg1 RA 32h bed 3 . Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 32hrs) Sites 0 8 225 0 0 240 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 225,0,0\ longLabel Affymetrix ChIP/Chip (Brg1 retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 8\ shortLabel Affy Brg1 RA 32h\ subGroups factor=Brg1 time=32h\ track encodeAffyChIpHl60SitesBrg1Hr32\ encodeAffyChIpHl60SignalStrictHisH4Hr32 Affy H4Kac4 32h wig -2.78 3.97 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 32hrs) Strict Signal 0 8 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 32hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 8\ shortLabel Affy H4Kac4 32h\ subGroups factor=H4Kac4 time=32h\ track encodeAffyChIpHl60SignalStrictHisH4Hr32\ encodeAffyChIpHl60SitesStrictHisH4Hr32 Affy H4Kac4 32h bed 3 . Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 32hrs) Strict Sites 0 8 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 32hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 8\ shortLabel Affy H4Kac4 32h\ subGroups factor=H4Kac4 time=32h\ track encodeAffyChIpHl60SitesStrictHisH4Hr32\ encodeAffyChIpHl60PvalStrictHisH4Hr32 Affy H4Kac4 32h wig 0 696.62 Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 32hrs) Strict P-Value 0 8 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP-chip (H4Kac4, retinoic acid-treated HL-60, 32hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 8\ shortLabel Affy H4Kac4 32h\ subGroups factor=H4Kac4 time=32h\ track encodeAffyChIpHl60PvalStrictHisH4Hr32\ encodeBuFirstExonSpleen BU Spleen bed 12 + Boston University First Exon Activity in Spleen 0 8 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Spleen\ parent encodeBuFirstExon\ priority 8\ shortLabel BU Spleen\ track encodeBuFirstExonSpleen\ encodeTransFragsConservedProximal Cons Prox bed 4 Conserved Proximal 0 8 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Conserved Proximal\ parent encodeTransFrags\ priority 8\ shortLabel Cons Prox\ track encodeTransFragsConservedProximal\ netSyntenyBosTau2 Cow Syn Net netAlign bosTau3 chainBosTau2 Cow (Aug. 2006 (Baylor 3.1/bosTau3)) Syntenic Alignment Net 0 8 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb bosTau3\ parent syntenicNet\ priority 8\ shortLabel Cow Syn Net\ spectrum on\ track netSyntenyBosTau2\ type netAlign bosTau3 chainBosTau2\ visibility hide\ kiddEichlerDiscAbc7 Discordant ABC7 bed 12 HGSV Individual ABC7 (Yoruba) Discordant Clone End Alignments 0 8 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual ABC7 (Yoruba) Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 8\ shortLabel Discordant ABC7\ track kiddEichlerDiscAbc7\ encodeAffyEc51BrainHypothalamusSignal EC51 Sgnl BrainH wig 0 62385 Affy Ext Trans Signal (51-base window) (Brain Hypothalamus) 0 8 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 0 color 248,0,8\ longLabel Affy Ext Trans Signal (51-base window) (Brain Hypothalamus)\ parent encodeAffyEcSignal\ priority 8\ shortLabel EC51 Sgnl BrainH\ track encodeAffyEc51BrainHypothalamusSignal\ encodeAffyEc51BrainHypothalamusSites EC51 Sites BrainH bed 3 . Affy Ext Trans Sites (51-base window) (Brain Hypothalamus) 0 8 248 0 8 251 127 131 0 0 2 chr21,chr22, encodeTxLevels 1 color 248,0,8\ longLabel Affy Ext Trans Sites (51-base window) (Brain Hypothalamus)\ parent encodeAffyEcSites\ priority 8\ shortLabel EC51 Sites BrainH\ track encodeAffyEc51BrainHypothalamusSites\ encodeEgaspFullGeneId GeneID genePred GeneID Gene Predictions 0 8 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel GeneID Gene Predictions\ parent encodeEgaspFull\ priority 8\ shortLabel GeneID\ track encodeEgaspFullGeneId\ encodeEgaspUpdGeneIdU12 GeneID U12 Upd genePred GeneID U12 Intron Predictions 0 8 200 132 12 227 193 133 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 200,132,12\ longLabel GeneID U12 Intron Predictions\ parent encodeEgaspUpdate\ priority 8\ shortLabel GeneID U12 Upd\ track encodeEgaspUpdGeneIdU12\ hapmapSnpsMEX HapMap SNPs MEX bed 6 + HapMap SNPs from the MEX Population (Mexican Ancestry in Los Angeles, CA, US) 0 8 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the MEX Population (Mexican Ancestry in Los Angeles, CA, US)\ parent hapmapSnps\ priority 8\ shortLabel HapMap SNPs MEX\ track hapmapSnpsMEX\ hgdpHzyAmericas Hetzgty Americas bedGraph 4 Human Genome Diversity Proj Smoothd Expec Heterozygosity (Americas) 0 8 224 192 0 239 223 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr20,chr21,chr22, varRep 0 color 224,192,0\ longLabel Human Genome Diversity Proj Smoothd Expec Heterozygosity (Americas)\ parent hgdpHzy\ priority 8\ shortLabel Hetzgty Americas\ track hgdpHzyAmericas\ snpArrayIllumina300 Illumina 300 bed 6 + Illumina Human Hap 300v3 0 8 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Illumina Human Hap 300v3\ parent snpArray off\ priority 8\ shortLabel Illumina 300\ track snpArrayIllumina300\ type bed 6 +\ encodeUcsdChipHeLaH3H4acH4_p30 LI H4ac +gIF bedGraph 4 Ludwig Institute ChIP-chip: H4ac ab, HeLa cells, 30 min. after gamma interferon 0 8 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H4ac ab, HeLa cells, 30 min. after gamma interferon\ parent encodeLIChIPgIF\ priority 8\ shortLabel LI H4ac +gIF\ track encodeUcsdChipHeLaH3H4acH4_p30\ encodeUcsdChipTaf250Hct116_f LI TAF1 HCT116 bedGraph 4 Ludwig Institute ChIP-chip: TAF1 ab, HCT116 cells 0 8 58 119 40 156 187 147 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 58,119,40\ longLabel Ludwig Institute ChIP-chip: TAF1 ab, HCT116 cells\ parent encodeLIChIP\ priority 8\ shortLabel LI TAF1 HCT116\ track encodeUcsdChipTaf250Hct116_f\ decodeMaleCarrier Male Carrier bigWig 0.0 204.214 deCODE recombination map, male carrier 0 8 0 100 180 127 177 217 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 0,100,180\ configurable on\ longLabel deCODE recombination map, male carrier\ parent maleView\ priority 8\ shortLabel Male Carrier\ subGroups view=male\ track decodeMaleCarrier\ type bigWig 0.0 204.214\ encodeGencodeRaceFragsMuscle RACEfrags Muscle genePred Gencode RACEfrags from Muscle 0 8 188 0 68 221 127 161 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 188,0,68\ longLabel Gencode RACEfrags from Muscle\ parent encodeGencodeRaceFrags\ priority 8\ shortLabel RACEfrags Muscle\ track encodeGencodeRaceFragsMuscle\ recombRate Recomb Rate bed 4 + Recombination Rate from deCODE, Marshfield, or Genethon Maps (deCODE default) 0 8 0 0 0 127 127 127 0 0 0\ The recombination rate track represents\ calculated sex-averaged rates of recombination based on either the\ deCODE, Marshfield, or Genethon genetic maps. By default, the deCODE\ map rates are displayed. Female- and male-specific recombination\ rates, as well as rates from the Marshfield and Genethon maps, can\ also be displayed by choosing the appropriate filter option on the track \ description page.
\ \\ The deCODE genetic map was created at \ deCODE Genetics and is \ based on 5,136 microsatellite markers for 146 families with a total\ of 1,257 meiotic events. For more information on this map, see\ Kong, A. et al. (2002).
\\ The Marshfield genetic map was created at the \ Center for Medical Genetics and is based on 8,325 short \ tandem repeat polymorphisms (STRPs) for 8 CEPH families consisting of 134\ individuals with 186 meioses. For more information on this map, see \ Broman, K.W. et al. (1998).
\\ The Genethon genetic map was created at \ Genethon and is based on 5,264 microsatellites for 8 CEPH \ families consisting of 134 individuals with 186 meioses. For more information \ on this map, see \ Dib et al. (1996).
\\ Each base is assigned the recombination rate calculated by\ assuming a linear genetic distance across the immediately flanking\ genetic markers. The recombination rate assigned to each 1 Mb window\ is the average recombination rate of the bases contained within the\ window.
\ \\ This track has a filter that can be used to change the map or\ gender-specific rate displayed. The filter is located at the top of the track \ description page, which is accessed via the small button to the left of \ the track's graphical display or through the link on the track's control menu.\ To view a particular map or gender-specific rate, select the corresponding\ option from the "Map Distances" pulldown list. By default, the \ browser displays the deCODE sex-averaged distances.
\\ When you have finished configuring the filter, click the Submit \ button.
\ \\ This track was produced at UCSC using data that are freely available for\ the Genethon, Marshfield, and deCODE genetic maps (see above links). Thanks\ to all who played a part in the creation of these maps.
\ \\ Broman, K.W., Murray, J.C., Sheffield, V.C., White, R.L. and Weber, J.L.\ Comprehensive human genetic maps: Individual and sex-specific \ variation in recombination, American Journal of Human Genetics\ 63, 861-689 (1998).
\\ Dib, C., Faure, S., Fizames, C., Samson, D., Drouot, N., Vignal, A., \ Millasseau, P., Marc, S., Hazan, J., Seboun, E., Lathrop, M., Gyapay, G., \ Morissette, J., and Weissenbach, J. \ \ A comprehensive genetic map of the human genome based on 5,264 \ microsatellites, \ Nature 380(6570), 152-154 (1996).
\\ Kong, A., Gudbjartsson, D.F., Sainz, J., Jonsdottir, G.M., Gudjonsson, S.A., \ Richardsson, B., Sigurdardottir, S., Barnard, J., Hallbeck, B., Masson, G., \ Shlien, A., Palsson, S.T., Frigge, M.L., Thorgeirsson, T.E., Gulcher, J.R., \ and Stefansson, K.\ A high-resolution recombination map of the human genome,\ Nature Genetics, 31(3), 241-247 (2002).
\ map 1 exonArrows off\ group map\ longLabel Recombination Rate from deCODE, Marshfield, or Genethon Maps (deCODE default)\ priority 8\ shortLabel Recomb Rate\ track recombRate\ type bed 4 +\ visibility hide\ encodeEgaspPartSaga SAGA genePred SAGA Gene Predictions 0 8 80 20 80 167 137 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 80,20,80\ longLabel SAGA Gene Predictions\ parent encodeEgaspPartial\ priority 8\ shortLabel SAGA\ track encodeEgaspPartSaga\ encodeSangerChipH3acK562 SI H3ac K562 bedGraph 4 Sanger Institute ChIP/Chip (H3ac ab, K562 cells) 0 8 10 10 10 132 132 132 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,10\ longLabel Sanger Institute ChIP/Chip (H3ac ab, K562 cells)\ parent encodeSangerChipH3H4\ priority 8\ shortLabel SI H3ac K562\ track encodeSangerChipH3acK562\ stanfordChipK562GABP Stan K562 GABP bedGraph 4 Stanford ChIP-chip (K562 cells, GABP ChIP) 0 8 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (K562 cells, GABP ChIP)\ parent stanfordChip\ priority 8\ shortLabel Stan K562 GABP\ track stanfordChipK562GABP\ encodeStanfordChipK562GABP Stan K562 GABP bedGraph 4 Stanford ChIP-chip (K562 cells, GABP ChIP) 0 8 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (K562 cells, GABP ChIP)\ parent encodeStanfordChipJohnson\ priority 8\ shortLabel Stan K562 GABP\ track encodeStanfordChipK562GABP\ encodeStanfordMethSmoothedU87 Stan Meth Sc U87 bedGraph 4 Stanford Methylation Digest Smoothed Score (U87 cells) 0 8 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest Smoothed Score (U87 cells)\ parent encodeStanfordMethSmoothed\ priority 8\ shortLabel Stan Meth Sc U87\ track encodeStanfordMethSmoothedU87\ encodeStanfordMethU87 Stan Meth U87 bedGraph 4 Stanford Methylation Digest (U87 cells) 0 8 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChrom 0 longLabel Stanford Methylation Digest (U87 cells)\ parent encodeStanfordMeth\ priority 8\ shortLabel Stan Meth U87\ track encodeStanfordMethU87\ encodeStanfordPromotersHTB11 Stan Pro HTB11 bed 9 + Stanford Promoter Activity (HTB11 cells) 0 8 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (HTB11 cells)\ parent encodeStanfordPromoters\ priority 8\ shortLabel Stan Pro HTB11\ track encodeStanfordPromotersHTB11\ encodeUtexChip2091fibE2F4Peaks UT E2F4 st-Fb Pk bedGraph 4 University of Texas, Austin ChIP-chip (E2F4, 2091 fibroblasts) Peaks 0 8 50 0 0 152 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,0,0\ longLabel University of Texas, Austin ChIP-chip (E2F4, 2091 fibroblasts) Peaks\ parent encodeUtexChip\ priority 8\ shortLabel UT E2F4 st-Fb Pk\ subGroups dataType=peaks\ track encodeUtexChip2091fibE2F4Peaks\ kiddEichlerValidAbc7 Validated ABC7 bed 9 HGSV Individual ABC7 (Yoruba) Validated Sites of Structural Variation 0 8 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual ABC7 (Yoruba) Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 8\ shortLabel Validated ABC7\ track kiddEichlerValidAbc7\ encodeYaleAffyNB4TPARNATarsIntronsProximal Yale In Prx NB4 TPA bed 4 . Yale Intronic Proximal TPA-Treated NB4 TARs 0 8 164 0 92 209 127 173 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 164,0,92\ longLabel Yale Intronic Proximal TPA-Treated NB4 TARs\ parent encodeNoncodingTransFrags\ priority 8\ shortLabel Yale In Prx NB4 TPA\ subGroups region=intronicProximal celltype=nb4 source=yale\ track encodeYaleAffyNB4TPARNATarsIntronsProximal\ encodeYaleAffyNeutRNATransMap07 Yale RNA Neu 7 wig -2730 3394 Yale Neutrophil RNA Transcript Map, Sample 7 0 8 50 115 50 152 185 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,115,50\ longLabel Yale Neutrophil RNA Transcript Map, Sample 7\ parent encodeYaleAffyRNATransMap\ priority 8\ shortLabel Yale RNA Neu 7\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATransMap07\ encodeYaleAffyNeutRNATars07 Yale TAR Neu 7 bed 3 . Yale Neutrophil RNA Transcriptionally Active Region, Sample 7 0 8 50 115 50 152 185 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,115,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region, Sample 7\ parent encodeYaleAffyRNATars\ priority 8\ shortLabel Yale TAR Neu 7\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATars07\ decodeRmap deCODE Recomb bed 3 deCODE Recombination maps, 10Kb bin size, October 2010 0 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21,\ The deCODE recombination rate track represents\ calculated rates of recombination based on the\ deCODE recombination maps in 10 Kb bins from October 2010.\ Sex averaged-, Female- and male-specific recombination rates\ can be displayed by choosing the appropriate options on the track \ description page.
\\ Corresponding to each of these tracks separate\ tracks for carriers and non-carriers of the PRDM9 14/15 composite\ allele can be displayed as well. There are also tracks depicting\ the difference between male- and female- recombination rates, and a\ track showing recombination hotspots, i.e. bins with standardized\ recombination rates higher than 10.
\\ In addition to the deCODE display, three data tracks from the\ HapMap project\ are included. CEU, YRI and combined maps from release #24 can be\ turned on with the track visibility controls.
\ \\ The deCODE genetic map was created at \ deCODE Genetics and is \ based on 289,658 and 8,411 SNPs on the autosomal and X chromosomes\ for 15,257 parent-offspring pairs. For more information on this map, see\ Kong, A. et al. (2010).
\\ Each base is assigned the recombination rate calculated by\ assuming a linear genetic distance across the immediately flanking\ genetic markers. The recombination rate assigned to each 10 Kb window\ is the average recombination rate of the bases contained within the\ window. The recombination rates are standardized, bringing the average\ to 1 for all bins used for the standardization.
\ \\ This track was produced at UCSC using data that are freely available for\ the deCODE genetic maps. Thanks to all who played a part in the\ creation of these maps.
\ \\ Kong, A., Thorleifsson, G., Gudbjartsson, D.F., Masson, G., Sigurdsson, A.,\ Jonasdottir, A., Walters, G.B., Jonasdottir, A., Gylfason, A.,\ Kristinsson, Kari Th., Gudjonsson, S.A., Frigge, M.L., Helgason, A.,\ Thorsteinsdottir, U., Stefansson, K.\ Fine-scale recombination rate differences between sexes, populations and individuals,\ Nature, 467(7319), 1099-1103 (2010),\ and supplementary data nature09525-s1.pdf
\\ Kong, A., Gudbjartsson, D.F., Sainz, J., Jonsdottir, G.M., Gudjonsson, S.A., \ Richardsson, B., Sigurdardottir, S., Barnard, J., Hallbeck, B., Masson, G., \ Shlien, A., Palsson, S.T., Frigge, M.L., Thorgeirsson, T.E., Gulcher, J.R., \ and Stefansson, K.\ A high-resolution recombination map of the human genome,\ Nature Genetics, 31(3), 241-247 (2002).
\ map 1 chromosomes chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21\ compositeTrack on\ group map\ longLabel deCODE Recombination maps, 10Kb bin size, October 2010\ maxHeightPixels 100:36:11\ noInherit on\ priority 8.5\ shortLabel deCODE Recomb\ subGroup1 view Views male=Male female=Female avg=Sex_Average diff=Sex_Difference hot=Hot_Spots other=Other_maps\ track decodeRmap\ type bed 3\ viewLimits 0:10\ visibility hide\ femaleView Female bed 3 deCODE Recombination maps, 10Kb bin size, October 2010 0 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 1 parent decodeRmap\ shortLabel Female\ track femaleView\ view female\ visibility hide\ otherMaps HapMap bigWig -1.0 111.0 HapMap Release 24 recombination maps 0 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 longLabel HapMap Release 24 recombination maps\ parent decodeRmap\ shortLabel HapMap\ track otherMaps\ type bigWig -1.0 111.0\ view other\ visibility hide\ hotView Hot Spots bed 4 deCODE recombination map, Female and Male hot spots, >= 10.0 1 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 1 longLabel deCODE recombination map, Female and Male hot spots, >= 10.0\ parent decodeRmap\ shortLabel Hot Spots\ track hotView\ type bed 4\ view hot\ visibility dense\ maleView Male bed 3 deCODE Recombination maps, 10Kb bin size, October 2010 0 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 1 parent decodeRmap\ shortLabel Male\ track maleView\ view male\ visibility hide\ diffView Male-Female bed 3 deCODE Recombination maps, 10Kb bin size, October 2010 2 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 1 parent decodeRmap\ shortLabel Male-Female\ track diffView\ view diff\ visibility full\ avgView Sex Average bed 3 deCODE Recombination maps, 10Kb bin size, October 2010 2 8.5 0 0 0 127 127 127 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 1 parent decodeRmap\ shortLabel Sex Average\ track avgView\ view avg\ visibility full\ encodeAffyChIpHl60PvalCebpeHr00 Affy CEBPe RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 0hrs) P-Value 0 9 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 9\ shortLabel Affy CEBPe RA 0h\ subGroups factor=CEBPe time=0h\ track encodeAffyChIpHl60PvalCebpeHr00\ encodeAffyChIpHl60SignalStrictPol2Hr00 Affy Pol2 0h wig -2.78 3.97 Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 0hrs) Strict Signal 0 9 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 0hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 9\ shortLabel Affy Pol2 0h\ subGroups factor=Pol2 time=0h\ track encodeAffyChIpHl60SignalStrictPol2Hr00\ encodeAffyChIpHl60SitesStrictRnapHr00 Affy Pol2 0h bed 3 . Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 0hrs) Strict Sites 0 9 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 0hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 9\ shortLabel Affy Pol2 0h\ subGroups factor=Pol2 time=0h\ track encodeAffyChIpHl60SitesStrictRnapHr00\ encodeAffyChIpHl60PvalStrictPol2Hr00 Affy Pol2 0h wig 0 696.62 Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 0hrs) Strict P-Value 0 9 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 0hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 9\ shortLabel Affy Pol2 0h\ subGroups factor=Pol2 time=0h\ track encodeAffyChIpHl60PvalStrictPol2Hr00\ encodeBuFirstExonStomach BU Stomach bed 12 + Boston University First Exon Activity in Stomach 0 9 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX, encodeTxLevels 1 longLabel Boston University First Exon Activity in Stomach\ parent encodeBuFirstExon\ priority 9\ shortLabel BU Stomach\ track encodeBuFirstExonStomach\ encodeTransFragsConservedIntronicDistal Cons Intron Dist bed 4 Conserved Intronic Distal 0 9 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 longLabel Conserved Intronic Distal\ parent encodeTransFrags\ priority 9\ shortLabel Cons Intron Dist\ track encodeTransFragsConservedIntronicDistal\ kiddEichlerDiscG248 Discordant G248 bed 12 HGSV Individual G248 Discordant Clone End Alignments 0 9 0 0 0 127 127 127 0 0 0 http://mrhgsv.gs.washington.edu/cgi-bin/hgc?i=$$&c=$S&l=$[&r=$]&db=$D&position=$S:$[-$] varRep 1 longLabel HGSV Individual G248 Discordant Clone End Alignments\ parent kiddEichlerDisc\ priority 9\ shortLabel Discordant G248\ track kiddEichlerDiscG248\ encodeAffyEc1FetalKidneySignal EC1 Sgnl FetalK wig 0 62385 Affy Ext Trans Signal (1-base window) (Fetal Kidney) 0 9 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 0 color 176,0,80\ longLabel Affy Ext Trans Signal (1-base window) (Fetal Kidney)\ parent encodeAffyEcSignal\ priority 9\ shortLabel EC1 Sgnl FetalK\ track encodeAffyEc1FetalKidneySignal\ encodeAffyEc1FetalKidneySites EC1 Sites FetalK bed 3 . Affy Ext Trans Sites (1-base window) (Fetal Kidney) 0 9 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 1 color 176,0,80\ longLabel Affy Ext Trans Sites (1-base window) (Fetal Kidney)\ parent encodeAffyEcSites\ priority 9\ shortLabel EC1 Sites FetalK\ track encodeAffyEc1FetalKidneySites\ encodeRegions ENCODE Regions bed 4 . Encyclopedia of DNA Elements (ENCODE) Regions 0 9 150 100 30 202 177 142 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track depicts target regions for the \ NHGRI ENCODE \ project.\ The long-term goal of this project is to identify all functional elements \ in the human genome sequence to facilitate a better understanding of human \ biology and disease.
\\ During the pilot phase, 44 regions comprising 30 Mb — approximately \ 1% of the human genome — have been selected for intensive study to identify, \ locate and analyze functional elements within the regions. These targets are \ being studied by a diverse public research consortium to test and evaluate the\ efficacy of various methods, technologies, and strategies for locating \ genomic features. The outcome of this initial phase will form the basis for a \ larger-scale effort to analyze the entire human genome.
\\ See the NHGRI target \ selection process web page for a description of how the target \ regions were selected.
\\ To open a UCSC Genome Browser with a menu for selecting ENCODE regions on the \ human genome, use ENCODE Regions in the UCSC Browser. The UCSC resources \ provided for the ENCODE project are described on the \ UCSC ENCODE Portal.
\ \\ Thanks to the NHGRI ENCODE project for providing this initial set of data.
\ \ encodeGenes 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 150,100,30\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ longLabel Encyclopedia of DNA Elements (ENCODE) Regions\ origAssembly hg16\ priority 9.0\ shortLabel ENCODE Regions\ track encodeRegions\ type bed 4 .\ visibility hide\ encodeEgaspFullGeneIdU12 GeneID U12 genePred GeneID U12 Intron Predictions 0 9 200 132 12 227 193 133 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 200,132,12\ longLabel GeneID U12 Intron Predictions\ parent encodeEgaspFull\ priority 9\ shortLabel GeneID U12\ track encodeEgaspFullGeneIdU12\ hapmapSnpsMKK HapMap SNPs MKK bed 6 + HapMap SNPs from the MKK Population (Masai in Kinyawa, Kenya) 0 9 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HapMap SNPs from the MKK Population (Masai in Kinyawa, Kenya)\ parent hapmapSnps\ priority 9\ shortLabel HapMap SNPs MKK\ track hapmapSnpsMKK\ snpArrayIllumina1M Illumina 1M-Duo bed 6 + Illumina Human1M-Duo 0 9 0 0 0 127 127 127 0 0 0 varRep 1 longLabel Illumina Human1M-Duo\ parent snpArray\ priority 9\ shortLabel Illumina 1M-Duo\ track snpArrayIllumina1M\ type bed 6 +\ encodeEgaspUpdJigsaw Jigsaw Update genePred Jigsaw Gene Predictions 0 9 22 150 20 138 202 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 22,150,20\ longLabel Jigsaw Gene Predictions\ parent encodeEgaspUpdate\ priority 9\ shortLabel Jigsaw Update\ track encodeEgaspUpdJigsaw\ encodeUcsdChipAch3Imr90_f LI H3ac IMR90 bedGraph 4 Ludwig Institute ChIP-chip: H3ac ab, IMR90 cells 0 9 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: H3ac ab, IMR90 cells\ parent encodeLIChIP\ priority 9\ shortLabel LI H3ac IMR90\ track encodeUcsdChipAch3Imr90_f\ encodeUcsdChipHeLaH3H4stat1_p0 LI STAT1 -gIF bedGraph 4 Ludwig Institute ChIP-chip: STAT1 ab, HeLa cells, no gamma interferon 0 9 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: STAT1 ab, HeLa cells, no gamma interferon\ parent encodeLIChIPgIF\ priority 9\ shortLabel LI STAT1 -gIF\ track encodeUcsdChipHeLaH3H4stat1_p0\ decodeMaleNonCarrier Male Non-carrier bigWig 0.0 151.353 deCODE recombination map, male non-carrier 0 9 0 128 140 127 191 197 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 0,128,140\ configurable on\ longLabel deCODE recombination map, male non-carrier\ parent maleView\ priority 9\ shortLabel Male Non-carrier\ subGroups view=male\ track decodeMaleNonCarrier\ type bigWig 0.0 151.353\ ctgPos Map Contigs ctgPos Physical Map Contigs 0 9 150 0 0 202 127 127 0 0 0\ This track shows the locations of human contigs on the physical map. \ The underlying data is derived from the NCBI seq_contig.md file \ that accompanies this assembly. All contigs are "+" oriented in\ the assembly.
\ \\ For human genome reference sequences dated April 2003 and later,\ the individual chromosome sequencing centers are responsible\ for preparing the assembly of their chromosomes in \ AGP format. The\ files provided by these centers are checked and validated at NCBI, and\ form the basis for the seq_contig.md file that defines the physical \ map contigs.
\\ For more information on the human genome assembly process, see \ The NCBI Handbook.
\ map 0 color 150,0,0\ group map\ longLabel Physical Map Contigs\ priority 9\ shortLabel Map Contigs\ track ctgPos\ type ctgPos\ visibility hide\ netSyntenyMonDom4 Opossum Syn Net netAlign monDom4 chainMonDom4 Opossum (Jan. 2006 (Broad/monDom4)) Syntenic Alignment Net 0 9 0 0 0 127 127 127 1 0 0 compGeno 0 group compGeno\ longLabel $o_Organism ($o_date) Syntenic Alignment Net\ otherDb monDom4\ parent syntenicNet\ priority 9\ shortLabel Opossum Syn Net\ spectrum on\ track netSyntenyMonDom4\ type netAlign monDom4 chainMonDom4\ visibility hide\ encodeGencodeRaceFragsPlacenta RACEfrags Placenta genePred Gencode RACEfrags from Placenta 0 9 176 0 80 215 127 167 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 176,0,80\ longLabel Gencode RACEfrags from Placenta\ parent encodeGencodeRaceFrags\ priority 9\ shortLabel RACEfrags Placenta\ track encodeGencodeRaceFragsPlacenta\ encodeSangerChipH4acK562 SI H4ac K562 bedGraph 4 Sanger Institute ChIP/Chip (H4ac ab, K562 cells) 0 9 10 10 10 132 132 132 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 10,10,10\ longLabel Sanger Institute ChIP/Chip (H4ac ab, K562 cells)\ parent encodeSangerChipH3H4\ priority 9\ shortLabel SI H4ac K562\ track encodeSangerChipH4acK562\ stanfordChipK562SRF Stan K562 SRF bedGraph 4 Stanford ChIP-chip (K562 cells, SRF ChIP) 0 9 120 0 20 150 0 25 0 0 22 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX, regulation 0 longLabel Stanford ChIP-chip (K562 cells, SRF ChIP)\ parent stanfordChip\ priority 9\ shortLabel Stan K562 SRF\ track stanfordChipK562SRF\ encodeStanfordChipK562SRF Stan K562 SRF bedGraph 4 Stanford ChIP-chip (K562 cells, SRF ChIP) 0 9 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 longLabel Stanford ChIP-chip (K562 cells, SRF ChIP)\ parent encodeStanfordChipJohnson\ priority 9\ shortLabel Stan K562 SRF\ track encodeStanfordChipK562SRF\ encodeStanfordPromotersHela Stan Pro Hela bed 9 + Stanford Promoter Activity (HeLa cells) 0 9 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (HeLa cells)\ parent encodeStanfordPromoters\ priority 9\ shortLabel Stan Pro Hela\ track encodeStanfordPromotersHela\ kiddEichlerValidG248 Validated G248 bed 9 HGSV Individual G248 Validated Sites of Structural Variation 0 9 0 0 0 127 127 127 0 0 0 varRep 1 longLabel HGSV Individual G248 Validated Sites of Structural Variation\ parent kiddEichlerValid\ priority 9\ shortLabel Validated G248\ track kiddEichlerValidG248\ encodeYaleAffyNB4UntrRNATarsIntronsProximal Yale In Prx NB4 Un bed 4 . Yale Intronic Proximal NB4 TARs 0 9 152 0 104 203 127 179 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 152,0,104\ longLabel Yale Intronic Proximal NB4 TARs\ parent encodeNoncodingTransFrags\ priority 9\ shortLabel Yale In Prx NB4 Un\ subGroups region=intronicProximal celltype=nb4 source=yale\ track encodeYaleAffyNB4UntrRNATarsIntronsProximal\ encodeYaleAffyNeutRNATransMap08 Yale RNA Neu 8 wig -2730 3394 Yale Neutrophil RNA Transcript Map, Sample 8 0 9 50 100 50 152 177 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 50,100,50\ longLabel Yale Neutrophil RNA Transcript Map, Sample 8\ parent encodeYaleAffyRNATransMap\ priority 9\ shortLabel Yale RNA Neu 8\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATransMap08\ encodeYaleAffyNeutRNATars08 Yale TAR Neu 8 bed 3 . Yale Neutrophil RNA Transcriptionally Active Region, Sample 8 0 9 50 100 50 152 177 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 50,100,50\ longLabel Yale Neutrophil RNA Transcriptionally Active Region, Sample 8\ parent encodeYaleAffyRNATars\ priority 9\ shortLabel Yale TAR Neu 8\ subGroups celltype=neutro samples=samples\ track encodeYaleAffyNeutRNATars08\ encodeAffyChIpHl60SitesCebpeHr00 Affy CEBPe RA 0h bed 3 . Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 0hrs) Sites 0 10 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 10\ shortLabel Affy CEBPe RA 0h\ subGroups factor=CEBPe time=0h\ track encodeAffyChIpHl60SitesCebpeHr00\ encodeAffyChIpHl60SignalStrictPol2Hr02 Affy Pol2 2h wig -2.78 3.97 Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 2hrs) Strict Signal 0 10 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 2hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 10\ shortLabel Affy Pol2 2h\ subGroups factor=Pol2 time=2h\ track encodeAffyChIpHl60SignalStrictPol2Hr02\ encodeAffyChIpHl60SitesStrictRnapHr02 Affy Pol2 2h bed 3 . Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 2hrs) Strict Sites 0 10 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 2hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 10\ shortLabel Affy Pol2 2h\ subGroups factor=Pol2 time=2h\ track encodeAffyChIpHl60SitesStrictRnapHr02\ encodeAffyChIpHl60PvalStrictPol2Hr02 Affy Pol2 2h wig 0 696.62 Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 2hrs) Strict P-Value 0 10 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 2hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 10\ shortLabel Affy Pol2 2h\ subGroups factor=Pol2 time=2h\ track encodeAffyChIpHl60PvalStrictPol2Hr02\ gold Assembly bed 3 + Assembly from Fragments 0 10 150 100 30 230 170 40 0 0 0This track shows the draft assembly of the human genome.\ This assembly merges contigs from overlapping drafts and\ finished clones into longer sequence contigs. The sequence\ contigs are ordered and oriented when possible by mRNA, EST,\ paired plasmid reads (from the SNP Consortium) and BAC end\ sequence pairs.
\In dense mode, this track depicts the path through the draft and \ finished clones (aka the golden path) used to create the assembled sequence. \ Clone boundaries are distinguished by the use of alternating gold and brown \ coloration. Where gaps\ exist in the path, spaces are shown between the gold and brown\ blocks. If the relative order and orientation of the contigs\ between the two blocks is known, a line is drawn to bridge the\ blocks.
\\ Clone Type Key:\
\ The Gencode Gene track shows high-quality manual annotations in the\ ENCODE regions generated by the\ GENCODE project.\ A companion track, Gencode Introns, shows experimental gene structure \ validations for these annotations.
\\ The gene annotations are colored based on the Havana annotation type.\ Known and validated transcripts \ are colored dark green,\ putative and unconfirmed are light green,\ pseudogenes are blue,\ and artifacts are grey. \ The transcript types are defined in more detail in the accompanying table.\
\ The Gencode project recommends that the annotations\ with known and validated transcripts; i.e., the types Known, \ Novel_CDS, Novel_transcript_gencode_conf, and \ Putative_gencode_conf (which are colored dark green in the track display) \ be used as the reference annotation.\ \
\
Type | \Color | \Description | \|
---|---|---|---|
Known | \dark green | \Known protein coding genes (referenced in Entrez Gene, NCBI) | \|
Novel_CDS | \dark green | \Novel protein coding genes annotated by Havana (not referenced in Entrez Gene, NCBI) | \|
Novel_transcript_gencode_conf | \dark green | \Novel transcripts annotated by Havana (no ORF assigned) with at least\ one junction validated by RT-PCR | \|
Putative_gencode_conf | \dark green | \Putative transcripts (similar to "novel transcripts", EST supported,\ short, no viable ORF) with at least one junction validated by RT-PCR | \|
Novel_transcript | \light green | \Novel transcripts annotated by Havana (no ORF assigned) not validated \ by RT-PCR | \|
Putative | \light green | \Putative transcripts (similar to "novel transcripts", EST supported,\ short, no viable ORF) not validated by RT-PCR | \|
TEC | \light green | \Single exon objects (supported by multiple ESTs with polyA \ sites and signals) undergoing experimental validation/extension. | \ |
Processed_pseudogene | \blue | \Pseudogenes arising via retrotransposition (exon structure of parent gene lost) | \|
Unprocessed_pseudogene | \blue | \Pseudogenes arising via gene duplication (exon structure of parent gene retained) | \|
Artifact | \grey | \Transcript evidence and/or its translation equivocal | \
\ The Human and Vertebrate Analysis and Annotation manual curation process \ (HAVANA) was\ used to produce these annotations.\
\ Finished genomic sequence was analyzed on a clone-by-clone basis using a\ combination of similarity searches against DNA and protein databases, as\ well as a series of ab initio gene predictions. Nucleotide sequence \ databases were searched with WUBLASTN and significant hits were realigned\ to the unmasked genomic sequence by EST2GENOME. WUBLASTX was used to search \ the Uniprot protein database, and the accession numbers of significant hits \ were retrieved from the Pfam database. Hidden Markov models for Pfam protein \ domains were aligned against the genomic sequence using Genewise to provide\ annotation of protein domains. \
\ A number of ab initio\ prediction algorithms were also run: Genscan and Fgenesh for genes, tRNAscan \ to find tRNA genes, and Eponine TSS for transcription start site predictions.\
\ The annotators used the (AceDB-based) Otterlace interface to create and\ edit gene objects, which were then stored in a local database named \ Otter. In cases where predicted transcript structures from Ensembl \ are available, these can be viewed from within the Otterlace interface and \ may be used as starting templates for gene curation. Annotation in the Otter \ database is submitted to the EMBL/Genbank/DDBJ nucleotide database.
\ \\ The gene objects selected for verification came from various\ computational prediction methods and HAVANA annotations. \
RT-PCR and RACE experiments were performed on them, using a variety of human\ tissues, to confirm their structure. Human cDNAs from 24 different\ tissues (brain, heart, kidney, spleen, liver, colon, small intestine,\ muscle, lung, stomach, testis, placenta, skin, peripheral blood\ leucocytes, bone marrow, fetal brain, fetal liver, fetal kidney, fetal\ heart, fetal lung, thymus, pancreas, mammary gland, prostate) were\ synthesized using 12 poly(A)+ RNAs from Origene, eight from Clemente\ Associates/Quantum Magnetics and four from BD Biosciences as described in\ [Reymond et al., 2002a,b]. The relative amount of each cDNA was \ normalized by quantitative PCR using SyberGreen as intercalator and an \ ABI Prism 7700 Sequence Detection System.
\\ Predictions of human genes junctions were assayed experimentally by\ RT-PCR as previously described and modified [Reymond, 2002b;\ Mouse Genome Sequencing Consortium, 2002; Guigo, 2003]. \
\ Similar amounts of Homo\ sapiens cDNAs were mixed with JumpStart REDTaq ReadyMix (Sigma) and four\ ng/ul primers (Sigma-Genosys) with a BioMek 2000 robot (Beckman). The\ ten first cycles of PCR amplification were performed with a touchdown\ annealing temperatures decreasing from 60 to 50°C; annealing\ temperature of the next 30 cycles was carried out at 50°C. Amplimers\ were separated on "Ready to Run" precast gels (Pharmacia) and\ sequenced. RACE experiments were performed with the BD SMART RACE cDNA\ Amplification Kit following the manufacturer instructions (BD\ Biosciences).
\ \\ Click here for a complete list of people who participated in the \ GENCODE project.
\ \\ Ashurst, J.L. et al. \ The Vertebrate Genome Annotation (Vega) database. \ Nucleic Acids Res 33 (Database Issue), D459-65 \ (2005).
\\ Guigo, R. et al. \ Comparison of mouse and human genomes followed by experimental \ verification yields an estimated 1,019 additional genes. \ Proc Natl Acad Sci U S A 100(3), 1140-5 (2003).
\\ Mouse Genome Sequencing Consortium. \ Initial sequencing and comparative analysis of the mouse \ genome. Nature 420(6915), 520-62 (2002).
\\ Reymond, A. et al. \ Human chromosome 21 gene expression atlas in the mouse. \ Nature 420(6915), 582-6 (2002).
\\ Reymond, A. et al. \ Nineteen additional unpredicted transcripts from human \ chromosome 21. Genomics 79(6), 824-32 (2002).
\ encodeGenes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ cdsDrawDefault genomic codons\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 73,76,73\ dataVersion ENCODE June 2005 Freeze\ gClass_Artifact 163,168,163\ gClass_Known 33,91,51\ gClass_Novel_CDS 33,91,51\ gClass_Novel_transcript 84,188,0\ gClass_Novel_transcript_gencode_conf 33,91,51\ gClass_Processed_pseudogene 0,91,191\ gClass_Putative 84,188,0\ gClass_Putative_gencode_conf 33,91,51\ gClass_TEC 84,188,0\ gClass_Unprocessed_pseudogene 0,91,191\ geneClasses Artifact Known Novel_CDS Novel_transcript Novel_transcript_gencode_conf Putative Putative_gencode_conf TEC Processed_pseudogene Unprocessed_pseudogene\ group encodeGenes\ itemClassTbl gencodeGeneClass\ longLabel Gencode Gene Annotations\ origAssembly hg17\ priority 10.1\ shortLabel Gencode Genes\ track encodeGencodeGene\ type genePred\ visibility hide\ encodeGencodeIntron Gencode Introns bed 6 + Gencode Intron Validation 0 10.2 0 0 0 127 127 127 0 0 20 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr8,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr9,chrX,\ The Gencode Intron Validation track shows\ gene structure validations generated by the\ GENCODE project.\ This track serves as a companion to the Gencode Genes track.
\\ The items in this track are colored based on the validation status determined \ via RT-PCR of exons flanking the intron:
\\
Status | \Color | \Validation Result | \
---|---|---|
RT_positive | \green | \Intron validated (PCR product corresponds to expected junction) | \
RT_negative\ | red | \Intron not validated (no PCR product was obtained) | \
RT_wrong_junction | \gold | \Intron not validated, but another junction exists between the two\ (PCR product does not correspond to the expected junction) | \
\ Selected gene models from the Genecode Genes track were picked for RT-PCR \ and RACE verification experiments.\ RT-PCR and RACE experiments were performed on the objects, using a variety of \ human tissues, to confirm their structure. Human cDNAs from 24 different\ tissues (brain, heart, kidney, spleen, liver, colon, small intestine,\ muscle, lung, stomach, testis, placenta, skin, peripheral blood\ leucocytes, bone marrow, fetal brain, fetal liver, fetal kidney, fetal\ heart, fetal lung, thymus, pancreas, mammary gland, prostate) were\ synthesized using twelve poly(A)+ RNAs from Origene, eight from Clemente\ Associates/Quantum Magnetics and four from BD Biosciences as described in\ [Reymond et al., 2002a,b]. The relative amount of each cDNA was \ normalized with glyceraldehyde-3-phosphate dehydrogenase (GAPDH) by quantitative\ PCR using SyberGreen as intercalator and \ an ABI Prism 7700 Sequence Detection System.
\\ Predictions of human genes junctions were assayed experimentally by\ RT-PCR as previously described and modified [Reymond, 2002b;\ Mouse Genome Sequencing Consortium, 2002; Guigo, 2003]. \
\ Similar amounts of Homo\ sapiens cDNAs were mixed with JumpStart REDTaq ReadyMix (Sigma) and 4\ ng/ul primers (Sigma-Genosys) with a BioMek 2000 robot (Beckman). The\ ten first cycles of PCR amplification were performed with a touchdown\ annealing temperatures decreasing from 60 to 50°C; annealing\ temperature of the next 30 cycles was carried out at 50°C. Amplimers\ were separated on "Ready to Run" precast gels (Pharmacia) and\ sequenced. RACE experiments were performed with the BD SMART RACE cDNA\ Amplification Kit following the manufacturer instructions (BD\ Biosciences).
\ \\ Click here for a complete list of people who participated in the \ GENCODE project.
\ \\ Ashurst, J.L. et al. \ The Vertebrate Genome Annotation (Vega) database. \ Nucleic Acids Res 33 (Database Issue), D459-65 \ (2005).
\\ Guigo, R. et al. \ Comparison of mouse and human genomes followed by experimental \ verification yields an estimated 1,019 additional genes. \ Proc Natl Acad Sci U S A 100(3), 1140-5 (2003).
\\ Mouse Genome Sequencing Consortium.\ Initial sequencing and comparative analysis of the mouse \ genome. Nature 420(6915), 520-62 (2002).
\\ Reymond, A. et al. \ Human chromosome 21 gene expression atlas in the mouse. \ Nature 420(6915), 582-6 (2002).
\\ Reymond, A. et al. \ Nineteen additional unpredicted transcripts from human \ chromosome 21. Genomics 79(6), 824-32 (2002).
\ encodeGenes 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr8,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr9,chrX\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ longLabel Gencode Intron Validation\ origAssembly hg17\ priority 10.2\ shortLabel Gencode Introns\ track encodeGencodeIntron\ type bed 6 +\ visibility hide\ encodeGencodeSuper Gencode Genes Gencode Gene Annotation 0 10.3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks from the\ GENCODE project.\ The goal of this project is to identify all protein-coding genes\ in the ENCODE regions using a pipeline that uses computational predictions, \ experimental verification, and manual annotation, based on\ the Sanger\ Havana process.\
\ \\ This track shows gene annotations from the GENCODE \ release v3.1 (March 2007). These annotations contain\ updates and corrections to the GENCODE October 2005 annotations,\ based on validation data from 5' RACE and RT-PCR experiments,\ which are displayed in the Gencode RACEfrags and Gencode Introns Oct05\ tracks.\
\ \\ This track shows the products of 5' RACE reactions performed on\ GENCODE genes in 12 tissues and 3 cell lines,\ as assayed on Affymetrix ENCODE 20nt tiling arays.\ The results were used to annotate 5' transcription\ start sites and internal exons of all annotated protein-coding\ loci in the Oct. 2005 GENCODE freeze.\ \
\ This track shows gene annotations from the GENCODE release v2.2 (Oct 2005), \ which was released as part of the ENCODE October 2005\ data freeze.\
\ \\
\ \\
This GENCODE release is the result of a collaborative effort among\
the following laboratories:\
\
\
\
\ Lab/Institution \ | \
Contributors \ | \
HAVANA annotation\ \ group, Wellcome Trust Sanger Insitute, Hinxton, UK | \Adam Frankish, Jonathan Mudge, James\
\ Gilbert, Tim Hubbard, Jennifer Harrow \ | \
Genome Bioinformatics\ \ Lab CRG, Barcelona, Spain | \France Denoeud, Julien Lagarde, Sylvain\ \ Foissac, Robert Castelo, Roderic Guigó (GENCODE Principal\ \ Investigator) | \
Department of\ \ Genetic Medicine and Development, University of Geneva, Switzerland | \Catherine Ucla, Carine Wyss,\ \ Caroline Manzano, Colette Rossier, Stylianos E. Antonorakis | \
Center for\ \ Integrative Genomics, University of Lausanne, Switzerland | \Jacqueline Chrast, Charlotte N.\ \ Henrichsen, Alexandre Reymond | \
Affymetrix, Inc.,\ \ Santa Clara, CA, USA | \Philipp Kapranov, Thomas R. Gingeras | \
\
The RACEfrags result from a collaborative effort among the following\
laboratories:\
\
\
\ Lab/Institution \ \ | \
Contributors \ | \
Genome Bioinformatics\ \ Lab CRG, Barcelona, Spain | \France Denoeud, Julien Lagarde, Tyler Alioto, Sylvain\ \ Foissac, Robert Castelo, Roderic Guigó | \
Department of\ \ Genetic Medicine and Development, University of Geneva, Switzerland | \Catherine Ucla, Carine Wyss,\ \ Caroline Manzano, Colette Rossier, Stylianos E. Antonorakis | \
Center for\ \ Integrative Genomics, University of Lausanne, Switzerland | \Jacqueline Chrast, Charlotte N.\ \ Henrichsen, Alexandre Reymond | \
Affymetrix, Inc.,\ \ Santa Clara, CA, USA | \Philipp Kapranov, Jorg Drenkow, Sujit Dike, Jill Cheng, Thomas R. Gingeras | \
HAVANA annotation\ \ group, Wellcome Trust Sanger Insitute, Hinxton, UK | \Adam Frankish, James\
\ Gilbert, Tim Hubbard, Jennifer Harrow \ \ | \
\ Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J,\ Lagarde J, Alioto TS, Manzano C, Chrast J et al.\ Prominent use of distal 5' transcription start sites and discovery\ of a large number of additional exons in ENCODE regions.\ Genome Res. 2007 Jun;17(6):746-59.
\\ Harrow J, Denoeud F, Frankish A, Reymond A, Chen CK, Chrast J,\ Lagarde J, Gilbert JG, Storey R, Swarbreck D et al.\ GENCODE: producing a reference annotation for ENCODE.\ Genome Biol. 2006;7 Suppl 1:S4.1-9.
\ encodeGenes 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeGenes\ longLabel Gencode Gene Annotation\ priority 10.3\ shortLabel Gencode Genes\ superTrack on\ track encodeGencodeSuper\ encodeGencodeGeneMar07 Gencode Genes Mar07 genePred Gencode Gene Annotations (March 2007) 0 10.4 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ The Gencode Genes track (v3.1, March 2007) shows high-quality manual\ annotations in the ENCODE regions generated by the\ GENCODE project.\
\\ The gene annotations are colored based on the HAVANA annotation type. See the \ table below for the color key, as well as more detail about the transcript\ and feature types. The Gencode project recommends that the annotations\ with known and validated transcripts; i.e., the types Known\ and Novel_CDS (which are colored\ dark green in the track\ display) be used as the reference gene annotation.\
\The v3.1 release includes the following updates and enhancements to v2.2 \ (Oct. 2005):\
\
Type | \Color | \Description | \
---|---|---|
Known | \dark green | \Known protein-coding genes (i.e., referenced\ \ in Entrez Gene) | \
Novel_CDS | \dark green | \Have an open reading frame (ORF) and are identical, or\ \ have homology, to cDNAs or proteins but do not fall into the above\ \ category. These can be known in the sense that they are represented by\ \ mRNA sequences in the public databases, but they are not yet \ \ represented\ \ in Entrez Gene or have not received an official gene name. They can \ \ also\ \ be novel in that they are not yet represented by an mRNA sequence in\ \ human. | \
Novel_transcript | \light green | \Similar to Novel_CDS; however, cannot be assigned an unambigous \ \ ORF. | \
Putative | \light green | \Have identical, or have homology to spliced ESTs, but\ \ are devoid of significant ORF and polyA features. These are\ \ generally short (two or three exon) genes or gene fragments. | \
TEC | \light green | \(To Experimentally Confirm)\ \ Single-exon objects (supported by multiple unspliced ESTs with polyA\ \ sites and signals). | \
Polymorphic | \purple | \Have functional transcripts in one haplotype and "pseudo"\ \ (non-functional) transcripts in another. | \
Processed_pseudogene | \blue | \Pseudogenes that lack introns and are thought to arise\ \ from reverse transcription of mRNA followed by reinsertion of\ \ DNA into the genome. | \
Unprocessed_pseudogene | \blue | \Pseudogenes that can contain introns, as they are\ \ produced by gene duplication. | \
Artifact | \grey | \Transcript evidence and/or its translation equivocal.\ \ Usually these arise from high-throughput cDNA sequencing projects that\ \ submit automatic annotation, sometimes resulting in erroneous CDSs in\ \ what turns out to be, for example, 3' UTRs. In addition HAVANA has\ \ extended this category to include cDNAs with non-canonical splice sites\ \ due to deletion/sequencing errors. | \
PolyA_signal | \brown | \Polyadenylation signal | \
PolyA_site | \orange | \Polyadenylation site | \
Pseudo_polyA | \pink | \ "Pseudo"-polyadenylation signal detected in the sequence\
\ of a processed pseudogene. \ Warning: \ Pseudo_polyA features and processed_pseudogenes\ generally don't overlap. The reason is that pseudogene annotations are\ based solely on protein evidence, whereas pseudo_polyA signals are\ identified from transcript evidence; as they are found at the end of\ the 3' UTR, they can lie several kb downstream of the 3' end of the\ pseudogene. | \
\
The current full set of GENCODE annotations is available for download \
here.
\
\ For a detailed description of the methods and references used, see Harrow \ et al., 2006 and Denoeud et al., 2007. \ \
\ A combination of 5’ RACE and \ high-density tiling microarrays were used to empirically annotate 5’ \ transcription start sites (TSSs) and internal exons of all 410 annotated\ protein-coding loci across the 44 ENCODE regions (Oct. 2005 GENCODE\ freeze). The 5’ RACE reactions were performed with oligonucleotides\ mapping to a coding exon common to most of the transcripts of a protein-coding \ gene locus annotated by GENCODE (Oct. 2005 freeze) on polyA+ RNA\ from twelve adult human tissues (brain, heart, kidney, spleen, liver,\ colon, small intestine, muscle, lung, stomach, testis, placenta) and\ three cell lines \ (GM06990 (lymphoblastoid), \ HL60 (acute promyelocytic leukemia) and\ HeLaS3 (cervix carcinoma)).
\\ The RACE reactions were then hybridized to 20 nucleotide-resolution\ Affymetrix tiling arrays covering the non-repeated regions of the 44\ ENCODE regions. The resulting "RACEfrags"\ -- array-detected fragments of RACE products -- were assessed for\ novelty by comparing their genome coordinates to those of\ GENCODE-annotated exons. Connectivity between novel RACEfrags and their\ respective index exon were further investigated by RT-PCR, cloning and\ sequencing. The resulting cDNA sequences (deposited in GenBank under\ accession numbers DQ655905-DQ656069 and\ EF070113-EF070122) were then fed into the HAVANA annotation pipeline as\ mRNA evidence (see "HAVANA manual annotations" below).\ \
\ The HAVANA\ process was used to produce these annotations.\
\\ Before the manual annotation process begins, an automated analysis pipeline\ for similarity searches and ab initio predictions is run\ on a computer farm and stored in an Ensembl MySQL\ database using a modified Ensembl analysis pipeline system. All\ searches and prediction algorithms, except CpG island prediction (see\ cpgreport in the EMBOSS application suite), are run on repeat-masked\ sequence. RepeatMasker is used to mask interspersed repeats, followed by Tandem\ repeats finder to mask tandem repeats.\
\ Nucleotide sequence databases are searched with wuBLASTN, and\ significant hits are re-aligned to the unmasked genomic sequence using\ est2genome.\ The UniProt protein database is searched with wuBLASTX, and the\ accession numbers of significant hits are found in the Pfam\ database. The hidden Markov models for Pfam protein domains are aligned\ against the genomic sequence using Genewise to provide annotation of\ protein domains.
\\ Several ab initio prediction algorithms are also run:\ Genescan and Fgenesh for genes, tRNAscan to find tRNAgenes and Eponine\ TSS to predict transcription start sites.
\\ Once the automated analysis is complete, the annotator uses a Perl/Tk\ based graphical interface, "otterlace", developed in-house at \ the Wellcome Trust Sanger Institute to edit annotation data held in a \ separate MySQL database system. The interface displays a rich,\ interactive graphical view of the genomic region, showing features such as \ database matches, gene predictions, and transcripts created by the\ annotators. Gapped alignments of nucleotide and protein blast hits to\ the genomic sequence are viewed and explored using the "Blixem"\ alignment viewer.
\\ Additionally, the "Dotter" dot plot tool is used to show the\ pairwise alignments of unmasked sequence, thus revealing the location\ of exons that are occasionally missed by the automated blast searches\ because of their small size and/or match to repeat-masked sequence.
\\ The interface provides a number of tools that the annotator uses to\ build genes and edit annotations: adding transcripts, exon coordinates,\ translation regions, gene names and descriptions, remarks and\ polyadenlyation signals and sites.
\ \\ See Harrow et al., 2006 for information on verification techniques.\
\ \\
This GENCODE release is the result of a collaborative effort among\
the following laboratories:\
\
\
\
\ Lab/Institution \ | \
Contributors \ | \
HAVANA annotation \ \ group, Wellcome Trust Sanger Insitute, Hinxton, UK | \Adam Frankish, Jonathan Mudge, James \
\ Gilbert, Tim Hubbard, Jennifer Harrow \ | \
Genome Bioinformatics\ \ Lab CRG, Barcelona, Spain | \France Denoeud, Julien Lagarde, Sylvain \ \ Foissac, Robert Castelo, Roderic Guigó (GENCODE Principal \ \ Investigator) | \
Department of \ \ Genetic Medicine and Development, University of Geneva, Switzerland | \Catherine Ucla, Carine Wyss,\ \ Caroline Manzano, Colette Rossier, Stylianos E. Antonorakis | \
Center for \ \ Integrative Genomics, University of Lausanne, Switzerland | \Jacqueline Chrast, Charlotte N.\ \ Henrichsen, Alexandre Reymond | \
Affymetrix, Inc., \ \ Santa Clara, CA, USA | \Philipp Kapranov, Thomas R. Gingeras | \
\ Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J, Lagarde J, \ Alioto TS, Manzano C, Chrast J et al. \ \ Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions \ Genome Res. 2007 Jun;17(6):746-759.
\\ Harrow J, Denoeud F, Frankish A, Reymond A, Chen CK, Chrast J,\ Lagarde J, Gilbert JG, Storey R, Swarbreck D et al. \ GENCODE: \ producing a reference annotation for ENCODE. Genome Biol. 2006;7 \ Suppl 1:S4.1-9.
\\ The ENCODE Project Consortium. \ Identification and analysis of\ functional elements in 1% of the human genome by the ENCODE pilot\ project\ Nature. 2007 Jun 14;447(7146):799-816.
\ encodeGenes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ cdsDrawDefault genomic codons\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion Mar 2007\ gClass_Artifact 163,168,163\ gClass_Known 33,91,51\ gClass_Novel_CDS 33,91,51\ gClass_Novel_Transcript 84,188,0\ gClass_Polymorphic 160,32,240\ gClass_Processed_pseudogene 0,91,191\ gClass_Putative 84,188,0\ gClass_TEC 84,188,0\ gClass_Unprocessed_pseudogene 0,91,191\ geneClasses Artifact Known Novel_CDS Novel_Transcript Putative TEC Polymorphic Processed_pseudogene Unprocessed_pseudogene\ group encodeGenes\ itemClassTbl encodeGencodeGeneClassMar07\ longLabel Gencode Gene Annotations (March 2007)\ origAssembly hg17\ priority 10.4\ shortLabel Gencode Genes Mar07\ superTrack encodeGencodeSuper dense\ track encodeGencodeGeneMar07\ type genePred\ visibility hide\ encodeGencodeRaceFrags Gencode RACEfrags genePred 5' RACE-Array experiments on Gencode loci 0 10.5 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX,\ RACEfrags are the products of 5’ RACE reactions performed on GENCODE \ genes (using the primers displayed in the subtrack "Gencode 5’ \ RACE primer") in 12 tissues and 3 cell lines (15 subtracks) followed \ by hybridization on ENCODE tiling arrays. Each RACEfrag is linked to the \ 5’ RACE primer but no other connectivity information is available \ from this experiment.\ \
\ For a detailed description of the methods and references used, see \ Denoeud et al., 2007.
\\ A combination of 5’ RACE and \ high-density tiling microarrays were used to empirically annotate 5’ \ transcription start sites (TSSs) and internal exons of all 410 annotated\ protein-coding loci across the 44 ENCODE regions (Oct. 2005 GENCODE\ freeze ; Harrow et al., 2006). Oligonucleotides for 5’ RACE \ experiments were chosen such that they map to a coding exon (the index exon) \ common to most of the transcripts of protein-coding gene loci annotated by \ the GENCODE (Oct. 2005 freeze). The 5’ RACE reactions were \ performed with oligonucleotides mapping to a coding exon (the index exon) \ on polyA+ RNA from twelve adult human tissues (brain, heart, kidney, spleen, \ liver, colon, small intestine, muscle, lung, stomach, testis, placenta) and \ three cell lines \ (GM06990 (lymphoblastoid), \ HL60 (acute promyelocytic leukemia) and\ HeLaS3 (cervix carcinoma)).
\\ The RACE reactions were then hybridized to 20 nucleotide-resolution \ Affymetrix tiling arrays covering the non-repeated regions of the 44 ENCODE \ regions. The resulting "RACEfrags" -- array-detected fragments of \ RACE products -- were assessed for novelty by comparing their genomic\ coordinates to those of GENCODE-annotated exons. \ \
\ Connectivity between novel RACEfrags and their respective index exon were \ investigated by RT-PCR using the 5’ RACE primer as one of the primers, \ followed by hybridization on tiling arrays. 385 RT-PCR reactions corresponding \ to 199 GENCODE loci were positive after hybridization on tiling arrays \ (244 RACE reactions). All positive RT-PCR reactions and a subset of those \ that were negative in the hybridization experiments were further verified by \ cloning and sequencing of the RT-PCR products. In most cases, eight clones were \ selected from each set of RT-PCR products for sequencing. To be retained in \ the dataset, these sequences must unambiguously map to the \ correct location, show splicing and pass manual inspection by the \ HAVANA team. By these criteria, 89 of these RT-PCR \ reactions (69 GENCODE loci) were positive after cloning and sequencing.\ (see Denoeud et al., 2007 for further details). \ The resulting cDNA sequences were deposited in GenBank under accession \ numbers DQ655905-DQ656069 and EF070113-EF070122. See additional information \ about the sequences \ here.\
\ \\
The RACEfrags result from a collaborative effort among the following \
laboratories:\
\
\
\ Lab/Institution \ \ | \
Contributors \ | \
Genome Bioinformatics\ \ Lab CRG, Barcelona, Spain | \France Denoeud, Julien Lagarde, Tyler Alioto, Sylvain \ \ Foissac, Robert Castelo, Roderic Guigó | \
Department of \ \ Genetic Medicine and Development, University of Geneva, Switzerland | \Catherine Ucla, Carine Wyss,\ \ Caroline Manzano, Colette Rossier, Stylianos E. Antonorakis | \
Center for \ \ Integrative Genomics, University of Lausanne, Switzerland | \Jacqueline Chrast, Charlotte N.\ \ Henrichsen, Alexandre Reymond | \
Affymetrix, Inc., \ \ Santa Clara, CA, USA | \Philipp Kapranov, Jorg Drenkow, Sujit Dike, Jill Cheng, Thomas R. Gingeras | \
HAVANA annotation \ \ group, Wellcome Trust Sanger Insitute, Hinxton, UK | \Adam Frankish, James \
\ Gilbert, Tim Hubbard, Jennifer Harrow \ \ | \
\ Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J, Lagarde J, \ Alioto TS, Manzano C, Chrast J et al. \ \ Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions.\ Genome Res. 2007 Jun;17(6):746-759.
\\ \ Harrow J, Denoeud F, Frankish A, Reymond A, Chen CK, Chrast J, Lagarde J, \ Gilbert JG, Storey R, Swarbreck D et al. \ GENCODE: \ producing a reference annotation for ENCODE. Genome Biol. 2006;7 \ Suppl 1:S4.1-9.
\\ The ENCODE Project Consortium. \ Identification and analysis of\ functional elements in 1% of the human genome by the ENCODE pilot\ project.\ Nature. 2007 Jun 14;447(7146):799-816.
\ encodeGenes 1 autoTranslate 0\ chromosomes chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion Mar 2007\ group encodeGenes\ longLabel 5' RACE-Array experiments on Gencode loci\ origAssembly hg17\ priority 10.5\ shortLabel Gencode RACEfrags\ superTrack encodeGencodeSuper\ track encodeGencodeRaceFrags\ type genePred\ visibility hide\ encodeGencodeGeneOct05 Gencode Genes Oct05 genePred Gencode Gene Annotations (October 2005) 0 10.6 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ The Gencode Gene track shows high-quality manual annotations in the\ ENCODE regions generated by the\ GENCODE project.\ A companion track, Gencode Introns, shows experimental gene structure \ validations for these annotations.
\\ The gene annotations are colored based on the Havana annotation type.\ Known and validated transcripts \ are colored dark green,\ putative and unconfirmed are light green,\ pseudogenes are blue,\ and artifacts are grey. \ The transcript types are defined in more detail in the accompanying table.\
\ The Gencode project recommends that the annotations\ with known and validated transcripts; i.e., the types Known, \ Novel_CDS, Novel_transcript_gencode_conf, and \ Putative_gencode_conf (which are colored dark green in the track display) \ be used as the reference annotation.\ \
\
Type | \Color | \Description | \|
---|---|---|---|
Known | \dark green | \Known protein coding genes (referenced in Entrez Gene, NCBI) | \|
Novel_CDS | \dark green | \Novel protein coding genes annotated by Havana (not referenced in Entrez Gene, NCBI) | \|
Novel_transcript_gencode_conf | \dark green | \Novel transcripts annotated by Havana (no ORF assigned) with at least\ one junction validated by RT-PCR | \|
Putative_gencode_conf | \dark green | \Putative transcripts (similar to "novel transcripts", EST supported,\ short, no viable ORF) with at least one junction validated by RT-PCR | \|
Novel_transcript | \light green | \Novel transcripts annotated by Havana (no ORF assigned) not validated \ by RT-PCR | \|
Putative | \light green | \Putative transcripts (similar to "novel transcripts", EST supported,\ short, no viable ORF) not validated by RT-PCR | \|
TEC | \light green | \Single exon objects (supported by multiple ESTs with polyA \ sites and signals) undergoing experimental validation/extension. | \ |
Processed_pseudogene | \blue | \Pseudogenes arising via retrotransposition (exon structure of parent gene lost) | \|
Unprocessed_pseudogene | \blue | \Pseudogenes arising via gene duplication (exon structure of parent gene retained) | \|
Artifact | \grey | \Transcript evidence and/or its translation equivocal | \
\ The Human and Vertebrate Analysis and Annotation manual curation process \ (HAVANA) was\ used to produce these annotations.\
\ Finished genomic sequence was analyzed on a clone-by-clone basis using a\ combination of similarity searches against DNA and protein databases, as\ well as a series of ab initio gene predictions. Nucleotide sequence \ databases were searched with WUBLASTN and significant hits were realigned\ to the unmasked genomic sequence by EST2GENOME. WUBLASTX was used to search \ the Uniprot protein database, and the accession numbers of significant hits \ were retrieved from the Pfam database. Hidden Markov models for Pfam protein \ domains were aligned against the genomic sequence using Genewise to provide\ annotation of protein domains. \
\ A number of ab initio\ prediction algorithms were also run: Genscan and Fgenesh for genes, tRNAscan \ to find tRNA genes, and Eponine TSS for transcription start site predictions.\
\ The annotators used the (AceDB-based) Otterlace interface to create and\ edit gene objects, which were then stored in a local database named \ Otter. In cases where predicted transcript structures from Ensembl \ are available, these can be viewed from within the Otterlace interface and \ may be used as starting templates for gene curation. Annotation in the Otter \ database is submitted to the EMBL/Genbank/DDBJ nucleotide database.
\ \\ The gene objects selected for verification came from various\ computational prediction methods and HAVANA annotations. \
RT-PCR and RACE experiments were performed on them, using a variety of human\ tissues, to confirm their structure. Human cDNAs from 24 different\ tissues (brain, heart, kidney, spleen, liver, colon, small intestine,\ muscle, lung, stomach, testis, placenta, skin, peripheral blood\ leucocytes, bone marrow, fetal brain, fetal liver, fetal kidney, fetal\ heart, fetal lung, thymus, pancreas, mammary gland, prostate) were\ synthesized using 12 poly(A)+ RNAs from Origene, eight from Clemente\ Associates/Quantum Magnetics and four from BD Biosciences as described in\ [Reymond et al., 2002a,b]. The relative amount of each cDNA was \ normalized by quantitative PCR using SyberGreen as intercalator and an \ ABI Prism 7700 Sequence Detection System.
\\ Predictions of human genes junctions were assayed experimentally by\ RT-PCR as previously described and modified [Reymond, 2002b;\ Mouse Genome Sequencing Consortium, 2002; Guigo, 2003]. \
\ Similar amounts of Homo\ sapiens cDNAs were mixed with JumpStart REDTaq ReadyMix (Sigma) and four\ ng/ul primers (Sigma-Genosys) with a BioMek 2000 robot (Beckman). The\ ten first cycles of PCR amplification were performed with a touchdown\ annealing temperatures decreasing from 60 to 50°C; annealing\ temperature of the next 30 cycles was carried out at 50°C. Amplimers\ were separated on "Ready to Run" precast gels (Pharmacia) and\ sequenced. RACE experiments were performed with the BD SMART RACE cDNA\ Amplification Kit following the manufacturer instructions (BD\ Biosciences).
\ \\ Click here for a complete list of people who participated in the \ GENCODE project.
\ \\ Ashurst, J.L. et al. \ The Vertebrate Genome Annotation (Vega) database. \ Nucleic Acids Res 33 (Database Issue), D459-65 \ (2005).
\\ Guigo, R. et al. \ Comparison of mouse and human genomes followed by experimental \ verification yields an estimated 1,019 additional genes. \ Proc Natl Acad Sci U S A 100(3), 1140-5 (2003).
\\ Mouse Genome Sequencing Consortium. \ Initial sequencing and comparative analysis of the mouse \ genome. Nature 420(6915), 520-62 (2002).
\\ Reymond, A. et al. \ Human chromosome 21 gene expression atlas in the mouse. \ Nature 420(6915), 582-6 (2002).
\\ Reymond, A. et al. \ Nineteen additional unpredicted transcripts from human \ chromosome 21. Genomics 79(6), 824-32 (2002).
\ encodeGenes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ cdsDrawDefault genomic codons\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ gClass_Artifact 163,168,163\ gClass_Known 33,91,51\ gClass_Novel_CDS 33,91,51\ gClass_Novel_transcript 84,188,0\ gClass_Novel_transcript_gencode_conf 33,91,51\ gClass_Processed_pseudogene 0,91,191\ gClass_Putative 84,188,0\ gClass_Putative_gencode_conf 33,91,51\ gClass_TEC 84,188,0\ gClass_Unprocessed_pseudogene 0,91,191\ geneClasses Artifact Known Novel_CDS Novel_transcript Novel_transcript_gencode_conf Putative Putative_gencode_conf TEC Processed_pseudogene Unprocessed_pseudogene\ group encodeGenes\ itemClassTbl encodeGencodeGeneClassOct05\ longLabel Gencode Gene Annotations (October 2005)\ origAssembly hg17\ priority 10.6\ shortLabel Gencode Genes Oct05\ superTrack encodeGencodeSuper\ track encodeGencodeGeneOct05\ type genePred\ visibility hide\ encodeGencodeIntronOct05 Gencode Introns Oct05 bed 6 + Gencode Intron Validation (October 2005) 0 10.7 0 0 0 127 127 127 0 0 20 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr8,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr9,chrX,\ The Gencode Intron Validation track shows\ gene structure validations generated by the\ GENCODE project.\ This track serves as a companion to the Gencode Genes track.
\\ The items in this track are colored based on the validation status determined \ via RT-PCR of exons flanking the intron:
\\
Status | \Color | \Validation Result | \
---|---|---|
RT_positive | \green | \Intron validated (RT-PCR product corresponds to expected junction) | \
RACE_validated | \green | \Intron validated (RACE product corresponds to expected junction) | \
RT_negative\ | red | \Intron not validated (no RT-PCR product was obtained) | \
RT_wrong_junction | \gold | \Intron not validated, but another junction exists between the two\ (RT-PCR product does not correspond to the expected junction) | \
\ Selected gene models from the Genecode Genes track were picked for RT-PCR \ and RACE verification experiments.\ RT-PCR and RACE experiments were performed on the objects, using a variety of \ human tissues, to confirm their structure. Human cDNAs from 24 different\ tissues (brain, heart, kidney, spleen, liver, colon, small intestine,\ muscle, lung, stomach, testis, placenta, skin, peripheral blood\ leucocytes, bone marrow, fetal brain, fetal liver, fetal kidney, fetal\ heart, fetal lung, thymus, pancreas, mammary gland, prostate) were\ synthesized using twelve poly(A)+ RNAs from Origene, eight from Clemente\ Associates/Quantum Magnetics and four from BD Biosciences as described in\ [Reymond et al., 2002a,b]. The relative amount of each cDNA was \ normalized with glyceraldehyde-3-phosphate dehydrogenase (GAPDH) by quantitative\ PCR using SyberGreen as intercalator and \ an ABI Prism 7700 Sequence Detection System.
\\ Predictions of human genes junctions were assayed experimentally by\ RT-PCR as previously described and modified [Reymond, 2002b;\ Mouse Genome Sequencing Consortium, 2002; Guigo, 2003]. \
\ Similar amounts of Homo\ sapiens cDNAs were mixed with JumpStart REDTaq ReadyMix (Sigma) and 4\ ng/ul primers (Sigma-Genosys) with a BioMek 2000 robot (Beckman). The\ ten first cycles of PCR amplification were performed with a touchdown\ annealing temperatures decreasing from 60 to 50°C; annealing\ temperature of the next 30 cycles was carried out at 50°C. Amplimers\ were separated on "Ready to Run" precast gels (Pharmacia) and\ sequenced. RACE experiments were performed with the BD SMART RACE cDNA\ Amplification Kit following the manufacturer instructions (BD\ Biosciences).
\ \\ Click here for a complete list of people who participated in the \ GENCODE project.
\ \\ Ashurst, J.L. et al. \ The Vertebrate Genome Annotation (Vega) database. \ Nucleic Acids Res 33 (Database Issue), D459-65 \ (2005).
\\ Guigo, R. et al. \ Comparison of mouse and human genomes followed by experimental \ verification yields an estimated 1,019 additional genes. \ Proc Natl Acad Sci U S A 100(3), 1140-5 (2003).
\\ Mouse Genome Sequencing Consortium.\ Initial sequencing and comparative analysis of the mouse \ genome. Nature 420(6915), 520-62 (2002).
\\ Reymond, A. et al. \ Human chromosome 21 gene expression atlas in the mouse. \ Nature 420(6915), 582-6 (2002).
\\ Reymond, A. et al. \ Nineteen additional unpredicted transcripts from human \ chromosome 21. Genomics 79(6), 824-32 (2002).
\ encodeGenes 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr8,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr9,chrX\ dataVersion ENCODE Oct 2005 Freeze\ group encodeGenes\ longLabel Gencode Intron Validation (October 2005)\ origAssembly hg17\ priority 10.7\ shortLabel Gencode Introns Oct05\ superTrack encodeGencodeSuper\ track encodeGencodeIntronOct05\ type bed 6 +\ visibility hide\ encodeAffyChIpHl60PvalCebpeHr02 Affy CEBPe RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 2hrs) P-Value 0 11 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 11\ shortLabel Affy CEBPe RA 2h\ subGroups factor=CEBPe time=2h\ track encodeAffyChIpHl60PvalCebpeHr02\ encodeAffyChIpHl60SignalStrictPol2Hr08 Affy Pol2 8h wig -2.78 3.97 Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 8hrs) Strict Signal 0 11 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 8hrs) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 11\ shortLabel Affy Pol2 8h\ subGroups factor=Pol2 time=8h\ track encodeAffyChIpHl60SignalStrictPol2Hr08\ encodeAffyChIpHl60SitesStrictRnapHr08 Affy Pol2 8h bed 3 . Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 8hrs) Strict Sites 0 11 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 8hrs) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 11\ shortLabel Affy Pol2 8h\ subGroups factor=Pol2 time=8h\ track encodeAffyChIpHl60SitesStrictRnapHr08\ encodeAffyChIpHl60PvalStrictPol2Hr08 Affy Pol2 8h wig 0 696.62 Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 8hrs) Strict P-Value 0 11 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP-chip (Pol2, retinoic acid-treated HL-60, 8hrs) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 11\ shortLabel Affy Pol2 8h\ subGroups factor=Pol2 time=8h\ track encodeAffyChIpHl60PvalStrictPol2Hr08\ encodeAffyEc1FetalSpleenSignal EC1 Sgnl Spleen wig 0 62385 Affy Ext Trans Signal (1-base window) (Fetal Spleen) 0 11 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 0 color 176,0,80\ longLabel Affy Ext Trans Signal (1-base window) (Fetal Spleen)\ parent encodeAffyEcSignal\ priority 11\ shortLabel EC1 Sgnl Spleen\ track encodeAffyEc1FetalSpleenSignal\ encodeAffyEc1FetalSpleenSites EC1 Sites Spleen bed 3 . Affy Ext Trans Sites (1-base window) (Fetal Spleen) 0 11 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 1 color 176,0,80\ longLabel Affy Ext Trans Sites (1-base window) (Fetal Spleen)\ parent encodeAffyEcSites\ priority 11\ shortLabel EC1 Sites Spleen\ track encodeAffyEc1FetalSpleenSites\ gap Gap bed 3 + Gap Locations 1 11 0 0 0 127 127 127 0 0 0\ This track depicts gaps in the assembly. Most of these gaps - with the\ exception of intractable heterochromatic, centromeric, telomeric, and short-arm \ gaps - have been closed during the finishing process, although a small number \ still remain. \
\ Gaps are represented as black boxes in this track.\ If the relative order and orientation of the contigs on either side\ of the gap is known, it is a bridged gap. In this case, a white line is \ drawn through the black box representing the gap and the gap is labeled \ "yes". \
\This assembly contains the following types of gaps:\
\ This super-track combines related tracks from the\ ENCODE Gene Annotation Assessment Project (EGASP)\ \ 2005 Gene Prediction Workshop. \ The goal of the workshop was to evaluate automatic methods for\ gene annotation of the human genome, with a focus on protein-coding genes.\ Predictions were evaluated in terms of their ability to\ reproduce the high-quality manually assisted GENCODE gene annotations \ and to predict novel transcripts.\
\ The EGASP Full track shows gene predictions covering all\ 44 ENCODE regions submitted before the GENCODE annotations were released.\ The EGASP Partial track shows gene predictions that cover\ some of the ENCODE regions, submitted before the GENCODE release.\ The EGASP Update track shows gene predictions that cover\ all ENCODE regions, submitted after the GENCODE release.\
\ These annotations were originally produced using the hg17 assembly.
\\ The following gene predictions are included:\
\ Click here for a complete list of people who participated in the\ GENCODE project.
\\ The following individuals and institutions provided the data for the subtracks\ in this annotation:\
\ Ashurst JL, Chen CK, Gilbert JG, Jekosch K, Keenan S, Meidl P, Searle SM,\ Stalker J, Storey R, Trevanion S et al.\ The Vertebrate Genome Annotation (Vega) database.\ Nucleic Acids Res. 2005 Jan 1;33(Database issue):D459-65.
\\ Guigo R, Dermitzakis ET, Agarwal P, Ponting CP, Parra G, Reymond A, Abril JF,\ Keibler E, Lyle R, Ucla C et al.\ Comparison of mouse and human genomes followed by experimental\ verification yields an estimated 1,019 additional genes.\ Proc Natl Acad Sci U S A. 2003 Feb 4;100(3):1140-5.\
\\ Mouse Genome Sequencing Consortium.\ Initial sequencing and comparative analysis of the mouse\ genome. Nature. 2002 Dec 5;420(6915):520-62.
\\ Reymond A, Marigo V, Yaylaoglu MB, Leoni A, Ucla C, Scamuffa N, Caccioppoli C, Dermitzakis ET, Lyle R, Banfi S et al.\ Human chromosome 21 gene expression atlas in the mouse.\ Nature. 2002 Dec 5;420(6915):582-6.
\\ Reymond A, Camargo AA, Deutsch S, Stevenson BJ, Parmigiani RB, Ucla C, Bettoni F, Rossier C, Lyle R, Guipponi M et al.\ Nineteen additional unpredicted transcripts from human\ chromosome 21. Genomics. 2002 Jun;79(6):824-32.
\\ Chatterji S, Pachter L.\ Multiple organism gene finding by collapsed Gibbs sampling.\ J Comput Biol. 2005 Jul-Aug;12(6):599-608.
\\ Siepel A, Haussler D.\ Computational identification of evolutionarily conserved\ exons.\ Proc. 8th Int'l Conf. on Research in Computational Molecular Biology.\ 2004;177-186.
\ \\ Stanke M, Waack S.\ Gene prediction with a hidden Markov model and a new intron\ submodel.\ Bioinformatics. 2003;19(Suppl. 2):ii215-ii225.
\\ Stanke M, Steinkamp R, Waack S, Morgenstern B.\ AUGUSTUS: a web server for gene finding in eukaryotes.\ Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W309-12.
\ \\ Solovyev VV.\ "Statistical approaches in Eukaryotic gene prediction".\ In Handbook of Statistical Genetics (eds. Balding D et al.)\ (John Wiley & Sons, Inc., 2001). p. 83-127.
\ \\ Blanco E, Parra G, Guigó R.\ "Using geneid to identify genes".\ In Current Protocols in Bioinformatics, Unit 4.3. (eds. Baxevanis AD.)\ (John Wiley & Sons, Inc., 2002).
\\ Guigó R.\ Assembling genes from predicted exons in linear time with\ dynamic programming.\ J Comput Biol. 1998 Winter;5(4):681-702.
\\ Guigó R, Knudsen S, Drake N, Smith T.\ Prediction of gene structure.\ J Mol Biol. 1992 Jul 5;226(1):141-57.
\\ Parra G, Blanco E, Guigó R.\ GeneID in Drosophila.\ Genome Res. 2000 Apr;10(4):511-5.
\ \\ Allen JE, Pertea M, Salzberg SL.\ Computational gene prediction using multiple sources of\ evidence.\ Genome Res. 2004 Jan;14(1):142-8.
\\ Allen JE, Salzberg SL.\ JIGSAW: integration of multiple sources of evidence for gene\ prediction.\ Bioinformatics. 2005 Sep 15;21(18):3596-603.
\ \\ Guigó R, Dermitzakis ET, Agarwal P, Ponting CP, Parra G,\ Reymond A, Abril JF, Keibler E, Lyle R, Ucla C et al.\ Comparison of mouse and human genomes followed by experimental\ verification yields an estimated 1,019 additional genes.\ Proc Natl Acad Sci U S A. 2003 Feb 4;100(3):1140-5.
\\ Parra G, Agarwal P, Abril JF, Wiehe T, Fickett JW, Guigó R.\ Comparative gene prediction in human and mouse.\ Genome Res. 2003 Jan;13(1):108-17.
\ \ encodeGenes 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeGenes\ longLabel ENCODE Gene Prediction Workshop (EGASP)\ priority 12.0\ shortLabel EGASP\ superTrack on\ track encodeEgaspSuper\ encodeEgaspFull EGASP Full genePred ENCODE Gene Prediction Workshop (EGASP) All ENCODE Regions 0 12 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows full sets of gene predictions covering all 44 ENCODE regions \ originally submitted for the ENCODE Gene Annotation Assessment Project \ (EGASP) Gene Prediction Workshop 2005. \ The following gene predictions are included:\
\ Data for each gene prediction method within this composite annotation track \ are displayed in a separate subtrack. See the top of the track description \ page for configuration options allowing display of selected subsets of gene\ predictions. To remove a subtrack from the display,\ uncheck the appropriate box.\
\ The individual subtracks within this annotation follow the display conventions \ for gene prediction\ tracks. Display characteristics specific to individual subtracks are \ described in the Methods section. The track description page offers the option \ to color and label codons in a zoomed-in display of the subtracks to facilitate \ validation and comparison of gene predictions. To enable this feature, select \ the genomic codons option from the "Color track by codons"\ menu. Click the\ Help on codon coloring\ link for more information about this feature.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing the different gene prediction methods.
\ \\ These annotations were generated using AceView. All mRNAs\ and cDNAs available in GenBank, excluding NMs, were co-aligned on the Gencode\ sections. The results were then examined and filtered to resemble Havana. \ The very restrictive view of Havana on CDS was not reproduced, due to a lack of\ experimental data.
\ \\ Candidate splice sites and coding starts/stops were evaluated using DNA\ alignments between the human assembly and seven other vertebrate species \ (UCSC multiz alignments, adding the frog and removing the chimp). Genes\ (single transcripts only) were then predicted using dynamic programming.
\ \\ The Ensembl annotation includes two types of predictions: protein-coding \ genes (the Ensembl Gene Predictions subtrack)\ and pseudogenes of protein-coding genes \ (the Ensembl Pseudogene Predictions subtrack). \ The Ensembl Pseudo track is not intended as a comprehensive annotation of \ pseudogenes, but rather\ an attempt to identify and label those gene predictions made by the Ensembl \ pipeline that have pseudogene characteristics. Exons that lie partially outside \ the ENCODE region are not included in the data set. The "Alternate \ Name" field on the subtrack details page shows the Ensembl ID for the \ selected gene or transcript.
\ \\ ExonHunter is a comprehensive gene-finder based on hidden Markov models (HMMs)\ allowing the use of a variety of additional sources of information (ESTs, \ proteins, genome-genome comparisons).
\ \\ Exogean annotates protein coding genes by combining mRNA and cross-species\ protein alignments in directed acyclic colored multigraphs where nodes and\ edges respectively represent biological objects and human expertise.\ Additional predictions and methods for this subtrack are available in the\ EGASP Updates track.
\ \\ Fgenesh is an HMM gene structure prediction program.\ This data set shows predictions of potential pseudogenes.
\ \\ These gene predictions were generated by Fgenesh++, a gene-finding program that\ uses both HMMs and protein similarity to find \ genes in a completely automated manner.
\ \\ The GeneID-U12 gene prediction set, generated using a version of GeneID modified\ to detect U12-dependent introns (both GT-AG and AT-AC subtypes) when present,\ employs a single-genome ab initio method.\ This modified version of GeneID uses matrices for U12 donor,\ acceptor and branch sites constructed from examples of published U12 \ intron splice junctions \ (both experimentally confirmed and expressed-sequence-validated predictions). \ Two GeneID-U12 subtracks are \ included: GeneID Gene Predictions and GeneID U12 Intron Predictions. The U12\ splice sites for features in the U12 Intron Predictions track are displayed\ on the track details pages. \ Additional predictions and methods for this subtrack are available in the\ EGASP Updates track.
\ \\ The eukaryotic version of the GeneMark.hmm (release 2.2) gene prediction\ program utilizes the HMM statistical model with duration or hidden\ semi-Markov model (HSMM). The HMM includes hidden states for initial, \ internal and terminal exons, introns, intergenic regions and single exon genes. \ It also includes the "border" states, such as start site (initiation \ codon), stop site (termination codons), and donor and acceptor splice sites. \ Sequences of all protein-coding regions were modeled by three periodic \ inhomogeneous Markov chains; sequences of non-coding regions were modeled by \ homogeneous Markov chains. Nucleotide sequences corresponding to the site \ states were modeled by position-specific inhomogeneous Markov chains. \ Parameters of the gene models were derived from the set of genes obtained by \ cDNA mapping to genomic DNA. To reflect variations in G+C composition of the\ genome, the gene model parameters were estimated separately for the three G+C \ regions.
\ \\ Jigsaw uses the output from gene-finders, splice-site prediction programs and \ sequence alignments to predict gene models. Annotation data downloaded from \ the UCSC Genome Browser and TIGR gene-finder output was used as input for these\ predictions. Jigsaw predicts both partial and complete genes. \ Additional predictions and methods for this subtrack are available in the\ EGASP Updates track.
\ \\ The pairHMM-based alignment program, Pairagon, was used to align\ high-quality mRNA sequences to the ENCODE regions. These were\ supplemented with N-SCAN EST predictions which are displayed in the\ Pairgn/NSCAN-E subtrack, and extended further with additional\ transcripts from the Brent Lab to produce the predictions\ displayed as the Pairgn/NSCAN-E/+ subtrack. The NSCAN subtrack \ contains only predictions from the N-SCAN program. \
\ \\ The SGP2-U12 gene prediction set, generated using a version of GeneID modified \ to detect U12-dependent introns (both AT-AC and GT-AG subtypes) when present,\ employs a dual-genome method (SGP2) that utilizes similarity (tblastx) to \ mouse genomic sequence syntenic to the ENCODE regions (Oct. 2004 MSA freeze). \ This modified version of GeneID uses matrices for U12 \ donor, acceptor and branch sites constructed from examples of published U12 \ intron splice junctions (both experimentally confirmed and \ expressed-sequence-validated predictions). Two SGP2-U12 subtracks are \ included: SGP2 Gene Predictions and SGP2 U12 Intron Predictions.\ The U12 splice sites for features in the U12 Intron Predictions track are \ displayed on the track details pages. \ Additional predictions and methods for this subtrack are available in the\ EGASP Updates track.
\ \\ This exon-only prediction set was produced using SPIDA (Substitution Periodicity\ Index and Domain Analysis). Exons derived by mapping ESTs to the genome were\ validated by seeking periodic substitution patterns in the aligned informant \ DNA sequences. First, all\ available ESTs were mapped to the genome using Exonerate. The resulting\ transcript structures were "flattened" to remove redundancy. Each \ exon of the flattened transcripts was subjected to SPI analysis, which involves\ identifying periodicity in the pattern of mutations occurring between the human\ and an informant species DNA sequence (the informant sequences and their TBA\ alignments were provided by Elliott Margulies). SPI was calculated for all \ available human-informant pairs for whole exons and in a sliding 48 bp window. \ SPI analysis requires that a threshold level of periodicity be identified in at\ least two of the informant species if the exon is to be accepted. If accepted,\ SPI provides the correct frame for translation of the exon. This exon was used \ as a starting point for extending the ORF coding region of the flattened\ transcript from which it came. This gave a full or partial CDS; different exons\ may give different CDSs. The CDSs were translated and searched for domains using\ hmmpfam and Pfam_fs. Only transcripts with a domain hit with e > 1.0 were\ retained. Heuristics were applied to the retained CDSs to identify problems with\ the transcript structure, particularly frame-shifts. Many transcripts may\ identify the same exon, but only a single instance of each exon has been \ retained.
\ \\ This gene prediction set was produced by a version of Twinscan that employs \ multiple pairwise genome comparisons to identify protein-coding genes (including\ alternative splices) using nucleotide homology information. No expression or \ protein data were used.
\ \\ The following individuals and institutions provided the data for the subtracks \ in this annotation:\
\ The pseudoautosomal regions (PARs) are sections of the X and Y chromosomes\ that undergo homologous recombination. Genes in these regions are inherited\ in the same manner as autosomal genes. Two PAR regions, PAR1 and PAR2,\ have been identified in human.\ The pseudoautosomal regions allow the X and Y chromosomes to pair and segregate during\ meiosis in males.\
\\ This track shows gene predictions submitted for the ENCODE Gene Annotation \ Assessment Project \ (EGASP) Gene Prediction Workshop 2005 that cover only\ a partial set of the 44 ENCODE regions. The partial set excludes\ the 13 ENCODE regions for which high-quality annotations were released in late\ 2004.\ The following gene predictions are included:\
\ The EGASP Full companion track shows original gene prediction submissions for \ the full set of 44 ENCODE regions using Gene Prediction algorithms other than \ those used here; the EGASP Update track shows updated versions\ of some of the submitted predictions.\ \\ Data for each gene prediction method within this composite annotation track \ is displayed in a separate subtrack. See the top of the track description page \ for a complete list of the subtracks available for this annotation. To display\ only selected subtracks, uncheck the boxes next to the tracks you wish to\ hide. \
\ The individual subtracks within this annotation follow the display conventions \ for gene prediction\ tracks. The track description page offers the option \ to color and label codons in a zoomed-in display of the subtracks to facilitate \ validation and comparison of gene predictions. To enable this feature, select \ the genomic codons option from the "Color track by codons"\ menu. Click the\ Help on codon coloring\ link for more information about this feature.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing the different gene prediction methods.
\ \\ ACEScan (Alternative Conserved Exons Scan) \ indicates alternative splicing that is evolutionarily conserved in human and \ mouse/rat. The Conserved Alternative Exon Predictions subtrack shows\ predicted alternative conserved exons. The Unconserved Alternative and \ Constitutive Exon Predictions subtrack shows exons that \ are predicted to be constitutive or may have species-specific alternative \ splicing.
\ \
\ Augustus uses a generalized hidden Markov model (GHMM) that\ models coding and non-coding sequence, splice sites, the branch point region, \ translation start and end, and lengths of exons and introns. The track \ contains four different sets of predictions. Ab initio\ single genome predictions are based solely on the input sequence. EST and\ protein evidence predictions were generated using AGRIPPA hints based on \ alignments of human sequence from the dbEST and nr databases. Mouse homology \ gene predictions were produced using mouse genomic sequence only; BLAST, CHAOS, \ DIALIGN were used to generate the hints for Augustus. The combined \ EST/protein evidence and mouse homology gene predictions were created using \ human sequence from the dbEST and nr databases and mouse genomic sequence to \ generate hints for Augustus.\ Additional predictions and methods for this subtrack are available in the\ EGASP Updates track.
\ \\ GeneZilla is a program for the computational prediction of protein-coding genes \ in eukaryotic DNA, based on the generalized hidden Markov model (GHMM) \ framework. These predictions were generated using GeneZilla and \ IsoScan, which uses a four-state hidden Markov model to \ predict isochores (regions of homogeneous G+C content) in genomic DNA.
\ \\ SAGA is an ab initio multiple-species gene-finding program based on the\ Gibbs sampling-based method described in Chatterji et al. (2004). In \ addition to sampling parameters, SAGA also uses a phyloHMM based model to \ boost the scores, similar to the method described in Siepel et al.\ (2004).
\ \\ The gene prediction data sets were submitted by the following individuals and \ institutions:\
\ Chatterji, S. and Pachter, L. \ Multiple organism gene finding by collapsed Gibbs sampling. \ Proc. 8th Int'l Conf. on Research in Computational Molecular Biology, \ 187-193 (2004).
\\ Siepel, A. and Haussler, D. \ Computational identification of evolutionarily conserved \ exons. \ Proc. 8th Int'l Conf. on Research in Computational Molecular Biology, \ 177-186 (2004).
\ encodeGenes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ cdsDrawDefault genomic codons\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ longLabel ENCODE Gene Prediction Workshop (EGASP) for Partial ENCODE Regions\ origAssembly hg17\ priority 12.5\ shortLabel EGASP Partial\ superTrack encodeEgaspSuper\ track encodeEgaspPartial\ type genePred\ visibility hide\ encodeEgaspUpdate EGASP Update genePred ENCODE Gene Prediction Workshop (EGASP) Updates 0 12.7 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows updated versions of gene predictions submitted for the \ ENCODE Gene Annotation Assessment Project \ (EGASP) Gene Prediction Workshop 2005.\ The following gene predictions are included:\
\ The original EGASP submissions are displayed in the companion tracks, \ EGASP Full and EGASP Partial.\ \\ Data for each gene prediction method within this composite annotation track \ are displayed in separate subtracks. See the top of the track description page \ for a complete list of the subtracks available for this annotation. To display\ only selected subtracks, uncheck the boxes next to the tracks you wish to\ hide. \
\ The individual subtracks within this annotation follow the display conventions \ for gene prediction\ tracks. Display characteristics specific to individual subtracks are \ described in the Methods section. The track description page offers the option \ to color and label codons in a zoomed-in display of the subtracks to facilitate \ validation and comparison of gene predictions. To enable this feature, select \ the genomic codons option from the "Color track by codons"\ menu. Click the\ Help on codon coloring\ link for more information about this feature.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing the different gene prediction methods.
\ \\ Augustus uses a generalized hidden Markov model (GHMM) that models \ coding and non-coding sequence, splice sites, the branch point region, \ the translation start and end, and the lengths of exons and introns. \ This version has been trained on a set of 1284 human genes.\ The track contains four sets of predictions: ab initio,\ EST and protein-based, mouse homology-based, and those using\ EST/protein and mouse homology evidence as additional input to Augustus\ for the predictions.
\\ The EST and protein evidence was generated by aligning sequences from the dbEST \ and nr databases to the ENCODE region using wublastn and wublastx.\ The resulting alignments were used to generate hints about putative splice \ sites, exons, coding regions, introns, translation start and \ translation stop.
\\ The mouse homology evidence was generated by aligning pairs of human and\ mouse genomic sequences using the program \ DIALIGN. Regions conserved at the peptide level were used to \ generate hints about coding regions.
\ \\ Exogean produces alternative transcripts by combining mRNA and cross-species \ sequence alignments using heuristic rules. The program implements a generic \ framework based on directed acyclic colored multigraphs (DACMs). In Exogean, \ DACM nodes represent biological objects (mRNA or protein HSPs/transcripts) and\ multiple edges between nodes represent known relationships between these \ objects derived from human expertise. Exogean DACMs are succesively built and \ reduced, leading to increasingly complex objects. This process\ enables the production of alternative transcripts from initial HSPs.
\ \\ FGenesh++ predictions are based on hidden Markov models and protein similarity to\ the NR database. For more information, see the reference below.\ \
\ The GeneID program predicts genes in anonymous genomic sequences \ designed with a hierarchical structure.\ In the first step, splice sites, start and stop codons are predicted and scored \ along the sequence using position weight arrays (PWAs).\ Next, exons are built from the sites. Exons are scored as the sum of the scores \ of the defining sites plus the the log-likelihood ratio of a Markov model for \ coding DNA.\ Finally, the gene structure is assembled from the set of predicted exons, \ maximizing the sum of the scores of the assembled exons.\ The modified version of GeneID used to generate the predictions in this track \ incorporates models for U12-dependent splice signals in addition to U2 splice \ signals.
\\ The GeneID subtrack shows all GeneID genes. Only U12 introns\ and their flanking exons are displayed in the GeneID U12 subtrack.\ Exons flanking predicted U12-dependent introns are assigned a type\ attribute reflecting their splice sites, displayed on\ the details page of the GeneID U12 subtrack as the "Alternate Name" \ of the item composed of the intron plus flanking exons.
\ \\ Jigsaw is a gene prediction program that determines genes based on \ target genomic sequence and output from a gene structure annotation database.\ Data downloaded from UCSC's annotation database is \ used as input and includes the following tracks of evidence:\ Known Genes, Ensembl, RefSeq, GeneID, Genscan, SGP, Twinscan, Human mRNAs,\ TIGR Gene Index, UniGene, Most Conserved Elements and Non-human RefSeq Genes.\ GlimmerHMM and GeneZilla, two open source ab initio gene-finding \ programs based on GHMMs, are also used.
\ \\ To predict genes in a genomic query, SGP2 combines GeneID predictions with \ tblastx comparisons of the genomic query against other genomic sequences.\ This modified version of SGP2 uses models for U12-dependent splice signals \ in addition to U2 splice signals. The reference genomic sequence for this data \ set is the Oct. 2004 release of mouse sequence syntenic to ENCODE regions.
\\ The SGP2 and SGP2 U12 tracks follow the same display conventions as the \ GeneID and GeneID U12 subtracks described above.
\ \\ For this analysis, pseudogenes were defined as genomic sequences similar \ to known human genes and with various disablements (premature stop codons or\ frameshifts) in their "putative" protein-coding regions.
\\ The protein sequences of known human genes (as annotated by ENSEMBL) were used\ to search for similar nongenic sequences in ENCODE regions. The matching\ sequences were assessed as disabled copies of genes based on the occurrences of\ premature stop codons or frameshifts. The intron-exon structure of the\ functional gene was further used to infer whether a pseudogene was duplicated\ or processed (a duplicated pseudogene keeps the intron-exon structure of its\ parent functional gene). Small pseudogene sequences were labeled as fragments or\ other types.
\\ All pseudogenes in this track were manually curated.\ In the browser, the track details page shows the pseudogene type.
\ \\ Augustus was written by Mario Stanke at the\ Department of \ Bioinformatics of the University of Göttingen in Germany.
\\ Exogean was developed by Sarah Djebali and Hugues Roest Crollius from the\ Dyogen Lab, Ecole \ Normale Supérieure (Paris, France) and Franck Delaplace\ from the Laboratoire de Méthodes Informatiques \ (LaMI), (Evry, \ France).
\\ The FGenesh++ gene predictions were provided by Victor Solovyev of\ Softberry Inc.\
\ The GeneID-U12 and SGP2-U12 programs were developed by the\ Grup de Recerca en Informàtica Biomèdica \ (GRIB) at \ the Institut Municipal d'Investigació Mèdica (IMIM) in Barcelona.\ The version of GeneID on which GeneID-U12 is based (geneid_v1.2) was written by \ Enrique Blanco and Roderic Guigó.\ The parameter files were constructed by Genis Parra and Francisco Camara.\ Additional contributions were made by Josep F. Abril, Moises Burset and Xavier \ Messeguer. Modifications to GeneID that allow for the prediction of \ U12-dependent splice sites and incorporation of U12 introns into gene models \ were made by Tyler Alioto.
\\ Jigsaw was developed at The Institute for Genomic Research \ (TIGR)\ by Jonathan Allen and Steven Salzberg,\ with computational gene-finder contributions from Mihaela Pertea and William \ Majoros. Continued maintenance and development of Jigsaw will\ be provided by the Salzberg group at the Center for Bioinformatics \ and Computational Biology \ (CBCB) at the \ University of Maryland, College Park.
\\ The Yale Pseudogenes were generated by the pseudogene annotation group of \ Mark Gerstein at Yale \ University.
\ \\ Stanke, M. \ Gene prediction with a hidden Markov model.\ Ph.D. thesis, Universität Göttingen, Germany (2004).
\\ Stanke, M. and Waack, S. \ Gene prediction with a hidden Markov model and a new intron \ submodel.\ Bioinformatics, 19(Suppl. 2), ii215-ii225 (2003).
\\ Stanke, M., Steinkamp, R., Waack, S. and Morgenstern, B. \ AUGUSTUS: a web server for gene finding in eukaryotes.\ Nucl. Acids Res., 32, W309-W312 (2004).
\ \\ Solovyev V.V. \ "Statistical approaches in Eukaryotic gene prediction".\ In Handbook of Statistical Genetics (eds. Balding D. et al.)\ (John Wiley & Sons, Inc., 2001). p. 83-127.
\ \\ Blanco, E., Parra, G. and Guigó, R. \ "Using geneid to identify genes". \ In Current Protocols in Bioinformatics, Unit 4.3. (ed. Baxevanis, A.D.)\ (John Wiley & Sons, Inc., 2002).
\\ Guigó, R. \ Assembling genes from predicted exons in linear time with \ dynamic programming. \ J Comput Biol. 5(4), 681-702 (1998).
\\ Guigó, R., Knudsen, S., Drake, N. and Smith, T.\ Prediction of gene structure. \ J Mol Biol. 226(1), 141-57 (1992).
\\ Parra, G., Blanco, E. and Guigó, R. \ GeneID in Drosophila. \ Genome Research 10(4), 511-515 (2000).
\ \\ Allen, J.E., Pertea, M. and Salzberg, S.L.\ Computational gene prediction using multiple sources of \ evidence. \ Genome Res., 14(1), 142-8 (2004).
\\ Allen, J.E. and Salzberg, S.L.\ JIGSAW: integration of multiple sources of evidence for gene \ prediction.\ Bioinformatics 21(18), 3596-3603 (2005).
\ \\ Guigó, R., Dermitzakis, E.T., Agarwal, P., Ponting, C.P., Parra, G., \ Reymond, A., Abril, J.F., Keibler, E., Lyle, R., Ucla, C. et al. \ Comparison of mouse and human genomes followed by experimental \ verification yields an estimated 1,019 additional genes. \ Proc Natl Acad Sci U S A 100(3), 1140-5 (2003).
\\ Parra, G., Agarwal, P., Abril, J.F., Wiehe, T., Fickett, J.W. and Guigó, R. \ Comparative gene prediction in human and mouse. \ Genome Res. 13(1), 108-17 (2003).
\ encodeGenes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ cdsDrawDefault genomic codons\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ longLabel ENCODE Gene Prediction Workshop (EGASP) Updates\ origAssembly hg17\ priority 12.7\ shortLabel EGASP Update\ superTrack encodeEgaspSuper dense\ track encodeEgaspUpdate\ type genePred\ visibility hide\ encodeAffyChIpHl60PvalCebpeHr08 Affy CEBPe RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 8hrs) P-Value 0 13 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 13\ shortLabel Affy CEBPe RA 8h\ subGroups factor=CEBPe time=8h\ track encodeAffyChIpHl60PvalCebpeHr08\ encodeAffyRnaHeLaSitesIntronsDistal Affy In Dst HeLa bed 4 . Affy Intronic Distal HeLa Transfrags 0 13 225 30 255 240 142 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 225,30,255\ longLabel Affy Intronic Distal HeLa Transfrags\ parent encodeNoncodingTransFrags\ priority 13\ shortLabel Affy In Dst HeLa\ subGroups region=intronicDistal celltype=hela source=affy\ track encodeAffyRnaHeLaSitesIntronsDistal\ encodeAffyChIpHl60SignalStrictp63_ActD Affy p63 ME-180+ wig -2.78 3.97 Affymetrix ChIP-chip (p63, actinomycin-D treated ME-180) Strict Signal 0 13 0 0 200 127 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,0,200\ longLabel Affymetrix ChIP-chip (p63, actinomycin-D treated ME-180) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 13\ shortLabel Affy p63 ME-180+\ subGroups factor=actd time=0h\ track encodeAffyChIpHl60SignalStrictp63_ActD\ encodeAffyChIpHl60SitesStrictP63_ActD Affy p63 ME-180+ bed 3 . Affymetrix ChIP-chip (p63, actinomycin-D treated ME-180) Strict Sites 0 13 0 0 200 127 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 0,0,200\ longLabel Affymetrix ChIP-chip (p63, actinomycin-D treated ME-180) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 13\ shortLabel Affy p63 ME-180+\ subGroups factor=actd time=0h\ track encodeAffyChIpHl60SitesStrictP63_ActD\ encodeAffyChIpHl60PvalStrictp63_ActD Affy p63 ME-180+ wig 0 696.62 Affymetrix ChIP-chip (p63, actinomycin-D treated ME-180) Strict P-Value 0 13 0 0 200 127 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,0,200\ longLabel Affymetrix ChIP-chip (p63, actinomycin-D treated ME-180) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 13\ shortLabel Affy p63 ME-180+\ subGroups factor=actd time=0h\ track encodeAffyChIpHl60PvalStrictp63_ActD\ encodeAffyEc1PlacentaSignal EC1 Sgnl Placen wig 0 62385 Affy Ext Trans Signal (1-base window) (Placenta) 0 13 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 0 color 176,0,80\ longLabel Affy Ext Trans Signal (1-base window) (Placenta)\ parent encodeAffyEcSignal\ priority 13\ shortLabel EC1 Sgnl Placen\ track encodeAffyEc1PlacentaSignal\ encodeAffyEc1PlacentaSites EC1 Sites Placen bed 3 . Affy Ext Trans Sites (1-base window) (Placenta) 0 13 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 1 color 176,0,80\ longLabel Affy Ext Trans Sites (1-base window) (Placenta)\ parent encodeAffyEcSites\ priority 13\ shortLabel EC1 Sites Placen\ track encodeAffyEc1PlacentaSites\ hapMapRelease24CombinedRecombMap HapMap bigWig -1.0 91.6 HapMap Release 24 combined recombination map 0 13 50 50 50 152 152 152 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 50,50,50\ configurable on\ longLabel HapMap Release 24 combined recombination map\ parent otherMaps\ priority 13\ shortLabel HapMap\ subGroups view=other\ track hapMapRelease24CombinedRecombMap\ type bigWig -1.0 91.6\ encodeUcsdChipHeLaH3H4TAF250_p0 LI TAF1 -gIF bedGraph 4 Ludwig Institute ChIP-chip: TAF1, HeLa cells, no gamma interferon 0 13 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: TAF1, HeLa cells, no gamma interferon\ parent encodeLIChIPgIF\ priority 13\ shortLabel LI TAF1 -gIF\ track encodeUcsdChipHeLaH3H4TAF250_p0\ missingHg Missing Human psl . Unplaced Human RefSeq Genes Blatted against Mouse Translated 0 13 0 100 0 255 240 200 0 0 0 map 1 altColor 255,240,200\ color 0,100,0\ group map\ longLabel Unplaced Human RefSeq Genes Blatted against Mouse Translated\ priority 13\ shortLabel Missing Human\ track missingHg\ type psl .\ visibility hide\ encodeEgaspFullPairagonAny Pairgn/NSCAN-E/+ genePred Pairagon/NSCAN Any Evidence Gene Predictions 0 13 12 65 165 133 160 210 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,65,165\ longLabel Pairagon/NSCAN Any Evidence Gene Predictions\ parent encodeEgaspFull\ priority 13\ shortLabel Pairgn/NSCAN-E/+\ track encodeEgaspFullPairagonAny\ encodeGencodeRaceFragsTestis RACEfrags Testis genePred Gencode RACEfrags from Testis 0 13 128 0 128 191 127 191 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 128,0,128\ longLabel Gencode RACEfrags from Testis\ parent encodeGencodeRaceFrags\ priority 13\ shortLabel RACEfrags Testis\ track encodeGencodeRaceFragsTestis\ encodeStanfordPromotersMRC5 Stan Pro MRC5 bed 9 + Stanford Promoter Activity (MRC5 cells) 0 13 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (MRC5 cells)\ parent encodeStanfordPromoters\ priority 13\ shortLabel Stan Pro MRC5\ track encodeStanfordPromotersMRC5\ encodeYaleAffyNB4RARNATransMap Yale RNA NB4 RA wig -2730 3394 Yale NB4 RNA Transcript Map, Treated with Retinoic Acid 0 13 150 50 50 202 152 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 150,50,50\ longLabel Yale NB4 RNA Transcript Map, Treated with Retinoic Acid\ parent encodeYaleAffyRNATransMap\ priority 13\ shortLabel Yale RNA NB4 RA\ subGroups celltype=nb4 samples=samples\ track encodeYaleAffyNB4RARNATransMap\ encodeYaleAffyNB4RARNATars Yale TAR NB4 RA bed 3 . Yale NB4 RNA, TAR, Treated with Retinoic Acid 0 13 150 50 50 202 152 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 150,50,50\ longLabel Yale NB4 RNA, TAR, Treated with Retinoic Acid\ parent encodeYaleAffyRNATars\ priority 13\ shortLabel Yale TAR NB4 RA\ subGroups celltype=nb4 samples=samples\ track encodeYaleAffyNB4RARNATars\ encodeAffyChIpHl60SitesCebpeHr08 Affy CEBPe RA 8h bed 3 . Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 8hrs) Sites 0 14 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 14\ shortLabel Affy CEBPe RA 8h\ subGroups factor=CEBPe time=8h\ track encodeAffyChIpHl60SitesCebpeHr08\ encodeAffyRnaHl60SitesHr00IntronsDistal Affy In Dst HL60 bed 4 . Affy Intronic Distal Hl60 Transfrags 0 14 200 55 255 227 155 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 200,55,255\ longLabel Affy Intronic Distal Hl60 Transfrags\ parent encodeNoncodingTransFrags\ priority 14\ shortLabel Affy In Dst HL60\ subGroups region=intronicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr00IntronsDistal\ encodeAffyChIpHl60SignalStrictp63_mActD Affy p63 ME-180 wig -2.78 3.97 Affymetrix ChIP-chip (p63, ME-180) Strict Signal 0 14 0 0 200 127 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,0,200\ longLabel Affymetrix ChIP-chip (p63, ME-180) Strict Signal\ parent encodeAffyChIpHl60SignalStrict\ priority 14\ shortLabel Affy p63 ME-180\ subGroups factor=mactd time=0h\ track encodeAffyChIpHl60SignalStrictp63_mActD\ encodeAffyChIpHl60SitesStrictP63_mActD Affy p63 ME-180 bed 3 . Affymetrix ChIP-chip (p63, ME-180) Strict Sites 0 14 0 0 200 127 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 0,0,200\ longLabel Affymetrix ChIP-chip (p63, ME-180) Strict Sites\ parent encodeAffyChIpHl60SitesStrict\ priority 14\ shortLabel Affy p63 ME-180\ subGroups factor=mactd time=0h\ track encodeAffyChIpHl60SitesStrictP63_mActD\ encodeAffyChIpHl60PvalStrictp63_mActD Affy p63 ME-180 wig 0 696.62 Affymetrix ChIP-chip (p63, ME-180) Strict P-Value 0 14 0 0 200 127 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,0,200\ longLabel Affymetrix ChIP-chip (p63, ME-180) Strict P-Value\ parent encodeAffyChIpHl60PvalStrict\ priority 14\ shortLabel Affy p63 ME-180\ subGroups factor=mactd time=0h\ track encodeAffyChIpHl60PvalStrictp63_mActD\ clonePos Coverage clonePos Clone Coverage 0 14 0 0 0 180 180 180 0 0 0\ In dense display mode, this track shows the coverage level of \ the genome. Finished regions are depicted in black. Draft regions \ are shown in various shades of gray that correspond\ to the level of coverage. \
\ In full display mode, this track shows the position of each clone that aligns\ to the genome sequence. Finished clones are depicted in black, and unfinished\ clones are colored gray. NOTE: Fragment positions in unfinished clones are no \ longer delineated.\
\ map 0 altColor 180,180,180\ group map\ longLabel Clone Coverage\ priority 14\ shortLabel Coverage\ track clonePos\ type clonePos\ visibility hide\ encodeAffyEc51PlacentaSignal EC51 Sgnl Placen wig 0 62385 Affy Ext Trans Signal (51-base window) (Placenta) 0 14 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 0 color 176,0,80\ longLabel Affy Ext Trans Signal (51-base window) (Placenta)\ parent encodeAffyEcSignal\ priority 14\ shortLabel EC51 Sgnl Placen\ track encodeAffyEc51PlacentaSignal\ encodeAffyEc51PlacentaSites EC51 Site Placen bed 3 . Affy Ext Trans Sites (51-base window) (Placenta) 0 14 176 0 80 215 127 167 0 0 2 chr21,chr22, encodeTxLevels 1 color 176,0,80\ longLabel Affy Ext Trans Sites (51-base window) (Placenta)\ parent encodeAffyEcSites\ priority 14\ shortLabel EC51 Site Placen\ track encodeAffyEc51PlacentaSites\ hapMapRelease24CEURecombMap HapMap CEU bigWig 0.0 111.0 HapMap Release 24 CEU recombination map 0 14 80 80 80 167 167 167 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 80,80,80\ configurable on\ longLabel HapMap Release 24 CEU recombination map\ parent otherMaps\ priority 14\ shortLabel HapMap CEU\ subGroups view=other\ track hapMapRelease24CEURecombMap\ type bigWig 0.0 111.0\ encodeUcsdChipHeLaH3H4TAF250_p30 LI TAF1 +gIF bedGraph 4 Ludwig Institute ChIP-chip: TAF1, HeLa cells, 30 min. after gamma interferon 0 14 109 51 43 182 153 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 109,51,43\ longLabel Ludwig Institute ChIP-chip: TAF1, HeLa cells, 30 min. after gamma interferon\ parent encodeLIChIPgIF\ priority 14\ shortLabel LI TAF1 +gIF\ track encodeUcsdChipHeLaH3H4TAF250_p30\ encodeEgaspFullPairagonMultiple NSCAN genePred N-SCAN Gene Predictions 0 14 12 85 135 133 170 195 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,85,135\ longLabel N-SCAN Gene Predictions\ parent encodeEgaspFull\ priority 14\ shortLabel NSCAN\ track encodeEgaspFullPairagonMultiple\ encodeGencodeRaceFragsGM06990 RACEfrags GM06990 genePred Gencode RACEfrags from GM06990 cells 0 14 0 0 205 127 127 230 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 0,0,205\ longLabel Gencode RACEfrags from GM06990 cells\ parent encodeGencodeRaceFrags\ priority 14\ shortLabel RACEfrags GM06990\ track encodeGencodeRaceFragsGM06990\ encodeStanfordPromotersPanc1 Stan Pro Panc1 bed 9 + Stanford Promoter Activity (Panc1 cells) 0 14 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (Panc1 cells)\ parent encodeStanfordPromoters\ priority 14\ shortLabel Stan Pro Panc1\ track encodeStanfordPromotersPanc1\ encodeYaleAffyNB4TPARNATransMap Yale RNA NB4 TPA wig -2730 3394 Yale NB4 RNA Transcript Map, Treated with 12-O-tetradecanoylphorbol-13 Acetate (TPA) 0 14 120 50 50 187 152 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 120,50,50\ longLabel Yale NB4 RNA Transcript Map, Treated with 12-O-tetradecanoylphorbol-13 Acetate (TPA)\ parent encodeYaleAffyRNATransMap\ priority 14\ shortLabel Yale RNA NB4 TPA\ subGroups celltype=nb4 samples=samples\ track encodeYaleAffyNB4TPARNATransMap\ encodeYaleAffyNB4TPARNATars Yale TAR NB4 TPA bed 3 . Yale NB4 RNA, TAR, Treated with 12-O-tetradecanoylphorbol-13 Acetate (TPA) 0 14 120 50 50 187 152 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 120,50,50\ longLabel Yale NB4 RNA, TAR, Treated with 12-O-tetradecanoylphorbol-13 Acetate (TPA)\ parent encodeYaleAffyRNATars\ priority 14\ shortLabel Yale TAR NB4 TPA\ subGroups celltype=nb4 samples=samples\ track encodeYaleAffyNB4TPARNATars\ encodeAffyChIpHl60PvalCebpeHr32 Affy CEBPe RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 32hrs) P-Value 0 15 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 15\ shortLabel Affy CEBPe RA 32h\ subGroups factor=CEBPe time=32h\ track encodeAffyChIpHl60PvalCebpeHr32\ encodeAffyRnaHl60SitesHr02IntronsDistal Affy In Dst HL60 2h bed 4 . Affy Intronic Distal Hl60 2hr Transfrags 0 15 175 80 255 215 167 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 175,80,255\ longLabel Affy Intronic Distal Hl60 2hr Transfrags\ parent encodeNoncodingTransFrags\ priority 15\ shortLabel Affy In Dst HL60 2h\ subGroups region=intronicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr02IntronsDistal\ bacEndPairs BAC End Pairs bed 6 + BAC End Pairs 0 15 0 0 0 80 80 80 0 0 0\ Bacterial artificial chromosomes (BACs) are a key part of many \ large-scale sequencing projects. A BAC typically consists of 25 - 350 kb of\ DNA. During the early phase of a sequencing project, it is common\ to sequence a single read (approximately 500 bases) off each end of\ a large number of BACs. Later on in the project, these BAC end reads\ can be mapped to the genome sequence.
\\ This track shows these mappings\ in cases where both ends could be mapped. These BAC end pairs can\ be useful for validating the assembly over relatively long ranges. In some\ cases, the BACs are useful biological reagents. This track can also be\ used for determining which BAC contains a given gene, useful information\ for certain wet lab experiments.
\\ A valid pair of BAC end sequences must be\ at least 25 kb but no more than 350 kb away from each other. \ The orientation of the first BAC end sequence must be "+" and\ the orientation of the second BAC end sequence must be "-".
\\ The scoring scheme used for this annotation assigns 1000 to an alignment \ when the BAC end pair aligns to only one location in the genome (after \ filtering). When a BAC end pair or clone aligns to multiple locations, the \ score is calculated as 1500/(number of alignments).
\ \\ BAC end sequences are placed on the assembled sequence using\ Jim Kent's blat program.
\ \\ Additional information about the clone, including how it\ can be obtained, may be found at the \ NCBI Clone Registry. To view the registry entry for a \ specific clone, open the details page for the clone and click on its name at \ the top of the page.
\ map 1 altColor 80,80,80\ color 0,0,0\ exonArrows off\ group map\ longLabel BAC End Pairs\ priority 15\ shortLabel BAC End Pairs\ track bacEndPairs\ type bed 6 +\ visibility hide\ encodeAffyEc1TestisSignal EC1 Sgnl Testis wig 0 62385 Affy Ext Trans Signal (1-base window) (Testis) 0 15 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (1-base window) (Testis)\ parent encodeAffyEcSignal\ priority 15\ shortLabel EC1 Sgnl Testis\ track encodeAffyEc1TestisSignal\ encodeAffyEc1TestisSites EC1 Sites Testis bed 3 . Affy Ext Trans Sites (1-base window) (Testis) 0 15 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (1-base window) (Testis)\ parent encodeAffyEcSites\ priority 15\ shortLabel EC1 Sites Testis\ track encodeAffyEc1TestisSites\ hapMapRelease24YRIRecombMap HapMap YRI bigWig 0.0 72.21 HapMap Release 24 YRI recombination map 0 15 110 110 110 182 182 182 0 0 22 chr1,chr2,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chr10,chr11,chr12,chr13,chr16,chr14,chr15,chr17,chr18,chr19,chr20,chr22,chr21, map 0 color 110,110,110\ configurable on\ longLabel HapMap Release 24 YRI recombination map\ parent otherMaps\ priority 15\ shortLabel HapMap YRI\ subGroups view=other\ track hapMapRelease24YRIRecombMap\ type bigWig 0.0 72.21\ encodePseudogene Pseudogenes genePred ENCODE Pseudogene Predictions - All ENCODE Regions 0 15 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows the pseudogenes located in ENCODE regions generated by\ five different methods—Yale Pipeline, GenCode manual annotation, two \ different UCSC methods, and Gene Identification Signature (GIS)—as well \ as a consensus pseudogenes subtrack based on the \ pseudogenes from all five methods. Datasets are displayed in separate\ subtracks within the annotation and are individually described below.
\\ The annotations are colored as follows:
\\
Type | \Color | \Description | \
---|---|---|
Processed_pseudogene | \pink | \Pseudogenes arising via retrotransposition (exon structure of parent gene lost) | \
Unprocessed_pseudogene | \blue | \Pseudogenes arising via gene duplication (exon structure of parent gene retained) | \
Pseudogene_fragment | \light blue | \Pseudogenes sequences that are single-exon and cannot be confidently \ assigned to either the processed or the duplicated category | \
Undefined | \gray | \\ |
\ This subtrack shows pseudogenes derived from a consensus of the five \ methods listed above. In the pseudogene.org data freeze dated 6 Jan. 2006, \ 201 consensus pseudogenes were found.\ Here, pseudogenes are defined as genomic sequences that are similar to known \ genes but exhibit various inactivating disablements (e.g. premature \ stop codons or frameshifts) in their putative protein-coding regions and are \ flagged as either recently-processed or non-processed.
\ \\ The pseudogene sets were processed as follows:\
\ All pseudogenes in the list have been extensively curated by Adam Frankish and\ Jennifer Harrow at the The Wellcome Trust Sanger Institute.
\ \\ More information about this data set is available from pseudogene.org/ENCODE.\
\ \\ This track shows pseudogenes annotated by the \ HAVANA group \ at the Wellcome Trust Sanger Institute. Pseudogenes have homology to protein\ sequences but generally have a disrupted CDS. For all annotated\ pseudogenes, an active homologous gene (the parent) can be identified\ elsewhere in the genome. Pseudogenes are classified as processed or\ unprocessed.\ \
\ Prior to manual annotation, finished sequence is submitted to an\ automated analysis pipeline for similarity searches and ab initio gene\ predictions. The searches are run on a computer farm and stored in an\ Ensembl MySQL database using the Ensembl analysis pipeline system\ (Searle et al., 2004, Harrow et al., 2006).
\\ A pseudogene is annotated\ where the total length of the protein homology to the genomic sequence\ is >20% of the length of the parent protein or >100 aa in length,\ whichever is shortest. If a gene structure has an ORF but has lost\ the structure of the parent gene, a pseudogene is annotated provided there\ is no evidence of transcription from the pseudogene locus. When an\ open but truncated reading frame is present, other evidence is used\ (for example, 3' genomic polyA tract) to allow classification as a\ pseudogene. When a parent gene has only a single coding exon (e.g.\ olfactory receptors), a small 5' or 3' truncation to the CDS at the\ pseudogene locus (compared to other family members) is sufficient to\ confirm pseudogene status where the truncation is predicted to\ significantly affect secondary structure by the literature and/or\ expert community.
\\ Processed and unprocessed pseudogenes are\ distinguished on the basis of structure and genomic context. \ Processed pseudogenes, which arise via retrotransposition, lose the\ intron-exon structure of the parent gene, often have an A-rich tract\ indicative of the insertion site at their 3' end, and are flanked by\ different genomic sequence to the parent gene. Unprocessed\ pseudogenes, which arise via gene duplication, share both the\ intron-exon structure and flanking genomic sequence with the parent\ gene. Transcribed pseudogenes are indicated by the annotation of a\ pseudogene and transcript variant alongside each other.
\ \\ Harrow J, Denoeud F, Frankish A, Reymond A, Chen CK, Chrast J, Lagarde J, \ Gilbert JG, Storey R, Swarbreck D, et al.\ GENCODE: Producing a reference annotation for ENCODE. \ Genome Biol. 2006;7 Suppl 1:S4.1-9.
\\ Searle SM, Gilbert J, Iyer V, Clamp M.\ The otter annotation system.\ Genome Res. 2004 May;14(5):963-70.
\ \\ This subtrack shows pseudogenes in the ENCODE regions identified by the Yale \ Pseudogene Pipeline. In this analysis, pseudogenes are defined as genomic \ sequences that are similar to known genes with various inactivating \ disablements (e.g. premature stop codons or frameshifts) in their \ putative protein-coding regions. Pseudogenes are flagged as \ recently processed, recently duplicated, or of uncertain origin (either \ ancient fragments or resulting from a single-exon parent).
\ \\
\ All pseudogenes in the list have been manually checked.
\ \\ Zhang Z, Harrison PM, Liu Y, Gerstein M. \ Millions of years of evolution preserved: a comprehensive catalog \ of the processed pseudogenes in the human genome.\ Genome Res. 2003 Dec;13(12):2541-58.
\\ Zheng D, Zhang Z, Harrison PM, Karro J, Carriero N, Gerstein M. \ Integrated pseudogene annotation for human chromosome 22: evidence\ for transcription.\ J Mol Biol. 2005 May 27;349(1):27-45.
\ \\ The Retrogene subtrack shows processed mRNAs that have been inserted back \ into the genome since the mouse/human split. Retrogenes can be \ functional genes that have acquired a promoter from a neighboring gene, \ non-functional pseudogenes, or transcribed pseudogenes.
\ \\
\ The "type" field has four possible values: \
\ These features can be downloaded from the table pseudoGeneLink in many \ formats using the Table Browser option on the menubar.
\ \\ Kent WJ, Baertsch R, Hinrichs A, Miller W, Haussler D.\ Evolution's cauldron: \ Duplication, deletion, and rearrangement in the mouse and human genomes. \ Proc Natl Acad Sci USA. 2003 Sep 30;100(20):11484-9.
\\ Schwartz S, Kent WJ, Smit A, Zhang Z, Baertsch R, Hardison R, \ Haussler D, Miller W.\ Human-mouse alignments with BLASTZ. \ Genome Res. 2003 Jan;13(1):103-7.
\ \\
\
\ This subtrack shows retrotransposed pseudogenes predicted by multiple mapped \ GIS-PETs (gene identification signature-pair end ditags) collected from two \ different cancer cell lines HCT116 and MCF7. A total of 49 non-redundant \ processed pseudogenes predicted in the ENCODE regions are presented in this \ dataset. Each pseudogene is labeled with an ID of the format \ AAA-GISPgene-XX, \ where "AAA" indicates the parental gene name, "GISPgene" is the GIS pseudogene, and "XX" is the unique ID for each pseudogene.
\ \\ PETs were generated from full-length transcripts and \ computationally mapped onto the human genome to demarcate the transcript start \ and end positions. The PETs that mapped to multiple genome locations were \ grouped into PET-based gene families that include parent gene and \ pseudogenes. A representative member—the shortest PET as defined by \ genomic coordinates—was selected from each family. This representative\ PET was aligned to the hg17 genome using in order to identify all the \ putative pseudogenes at the whole genome level. All hits with an \ identity >=70% and coverage >=50% within ENCODE regions were \ reported. In this context, "coverage" refers to alignment coverage of \ the query sequence, i.e. a measure of how complete the predicted pseudogene \ is relative to the query sequence.
\ \\ Pseudogenes were verified by manual examination.
\ \\ These data were generated by the ENCODE Pseudogene Annotation group:\ \ Jennifer Harrow,\ \ \ Wei Chia-Lin,\ \ \ Siew Woh Choo\ \ \ Adam Frankish,\ \ \ Robert Baertsch,\ \ \ France Denoeud,\ \ \ Deyou Zheng,\ \ \ Yontao Lu,\ \ \ Alexandre Reymond,\ \ \ Roderic Guigo Serra,\ \ \ Tom Gingeras,\ \ \ Suganthi Balasubramanian and\ \ \ Mark Gerstein.\ \ encodeGenes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ cdsDrawDefault genomic codons\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ gClass_Known 33,91,51\ gClass_Novel_CDS 33,91,51\ gClass_Novel_transcript 84,188,0\ gClass_Novel_transcript_gencode_conf 33,91,51\ gClass_Processed_pseudogene 200,91,191\ gClass_Pseudogene_fragment 100,91,191\ gClass_Putative 84,188,0\ gClass_Putative_gencode_conf 33,91,51\ gClass_TEC 84,188,0\ gClass_Undefined 163,168,163\ gClass_Unprocessed_pseudogene 0,91,191\ geneClasses Artifact Known Novel_CDS Novel_transcript Novel_transcript_gencode_conf Putative Putative_gencode_conf TEC Processed_pseudogene Unprocessed_pseudogene Pseudogene_fragment Undefined\ group encodeGenes\ itemClassTbl encodePseudogeneClass\ longLabel ENCODE Pseudogene Predictions - All ENCODE Regions\ origAssembly hg17\ priority 15.0\ shortLabel Pseudogenes\ track encodePseudogene\ type genePred\ visibility hide\ encodeGencodeRaceFragsHL60 RACEfrags HL60 genePred Gencode RACEfrags from HL60 cells 0 15 0 0 255 127 127 255 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 0,0,255\ longLabel Gencode RACEfrags from HL60 cells\ parent encodeGencodeRaceFrags\ priority 15\ shortLabel RACEfrags HL60\ track encodeGencodeRaceFragsHL60\ encodeStanfordPromotersSnu182 Stan Pro Snu182 bed 9 + Stanford Promoter Activity (Snu182 cells) 0 15 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (Snu182 cells)\ parent encodeStanfordPromoters\ priority 15\ shortLabel Stan Pro Snu182\ track encodeStanfordPromotersSnu182\ encodeYaleAffyNB4UntrRNATransMap Yale RNA NB4 Un wig -2730 3394 Yale NB4 RNA Transcript Map, Untreated 0 15 90 50 50 172 152 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 0 color 90,50,50\ longLabel Yale NB4 RNA Transcript Map, Untreated\ parent encodeYaleAffyRNATransMap\ priority 15\ shortLabel Yale RNA NB4 Un\ subGroups celltype=nb4 samples=samples\ track encodeYaleAffyNB4UntrRNATransMap\ encodeYaleAffyNB4UntrRNATars Yale TAR NB4 Un bed 3 . Yale NB4 RNA, TAR, Untreated 0 15 90 50 50 172 152 152 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 color 90,50,50\ longLabel Yale NB4 RNA, TAR, Untreated\ parent encodeYaleAffyRNATars\ priority 15\ shortLabel Yale TAR NB4 Un\ subGroups celltype=nb4 samples=samples\ track encodeYaleAffyNB4UntrRNATars\ encodeAffyChIpHl60SitesCebpeHr32 Affy CEBPe RA 32h bed 3 . Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 32hrs) Sites 0 16 200 25 0 227 140 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 200,25,0\ longLabel Affymetrix ChIP/Chip (CEBPe retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 16\ shortLabel Affy CEBPe RA 32h\ subGroups factor=CEBPe time=32h\ track encodeAffyChIpHl60SitesCebpeHr32\ encodeAffyRnaHl60SitesHr08IntronsDistal Affy In Dst HL60 8h bed 4 . Affy Intronic Distal Hl60 8hr Transfrags 0 16 150 105 255 202 180 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 150,105,255\ longLabel Affy Intronic Distal Hl60 8hr Transfrags\ parent encodeNoncodingTransFrags\ priority 16\ shortLabel Affy In Dst HL60 8h\ subGroups region=intronicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr08IntronsDistal\ encodeAffyEc51TestisSignal EC51 Sgnl Testis wig 0 62385 Affy Ext Trans Signal (51-base window) (Testis) 0 16 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (51-base window) (Testis)\ parent encodeAffyEcSignal\ priority 16\ shortLabel EC51 Sgnl Testis\ track encodeAffyEc51TestisSignal\ encodeAffyEc51TestisSites EC51 Site Testis bed 3 . Affy Ext Trans Sites (51-base window) (Testis) 0 16 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (51-base window) (Testis)\ parent encodeAffyEcSites\ priority 16\ shortLabel EC51 Site Testis\ track encodeAffyEc51TestisSites\ bacEndPairsBad Incorrect BAC End Pairs bed 6 + Orphan, Short and Incorrectly Oriented BAC End Pairs 0 16 0 0 0 90 90 90 0 0 0 map 1 altColor 90,90,90\ color 0,0,0\ exonArrows off\ group map\ longLabel Orphan, Short and Incorrectly Oriented BAC End Pairs\ priority 16\ shortLabel Incorrect BAC End Pairs\ track bacEndPairsBad\ type bed 6 +\ visibility hide\ encodeGencodeRaceFragsHela RACEfrags HeLaS3 genePred Gencode RACEfrags from HeLaS3 cells 0 16 125 130 255 190 192 255 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 125,130,255\ longLabel Gencode RACEfrags from HeLaS3 cells\ parent encodeGencodeRaceFrags\ priority 16\ shortLabel RACEfrags HeLaS3\ track encodeGencodeRaceFragsHela\ encodeEgaspFullSgp2 SGP2 genePred SGP2 Gene Predictions 0 16 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel SGP2 Gene Predictions\ parent encodeEgaspFull\ priority 16\ shortLabel SGP2\ track encodeEgaspFullSgp2\ encodeStanfordPromotersU87 Stan Pro U87 bed 9 + Stanford Promoter Activity (U87 cells) 0 16 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (U87 cells)\ parent encodeStanfordPromoters\ priority 16\ shortLabel Stan Pro U87\ track encodeStanfordPromotersU87\ encodeAffyChIpHl60PvalCtcfHr00 Affy CTCF RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 0hrs) P-Value 0 17 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 17\ shortLabel Affy CTCF RA 0h\ subGroups factor=CTCF time=0h\ track encodeAffyChIpHl60PvalCtcfHr00\ encodeAffyRnaHl60SitesHr32IntronsDistal Affy In Dst HL60 32h bed 4 . Affy Intronic Distal Hl60 32hr Transfrags 0 17 125 130 255 190 192 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 125,130,255\ longLabel Affy Intronic Distal Hl60 32hr Transfrags\ parent encodeNoncodingTransFrags\ priority 17\ shortLabel Affy In Dst HL60 32h\ subGroups region=intronicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr32IntronsDistal\ encodeAffyEc1FetalTestisSignal EC1 Sgnl FetalT wig 0 62385 Affy Ext Trans Signal (1-base window) (Fetal Testis) 0 17 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (1-base window) (Fetal Testis)\ parent encodeAffyEcSignal\ priority 17\ shortLabel EC1 Sgnl FetalT\ track encodeAffyEc1FetalTestisSignal\ encodeAffyEc1FetalTestisSites EC1 Sites FetalT bed 3 . Affy Ext Trans Sites (1-base window) (Fetal Testis) 0 17 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (1-base window) (Fetal Testis)\ parent encodeAffyEcSites\ priority 17\ shortLabel EC1 Sites FetalT\ track encodeAffyEc1FetalTestisSites\ bacEndPairsLong Long BAC End Pairs bed 6 + Long BAC End Pairs 0 17 0 0 0 90 90 90 0 0 0 map 1 altColor 90,90,90\ color 0,0,0\ exonArrows off\ group map\ longLabel Long BAC End Pairs\ priority 17\ shortLabel Long BAC End Pairs\ track bacEndPairsLong\ type bed 6 +\ visibility hide\ encodeEgaspFullSgp2U12 SGP2 U12 genePred SGP2 U12 Intron Predictions 0 17 200 132 12 227 193 133 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 200,132,12\ longLabel SGP2 U12 Intron Predictions\ parent encodeEgaspFull\ priority 17\ shortLabel SGP2 U12\ track encodeEgaspFullSgp2U12\ encodeStanfordPromotersAverage Stan Pro Average bed 9 + Stanford Promoter Activity (Average) 0 17 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX, encodeTxLevels 1 longLabel Stanford Promoter Activity (Average)\ parent encodeStanfordPromoters\ priority 17\ shortLabel Stan Pro Average\ track encodeStanfordPromotersAverage\ encodeAffyChIpHl60SitesCtcfHr00 Affy CTCF RA 0h bed 3 . Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 0hrs) Sites 0 18 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 18\ shortLabel Affy CTCF RA 0h\ subGroups factor=CTCF time=0h\ track encodeAffyChIpHl60SitesCtcfHr00\ encodeAffyEc51FetalTestisSignal EC51 Sgnl FetalT wig 0 62385 Affy Ext Trans Signal (51-base window) (Fetal Testis) 0 18 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (51-base window) (Fetal Testis)\ parent encodeAffyEcSignal\ priority 18\ shortLabel EC51 Sgnl FetalT\ track encodeAffyEc51FetalTestisSignal\ encodeAffyEc51FetalTestisSites EC51 Site FetalT bed 3 . Affy Ext Trans Sites (51-base window) (Fetal Testis) 0 18 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (51-base window) (Fetal Testis)\ parent encodeAffyEcSites\ priority 18\ shortLabel EC51 Site FetalT\ track encodeAffyEc51FetalTestisSites\ fosEndPairs Fosmid End Pairs bed 6 + Fosmid End Pairs 0 18 0 0 0 90 90 90 0 0 0
A valid pair of fosmid end sequences must be\ at least 30 kb but no more than 50 kb away from each other. \ The orientation of the first fosmid end sequence must be "+" and\ the orientation of the second fosmid end sequence must be "-".
\ \End sequences were trimmed at the NCBI using\ ssahaCLIP written by Jim Mullikin. Trimmed fosmid end sequences were\ placed on the assembled sequence using Jim Kent's \ blat \ program.
\ \Sequencing of the fosmid ends was done at the \ Eli & Edythe L. Broad\ Institute of MIT and Harvard University. Clones are available through the\ BACPAC Resources\ Center at Children's Hospital Oakland Research Institute (CHORI).\
\ map 1 altColor 90,90,90\ color 0,0,0\ exonArrows off\ group map\ longLabel Fosmid End Pairs\ priority 18\ shortLabel Fosmid End Pairs\ track fosEndPairs\ type bed 6 +\ visibility hide\ encodeEgaspFullSpida SPIDA Exons genePred SPIDA Exon Predictions 0 18 100 12 100 177 133 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 100,12,100\ longLabel SPIDA Exon Predictions\ parent encodeEgaspFull\ priority 18\ shortLabel SPIDA Exons\ track encodeEgaspFullSpida\ encodeYaleAffyNB4RARNATarsIntronsDistal Yale In Dst NB4 bed 4 . Yale Intronic Distal NB4 Retinoic TARs 0 18 100 155 255 177 205 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 100,155,255\ longLabel Yale Intronic Distal NB4 Retinoic TARs\ parent encodeNoncodingTransFrags\ priority 18\ shortLabel Yale In Dst NB4\ subGroups region=intronicDistal celltype=nb4 source=yale\ track encodeYaleAffyNB4RARNATarsIntronsDistal\ encodeAffyChIpHl60PvalCtcfHr02 Affy CTCF RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 2hrs) P-Value 0 19 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 19\ shortLabel Affy CTCF RA 2h\ subGroups factor=CTCF time=2h\ track encodeAffyChIpHl60PvalCtcfHr02\ fosEndPairsBad Bad Fosmid End Pairs bed 6 + Orphan, Short and Incorrectly Oriented Fosmid End Pairs 0 19 0 0 0 90 90 90 0 0 0 map 1 altColor 90,90,90\ color 0,0,0\ exonArrows off\ group map\ longLabel Orphan, Short and Incorrectly Oriented Fosmid End Pairs\ priority 19\ shortLabel Bad Fosmid End Pairs\ track fosEndPairsBad\ type bed 6 +\ visibility hide\ encodeAffyEc1ProstateSignal EC1 Sgnl Prost wig 0 62385 Affy Ext Trans Signal (1-base window) (Prostate) 0 19 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (1-base window) (Prostate)\ parent encodeAffyEcSignal\ priority 19\ shortLabel EC1 Sgnl Prost\ track encodeAffyEc1ProstateSignal\ encodeAffyEc1ProstateSites EC1 Sites Prost bed 3 . Affy Ext Trans Sites (1-base window) (Prostate) 0 19 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (1-base window) (Prostate)\ parent encodeAffyEcSites\ priority 19\ shortLabel EC1 Sites Prost\ track encodeAffyEc1ProstateSites\ encodeEgaspFullTwinscan Twinscan genePred Twinscan Gene Predictions 0 19 12 20 150 133 137 202 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeGenes 1 color 12,20,150\ longLabel Twinscan Gene Predictions\ parent encodeEgaspFull\ priority 19\ shortLabel Twinscan\ track encodeEgaspFullTwinscan\ encodeYaleAffyNB4TPARNATarsIntronsDistal Yale In Dst NB4 TPA bed 4 . Yale Intronic Distal NB4 TPA-Treated TARs 0 19 75 180 255 165 217 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 75,180,255\ longLabel Yale Intronic Distal NB4 TPA-Treated TARs\ parent encodeNoncodingTransFrags\ priority 19\ shortLabel Yale In Dst NB4 TPA\ subGroups region=intronicDistal celltype=nb4 source=yale\ track encodeYaleAffyNB4TPARNATarsIntronsDistal\ encodeAffyRnaSignal Affy RNA Signal wig -1168.00 1686.5 Affymetrix PolyA+ RNA Signal 0 19.02 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows an estimate of RNA abundance (transcription) for\ all ENCODE regions. Retinoic acid-stimulated HL-60 cells were\ harvested after 0, 2, 8, and 32 hours. \ Purified cytosolic polyA+ RNA from unstimulated GM06990 and HeLa cells, \ as well as purified polyA+ RNA from the RA-stimulated HL-60 samples, \ was hybridized to Affymetrix ENCODE oligonucleotide\ tiling arrays, which have 25-mer probes tiled every 22 bp on\ average in the non-repetitive ENCODE regions. \ Composite signals are shown in\ separate subtracks for each cell type and for each of the four \ timepoints for RA-stimulated HL-60.
\\ Data for all biological replicates can be downloaded from Affymetrix in \ wiggle,\ cel, and\ soft formats.
\ \\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options for the subtracks \ are shown at the top of the track description page, followed by a list of \ subtracks. To show only selected subtracks, uncheck the boxes next to \ the tracks that you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing between the different cell types and \ timepoints.
\ \\ The data from replicate arrays were quantile-normalized (Bolstad\ et al., 2003) and all arrays were scaled to a median array intensity\ of 22. Within a sliding 101 bp window centered on each probe, \ an estimate of RNA abundance (signal) was found by calculating the median \ of all pairwise average PM-MM values, where PM is a perfect match and MM is \ a mismatch. Both Kapranov et al. (2002) and Cawley \ et al. (2004) are good references for the experimental methods; \ Cawley et al. also describes the analytical methods.
\ \\ Three independent biological replicates were generated and hybridized\ to duplicate arrays (two technical replicates). Transcribed regions\ were generated from the composite signal track by merging genomic positions\ to which probes are mapped. This merging was based on a 5% false\ positive rate cutoff in negative bacterial controls, a maximum\ gap (MaxGap) of 50 base-pairs and minimum run (MinRun) of 50 base-pairs (see\ the Affy TransFrags track for the merged regions).\
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and the \ Kevin Struhl group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison of normalization methods for high density \ oligonucleotide array data based on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of transcription factor binding sites along \ human chromosomes 21 and 22 points to widespread regulation of noncoding \ RNAs. \ Cell 116(4), 499-509 (2004).
\\ Kapranov, P., Cawley, S. E., Drenkow, J., Bekiranov, S., Strausberg,\ R. L., Fodor, S. P., and Gingeras, T. R. \ Large-scale transcriptional activity in chromosomes 21 and \ 22. \ Science 296(5569), 916-919 (2002).
\ encodeTxLevels 0 autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Affymetrix PolyA+ RNA Signal\ maxHeightPixels 128:16:16\ origAssembly hg16\ priority 19.02\ shortLabel Affy RNA Signal\ spanList 1\ track encodeAffyRnaSignal\ type wig -1168.00 1686.5\ viewLimits 0:25\ visibility hide\ encodeAffyRnaTransfrags Affy Transfrags bed 3 . Affymetrix PolyA+ RNA Transfrags 0 19.03 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows an estimate of RNA abundance (transcription) for\ all ENCODE regions. Retinoic acid-stimulated HL-60 cells were\ harvested after 0, 2, 8, and 32 hours. \ Purified cytosolic polyA+ RNA from unstimulated GM06990 and HeLa cells, \ as well as purified polyA+ RNA from the RA-stimulated HL-60 samples, \ was hybridized to Affymetrix ENCODE oligonucleotide\ tiling arrays, which have 25-mer probes tiled every 22 bp on\ average in the non-repetitive ENCODE regions. \ Clustered sites are shown in\ separate subtracks for each cell type and for each of the four \ timepoints for RA-stimulated HL-60.
\ \\ To show only selected subtracks, uncheck the boxes next to the tracks \ that you wish to hide. \
\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing between the different cell types and \ timepoints.
\ \\ The data from replicate arrays were quantile-normalized (Bolstad\ et al., 2003) and all arrays were scaled to a median array intensity\ of 22. Within a sliding 101 bp window centered on each probe, \ an estimate of RNA abundance (signal) was found by calculating the median \ of all pairwise average PM-MM values, where PM is a perfect match and MM is \ a mismatch. Both Kapranov et al. (2002) and Cawley \ et al. (2004) are good references for the experimental methods; \ Cawley et al. also describes the analytical methods.
\ \\ Three independent biological replicates were generated and hybridized\ to duplicate arrays (two technical replicates). Transcribed regions (see the\ Affy RNA Signal track) were generated from the composite signal track by \ merging genomic positions to which probes are mapped. This merging was based \ on a 5% false positive rate cutoff in negative bacterial controls, a maximum\ gap (MaxGap) of 50 base-pairs and minimum run (MinRun) of 50 base-pairs.\
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and the \ Kevin Struhl group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison of normalization methods for high density \ oligonucleotide array data based on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of transcription factor binding sites along \ human chromosomes 21 and 22 points to widespread regulation of noncoding \ RNAs. \ Cell 116(4), 499-509 (2004).
\\ Kapranov, P., Cawley, S. E., Drenkow, J., Bekiranov, S., Strausberg,\ R. L., Fodor, S. P., and Gingeras, T. R. \ Large-scale transcriptional activity in chromosomes 21 and \ 22. \ Science 296(5569), 916-919 (2002).
\ encodeTxLevels 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Affymetrix PolyA+ RNA Transfrags\ origAssembly hg16\ priority 19.03\ shortLabel Affy Transfrags\ track encodeAffyRnaTransfrags\ type bed 3 .\ visibility hide\ encodeAffyChIpHl60SitesCtcfHr02 Affy CTCF RA 2h bed 3 . Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 2hrs) Sites 0 20 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 20\ shortLabel Affy CTCF RA 2h\ subGroups factor=CTCF time=2h\ track encodeAffyChIpHl60SitesCtcfHr02\ encodeAffyEc51ProstateSignal EC51 Sgnl Prost wig 0 62385 Affy Ext Trans Signal (51-base window) (Prostate) 0 20 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (51-base window) (Prostate)\ parent encodeAffyEcSignal\ priority 20\ shortLabel EC51 Sgnl Prost\ track encodeAffyEc51ProstateSignal\ encodeAffyEc51ProstateSites EC51 Site Prost bed 3 . Affy Ext Trans Sites (51-base window) (Prostate) 0 20 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (51-base window) (Prostate)\ parent encodeAffyEcSites\ priority 20\ shortLabel EC51 Site Prost\ track encodeAffyEc51ProstateSites\ fosEndPairsLong Long Fosmid End Pairs bed 6 + Long Fosmid End Pairs 0 20 0 0 0 90 90 90 0 0 0 map 1 altColor 90,90,90\ color 0,0,0\ exonArrows off\ group map\ longLabel Long Fosmid End Pairs\ priority 20\ shortLabel Long Fosmid End Pairs\ track fosEndPairsLong\ type bed 6 +\ visibility hide\ encodeYaleAffyNB4UntrRNATarsIntronsDistal Yale In Dst NB4 Un bed 4 . Yale Intronic Distal Untreated NB4 TARs 0 20 50 205 255 152 230 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 50,205,255\ longLabel Yale Intronic Distal Untreated NB4 TARs\ parent encodeNoncodingTransFrags\ priority 20\ shortLabel Yale In Dst NB4 Un\ subGroups region=intronicDistal celltype=nb4 source=yale\ track encodeYaleAffyNB4UntrRNATarsIntronsDistal\ encodeAffyChIpHl60PvalCtcfHr08 Affy CTCF RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 8hrs) P-Value 0 21 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 21\ shortLabel Affy CTCF RA 8h\ subGroups factor=CTCF time=8h\ track encodeAffyChIpHl60PvalCtcfHr08\ chr18deletions Chr18 Deletions bed 6 + Chromosome 18 Deletions 0 21 0 0 0 127 127 127 0 0 0 map 1 group map\ longLabel Chromosome 18 Deletions\ priority 21\ shortLabel Chr18 Deletions\ track chr18deletions\ type bed 6 +\ visibility hide\ encodeAffyEc1OvarySignal EC1 Sgnl Ovary wig 0 62385 Affy Ext Trans Signal (1-base window) (Ovary) 0 21 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (1-base window) (Ovary)\ parent encodeAffyEcSignal\ priority 21\ shortLabel EC1 Sgnl Ovary\ track encodeAffyEc1OvarySignal\ encodeAffyEc1OvarySites EC1 Sites Ovary bed 3 . Affy Ext Trans Sites (1-base window) (Ovary) 0 21 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (1-base window) (Ovary)\ parent encodeAffyEcSites\ priority 21\ shortLabel EC1 Sites Ovary\ track encodeAffyEc1OvarySites\ encodeYaleAffyNeutRNATarsAllIntronsDistal Yale In Dst Neu bed 4 . Yale Intronic Distal Neutrophil TARs 0 21 25 230 255 140 242 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 25,230,255\ longLabel Yale Intronic Distal Neutrophil TARs\ parent encodeNoncodingTransFrags\ priority 21\ shortLabel Yale In Dst Neu\ subGroups region=intronicDistal celltype=neut source=yale\ track encodeYaleAffyNeutRNATarsAllIntronsDistal\ encodeAffyChIpHl60SitesCtcfHr08 Affy CTCF RA 8h bed 3 . Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 8hrs) Sites 0 22 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 22\ shortLabel Affy CTCF RA 8h\ subGroups factor=CTCF time=8h\ track encodeAffyChIpHl60SitesCtcfHr08\ encodeBuFirstExon BU First Exon bed 12 + Boston University First Exon Activity 0 22 0 0 0 127 127 127 0 0 10 chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX,\ This track displays expression levels of computationally identified\ first exons and a constitutive exon of genes in ENCODE regions,\ based on the real competitive Polymerase Chain\ Reaction (rcPCR) technique described in Ding\ et al. (2003). \ Expression levels\ are indicated by color, ranging from black (no expression) to red (high\ expression).
\\ Experiments were performed on total RNA samples of ten\ normal human tissues purchased from Clontech (Palo Alto, CA): \ cerebral cortex, colon, heart, kidney, liver, lung,\ skeletal muscle, spleen, stomach, and testis.
\\ The name for each alternative transcript starts with the gene name,\ followed by an identifier for the alternative first exon or the\ constitutive exon. For example, for gene CAV1, there are three\ alternative first exons (CAV1-E1A, CAV1-E1B, and CAV1-E1C) and the\ third exon is chosen as the constitutively expressed exon (CAV1-E3).
\ \\ Alternative transcription start sites (TSS) for 20 ENCODE genes were predicted\ using PromoSer, an in-house computational tool.\ PromoSer computationally identifies the TSS by considering alignments\ of a large number of partial and full-length mRNA sequences and ESTs to\ genomic DNA, with provision for alternative promoters. In PromoSer, the\ treatment of alternative first exons (or the resulting TSSs) is as\ follows: \
\ For each gene, all alternative first exons were identified based on manual\ selection of PromoSer predictions. An exon that is\ shared by all transcripts (called the constitutive exon) was also selected. \ The selection process involved visually\ examining the structure of the cluster, preferably using the latest\ data available on UCSC, to identify distinct first exons that were well\ formed (having multiple supporting sequences) and had no evidence\ (especially from newer sequences) of additional sequence that made\ them internal exons. After the first exon was identified, a subsequence \ (between 100-300 bases) was selected for use in the experiment. The\ selection process avoided repeat sequences as much as possible and if\ the two first exons partially overlapped, the non-overlapping region was\ selected. If those conditions caused the remaining sequence to be too\ short (or the first exon itself was too short), a junction with the\ second exon was used. A constitutive exon was also selected that was\ included in all (or most) of the alternative transcripts and \ suitable sequences were then extracted as above (no exon junctions are used).\
\\ The absolute expression levels of all exons were individually quantified\ by rcPCR by designing four assays with PCR amplicons corresponding to\ each exon. \ Amplicons were designed according to transcript sequences\ and can span a large distance on the genomic sequence. In addition,\ some amplicons were designed across the junctions between first exons\ and the constitutive second exons, and thus these amplicons may overlap\ with the amplicons that correspond to the constitutive second exons. \
\\ The rcPCR technique combined competitive PCR and matrix-assisted laser\ desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS)\ for gene expression analysis. To measure the expression level of a\ gene, an oligonucleotide standard (60-80 bases) of known concentration,\ complementary to the target sequence with a single base\ mismatch in the middle, was added as the competitor for PCR. The gene of\ interest and the oligonucleotide standard resembled two alleles of a\ heterozygous locus in an allele frequency analysis experiment, and thus\ could be quantified by the high-throughput MALDI-TOF MS\ based MassARRAY system (Sequenom Inc.).
\\ After PCR, a base extension\ reaction was carried out with an extension primer, a ThermoSequenase and\ a mixture of ddNTPs/dNTP (for example, a mixture of\ ddA, ddC, ddT, and dG). The extension primer annealed the immediate\ 5’-upstream sequence of the mismatch position. Depending on the nature\ of the mismatch and the mixture composition of ddNTPs/dNTP, one or two\ bases were added to the extension primer, producing two extension\ products with one base-length difference. These two extension products\ were then detected and quantified by MALDI-TOF MS.
\\ Expression ratios (e.g. CAV1-E1A/CAV1-E3, CAV1-E1B/CAV1-E3,\ CAV1-E1C/CAV1-E3) indicate the relative abundance of \ alternative first exons. \ 18S rRNA was used for exon absolute expression\ normalization among different tissues.
\\ Values shown on this track represent the relative abundance of the\ alternative first exons with respect to the 18S rRNA. The raw values have\ been log10 transformed and scaled to show graded colors on the browser.
\ \\ One biological replicate was performed for each gene. Two to four\ competitor concentrations were used to detect the expression level\ of each exon. Two to six technical replicates were performed for\ each competitor concentration. One more biological replicate will be\ performed in the future.
\ \\ Data generation and analysis for this track were performed by \ ZLAB\ at Boston University. The following people contributed: Shengnan Jin,\ Anason Halees, Heather Burden, Yutao Fu, Ulas Karaoz, Yong Yu, Chunming\ Ding, Charles R. Cantor, and Zhiping Weng.
\ \\ Ding, C. and Cantor, C.R. \ A\ high-throughput gene expression analysis technique using competitive PCR and \ matrix-assisted laser desorption ionization time-of-flight MS.\ Proc Natl Acad Sci U S A 100(6), 3059-64 (2003).
\Ding, C. and Cantor, C.R. \ Direct molecular haplotyping of long-range genomic DNA with\ M1-PCR. \ Proc Natl Acad Sci U S A 100(13), 7449-53 (2003).
\Halees, A.S., Leyfer, D. and Weng, Z. \ PromoSer: A large-scale mammalian promoter and transcription \ start site identification service. \ Nucleic Acids Res. 31(13), 3554-9 (2003).
\Halees, A.S. and Weng, Z. \ PromoSer: improvements to the algorithm, visualization and\ accessibility.\ Nucleic Acids Res., 32, W191-W194 (2004).
\ encodeTxLevels 1 chromosomes chr11,chr13,chr15,chr16,chr19,chr2,chr5,chr7,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ itemRgb on\ longLabel Boston University First Exon Activity\ origAssembly hg16\ priority 22.0\ shortLabel BU First Exon\ track encodeBuFirstExon\ type bed 12 +\ visibility hide\ encodeAffyEc51OvarySignal EC51 Sgnl Ovary wig 0 62385 Affy Ext Trans Signal (51-base window) (Ovary) 0 22 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 0 color 128,0,128\ longLabel Affy Ext Trans Signal (51-base window) (Ovary)\ parent encodeAffyEcSignal\ priority 22\ shortLabel EC51 Sgnl Ovary\ track encodeAffyEc51OvarySignal\ encodeAffyEc51OvarySites EC51 Site Ovary bed 3 . Affy Ext Trans Sites (51-base window) (Ovary) 0 22 128 0 128 191 127 191 0 0 2 chr21,chr22, encodeTxLevels 1 color 128,0,128\ longLabel Affy Ext Trans Sites (51-base window) (Ovary)\ parent encodeAffyEcSites\ priority 22\ shortLabel EC51 Site Ovary\ track encodeAffyEc51OvarySites\ isochores Isochores bed 4 + GC-Rich (dark) and AT-Rich (light) Isochores 0 22 0 0 0 127 127 127 1 0 0Isochores describe a region of a chromosome where the CG-content is\ either higher or lower than the whole genome average (42%). A CG-rich\ isochore is given a dark color, while a CG-poor isochore is a light\ color.
\Isochores were determined by first calculating the CG-content of 100,000 bp\ windows across the genome. These windows were either labeled H or L\ depending on whether the window contained a higher or lower GC-content\ than average. A two-state HMM was created in which one state represented\ GC-rich regions, and the other GC-poor. It was trained using the first 12\ chromosomes. The trained HMM was used to generate traces over all chromosomes.\ These traces define the boundaries of the isochores,\ and their type (GC-rich or AT-rich).
\ map 1 group map\ longLabel GC-Rich (dark) and AT-Rich (light) Isochores\ priority 22\ shortLabel Isochores\ spectrum on\ track isochores\ type bed 4 +\ visibility hide\ encodeYaleAffyPlacRNATarsIntronsDistal Yale In Dst Plac bed 4 . Yale Intronic Distal Placental TARs 0 22 0 255 255 127 255 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 0,255,255\ longLabel Yale Intronic Distal Placental TARs\ parent encodeNoncodingTransFrags\ priority 22\ shortLabel Yale In Dst Plac\ subGroups region=intronicDistal celltype=plac source=yale\ track encodeYaleAffyPlacRNATarsIntronsDistal\ encodeAffyChIpHl60PvalCtcfHr32 Affy CTCF RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 32hrs) P-Value 0 23 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 23\ shortLabel Affy CTCF RA 32h\ subGroups factor=CTCF time=32h\ track encodeAffyChIpHl60PvalCtcfHr32\ encodeAffyRnaGm06990SitesIntergenicProximal Affy Ig Prx GM06990 bed 4 . Affymetrix Intergenic Proximal GM06990 Transfrags 0 23 131 191 7 193 223 131 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 131,191,7\ longLabel Affymetrix Intergenic Proximal GM06990 Transfrags\ parent encodeNoncodingTransFrags\ priority 23\ shortLabel Affy Ig Prx GM06990\ subGroups region=intergenicProximal celltype=gm06990 source=affy\ track encodeAffyRnaGm06990SitesIntergenicProximal\ encodeAffyEc1HeLaC1S3Signal EC1 Sgnl HeLa wig 0 62385 Affy Ext Trans Signal (1-base window) (HeLa C1S3) 0 23 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (1-base window) (HeLa C1S3)\ parent encodeAffyEcSignal\ priority 23\ shortLabel EC1 Sgnl HeLa\ track encodeAffyEc1HeLaC1S3Signal\ encodeAffyEc1HeLaC1S3Sites EC1 Sites HeLa bed 3 . Affy Ext Trans Sites (1-base window) (HeLa C1S3) 0 23 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (1-base window) (HeLa C1S3)\ parent encodeAffyEcSites\ priority 23\ shortLabel EC1 Sites HeLa\ track encodeAffyEc1HeLaC1S3Sites\ gcPercent GC% 20K bed 4 + Percentage GC in 20,000-Base Windows 0 23 0 0 0 127 127 127 1 0 0\ The GC percent track shows the percentage of G (guanine) and C (cytosine) bases\ in a 20,000 base window. Windows with high GC content are drawn more darkly \ than windows with low GC content. High GC content is typically associated with \ gene-rich areas.\
\\ This track was generated at UCSC.\ map 1 group map\ longLabel Percentage GC in 20,000-Base Windows\ priority 23\ shortLabel GC% 20K\ spectrum on\ track gcPercent\ type bed 4 +\ visibility hide\ encodeRikenCage Riken CAGE bedGraph 4 Riken CAGE - Predicted Gene Start Sites 0 23 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,
\ This track shows the number of 5' cap analysis gene expression (CAGE) tags \ that map to the genome on the "plus" and "minus" strands at \ a specific location. For clarity, only the first 5' nucleotide in the tag \ (relative to the transcript direction) is considered. Areas in which many tags \ map to the same region may indicate a significant transcription start site.
\ \\ The position of the first 5' nucleotide in the tag is represented by a solid\ block. The height of the block indicates the number of 5' cDNA starts that map\ at that location.
\\ This composite annotation track contains multiple subtracks that \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link. To display only selected subtracks, uncheck the \ boxes next to the tracks you wish to hide.
\ \\ The CAGE tags are sequenced from the 5' ends of full-length cDNAs produced using RIKEN full-length cDNA technology. To create the tag, a linker was \ attached to the 5' end of full-length cDNAs which were selected by cap \ trapping. The first 20 bp of the cDNA were cleaved using class II restriction \ enzymes, followed by PCR amplification and then concatamers of the resulting\ 32 bp tags were formed for more efficient sequencing. For more information on \ CAGE analysis, see Shiraki et al. (2003) below. Refer to the \ RIKEN website\ for information about RIKEN full-length cDNA technologies. The mapping \ methodology employed in this annotation will be described in upcoming \ publications.
\ \\ The techniques used to verify these data will be described in upcoming \ publications.
\ \\ These data were contributed by the Functional Annotation of Mouse \ (FANTOM) \ Consortium, RIKEN Genome Science Laboratory and \ RIKEN Genome Exploration Research Group \ (Genome Network Project Core Group).
\\ FANTOM Consortium: P. Carninci, T. Kasukawa, S. Katayama, Gough, \ M. Frith, N. Maeda, R. Oyama, T. Ravasi, B. Lenhard, C. Wells, R. \ Kodzius, K. Shimokawa, V. B. Bajic, S. E. Brenner, S. Batalov, A. R. R. \ Forrest, M. Zavolan, M. J. Davis, L. G. Wilming, V. Aidinis, J. Allen, \ A. Ambesi-Impiombato, R. Apweiler, R. N. Aturaliya, T. L. Bailey, M. \ Bansal, K. W. Beisel, T. Bersano, H. Bono, A. M. Chalk, K. P. Chiu, V. \ Choudhary, A. Christoffels, D. R. Clutterbuck, M. L. Crowe, E. Dalla, \ B. P. Dalrymple, B. de Bono, G. Della Gatta, D. di Bernardo, T. Down, \ P. Engstrom, M. Fagiolini, G. Faulkner, C. F. Fletcher, T. Fukushima, \ M. Furuno, S. Futaki, M. Gariboldi, P. Georgii-Hemming, T. R. Gingeras, \ T. Gojobori, R. E. Green, S. Gustincich, M. Harbers, V. Harokopos, Y. \ Hayashi, S. Henning, T. K. Hensch, N. Hirokawa, D. Hill, L. Huminiecki, \ M. Iacono, K. Ikeo, A. Iwama, T. Ishikawa, M. Jakt, A. Kanapin, M. \ Katoh, Y. Kawasawa, J. Kelso, H. Kitamura, H. Kitano, G. Kollias, S. \ P. T. Krishnan, A.F. Kruger, K. Kummerfeld, I. V. Kurochkin, \ L. F. Lareau, L. Lipovich, J. Liu, S. Liuni, S. McWilliam, M. Madan \ Babu, M. Madera, L. Marchionni, H. Matsuda, S. Matsuzawa, H. Miki, F. \ Mignone, S. Miyake, K. Morris, S. Mottagui-Tabar, N. Mulder, N. Nakano, \ H. Nakauchi, P. Ng, R. Nilsson, S. Nishiguchi, S. Nishikawa, F. Nori, \ O. Ohara, Y. Okazaki, V. Orlando, K. C. Pang, W. J. Pavan, G. Pavesi, \ G. Pesole, N. Petrovsky, S. Piazza, W. Qu, J. Reed, J. F. Reid, B. Z. \ Ring, M. Ringwald, B. Rost, Y. Ruan, S. Salzberg, A. Sandelin, C. \ Schneider, C. Schoenbach, K. Sekiguchi, C. A. M. Semple, S. Seno, \ L. Sessa, Y. Sheng, Y. Shibata, H. Shimada, K. Shimada, B. Sinclair, S. \ Sperling, E. Stupka, K. Sugiura, R. Sultana, Y. Takenaka, K. Taki, K. \ Tammoja, S. L. Tan, S. Tang, M. S. Taylor, J. Tegner, S. A. Teichmann, \ H. R. Ueda, E. van Nimwegene, R. Verardo, C. L. Wei, K. Yagi, H. \ Yamanishi, E. Zabarovsky, S. Zhu, A. Zimmer, W. Hide, C. Bult, S. M. \ Grimmond, R. D. Teasdale, E. T. Liu, V. Brusic, J. Quackenbush, C. \ Wahlestedt, J. Mattick, D. Hume.
\\ RIKEN Genome Exploration Research Group: C. Kai, D. Sasaki, Y. \ Tomaru, S. Fukuda, M. Kanamori-Katayama, M. Suzuki, J. Aoki, T. \ Arakawa, J. Iida, K. Imamura, M. Itoh, T. Kato, H. Kawaji, N. \ Kawagashira, T. Kawashima, M. Kojima, S. Kondo, H. Konno, K. Nakano, N. \ Ninomiya, T. Nishio, M. Okada, C. Plessy, K. Shibata, T. Shiraki, S. \ Suzuki, M. Tagami, K Waki, A. Watahiki, Y. Okamura-Oho, H. Suzuki, J. \ Kawai.
\\ General Organizer: Y. Hayashizaki\ \
\ Shiraki, T., Kondo, S., Katayama, S., Waki, K., Kasukawa, T., Kawaji, H., \ Kodzius, R., Watahiki, A., Nakamura, M. et al.\ Cap analysis gene expression for high-throughput analysis of \ transcriptional starting point and identification of promoter usage.\ Proc Natl Acad Sci U S A. 100(26), 15776-81 (2003).
\ encodeTxLevels 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Riken CAGE - Predicted Gene Start Sites\ maxHeightPixels 128:16:16\ maxLimit 4316\ minLimit 1\ origAssembly hg16\ priority 23.0\ shortLabel Riken CAGE\ track encodeRikenCage\ type bedGraph 4\ viewLimits 1.0:10.0\ visibility hide\ windowingFunction mean\ gc5BaseBw GC Percent bigWig 0 100 GC Percent in 5-Base Windows 0 23.5 0 0 0 128 128 128 0 0 0\ This track may be configured in a variety of ways to highlight different aspects \ of the displayed information. Click the "Graph configuration help" link\ for an explanation of the configuration options.\ \
The data and presentation of this graph were prepared by\ Hiram Clawson.\ \ map 0 altColor 128,128,128\ autoScale Off\ color 0,0,0\ graphTypeDefault Bar\ gridDefault OFF\ group map\ html gc5Base\ longLabel GC Percent in 5-Base Windows\ maxHeightPixels 128:36:16\ priority 23.5\ shortLabel GC Percent\ track gc5BaseBw\ type bigWig 0 100\ viewLimits 30:70\ visibility hide\ windowingFunction Mean\ gc5Base GC Percent wig 0 100 Percentage GC in 5-Base Windows 0 23.5 0 0 0 128 128 128 0 0 0
\ This track may be configured in a variety of ways to highlight different aspects \ of the displayed information. Click the "Graph configuration help" link\ for an explanation of the configuration options.\ \
The data and presentation of this graph were prepared by\ Hiram Clawson.\ \ map 0 altColor 128,128,128\ autoScale Off\ color 0,0,0\ graphTypeDefault Bar\ gridDefault OFF\ group map\ longLabel Percentage GC in 5-Base Windows\ maxHeightPixels 128:36:16\ priority 23.5\ shortLabel GC Percent\ spanList 5,1000\ track gc5Base\ type wig 0 100\ viewLimits 30:70\ visibility hide\ windowingFunction Mean\ gc5Win20K GC% Win20K wig 0 100 GC Percent in 5 Bases Smoothed to 20,000-Base Windows 0 23.6 0 128 255 255 128 0 0 0 0
\ The GC percent track shows the percentage of G (guanine) and C (cytosine) bases\ in a 5-base windows. High GC content is typically associated with\ gene-rich areas.\
\\ This track was produced by measuring GC percent in 5 bases, then running\ a 20,000 base smoothing window over that data to indicate the average\ GC percent in the 20,000 base window, at each 5 base interval.\ Thus, each 5 base point in the graph represents the average GC percent\ in the next 20,000 bases.\
\\ This track may be configured in a variety of ways to highlight different aspects \ of the displayed information. Click the \ Graph \ configuration help link for an explanation of the configuration options.
\ \The data and presentation of this graph were prepared by\ Hiram Clawson (hiram@soe.ucsc.\ edu).\ map 0 altColor 255,128,0\ autoScale Off\ color 0,128,255\ graphTypeDefault Bar\ gridDefault OFF\ group map\ longLabel GC Percent in 5 Bases Smoothed to 20,000-Base Windows\ maxHeightPixels 128:36:16\ priority 23.6\ shortLabel GC% Win20K\ spanList 5\ track gc5Win20K\ type wig 0 100\ viewLimits 30:70\ visibility hide\ windowingFunction Mean\ qualityBw Quality Scores bigWig 0 100 Human Sequencing Quality Scores 0 23.6 0 128 255 255 128 0 0 0 0
\ The Quality Scores track shows the sequencing quality score \ (range: 0 to 99) of each base in the assembly. \ The height at each position of the track \ indicates the quality of the base. \ When zoomed out to a large range, the heights reflect the averaged scores. \ Scores of 40 or higher reflect high confidence in the sequence (with an error rate of less than \ 1/10,000); scores of 20 or higher reflect reasonable confidence (of working draft \ quality).\
\\ This track may be configured in a variety of ways to highlight different aspects \ of the displayed information. Click the \ Graph \ configuration help link for an explanation of the configuration options.
\ \\ The quality scores were provided as part of the human assembly. \ The database representation and graphical display code were written by\ Hiram Clawson.\ map 0 altColor 255,128,0\ autoScale Off\ color 0,128,255\ graphTypeDefault Bar\ gridDefault OFF\ group map\ html quality\ longLabel $Organism Sequencing Quality Scores\ maxHeightPixels 128:36:16\ priority 23.6\ shortLabel Quality Scores\ track qualityBw\ type bigWig 0 100\ visibility hide\ windowingFunction Mean\ quality Quality Scores wig 0 100 Human Sequencing Quality Scores 0 23.6 0 128 255 255 128 0 0 0 0
\ The Quality Scores track shows the sequencing quality score \ (range: 0 to 99) of each base in the assembly. \ The height at each position of the track \ indicates the quality of the base. \ When zoomed out to a large range, the heights reflect the averaged scores. \ Scores of 40 or higher reflect high confidence in the sequence (with an error rate of less than \ 1/10,000); scores of 20 or higher reflect reasonable confidence (of working draft \ quality).\
\\ This track may be configured in a variety of ways to highlight different aspects \ of the displayed information. Click the \ Graph \ configuration help link for an explanation of the configuration options.
\ \\ The quality scores were provided as part of the human assembly. \ The database representation and graphical display code were written by\ Hiram Clawson.\ map 0 altColor 255,128,0\ autoScale Off\ color 0,128,255\ graphTypeDefault Bar\ gridDefault OFF\ group map\ longLabel $Organism Sequencing Quality Scores\ maxHeightPixels 128:36:16\ priority 23.6\ shortLabel Quality Scores\ spanList 1,1024\ track quality\ type wig 0 100\ visibility hide\ windowingFunction Mean\ gc5Win100K GC% Win100K wig 0 100 GC Percent in 5 Bases Smoothed to 100,000-Base Windows 0 23.7 0 128 255 255 128 0 0 0 0 map 0 altColor 255,128,0\ autoScale Off\ color 0,128,255\ graphTypeDefault Bar\ gridDefault OFF\ group map\ longLabel GC Percent in 5 Bases Smoothed to 100,000-Base Windows\ maxHeightPixels 128:36:16\ priority 23.7\ shortLabel GC% Win100K\ spanList 5\ track gc5Win100K\ type wig 0 100\ viewLimits 30:70\ visibility hide\ windowingFunction Mean\ encodeAffyChIpHl60SitesCtcfHr32 Affy CTCF RA 32h bed 3 . Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 32hrs) Sites 0 24 175 50 0 215 152 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 175,50,0\ longLabel Affymetrix ChIP/Chip (CTCF retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 24\ shortLabel Affy CTCF RA 32h\ subGroups factor=CTCF time=32h\ track encodeAffyChIpHl60SitesCtcfHr32\ encodeAffyRnaHeLaSitesIntergenicProximal Affy Ig Prx HeLa bed 4 . Affymetrix Intergenic Proximal Hela Transfrags 0 24 137 185 19 196 220 137 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 137,185,19\ longLabel Affymetrix Intergenic Proximal Hela Transfrags\ parent encodeNoncodingTransFrags\ priority 24\ shortLabel Affy Ig Prx HeLa\ subGroups region=intergenicProximal celltype=hela source=affy\ track encodeAffyRnaHeLaSitesIntergenicProximal\ encodeAffyEc51HeLaC1S3Signal EC51 Sgnl HeLa wig 0 62385 Affy Ext Trans Signal (51-base window) (HeLa C1S3) 0 24 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (51-base window) (HeLa C1S3)\ parent encodeAffyEcSignal\ priority 24\ shortLabel EC51 Sgnl HeLa\ track encodeAffyEc51HeLaC1S3Signal\ encodeAffyEc51HeLaC1S3Sites EC51 Site HeLa bed 3 . Affy Ext Trans Sites (51-base window) (HeLa C1S3) 0 24 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (51-base window) (HeLa C1S3)\ parent encodeAffyEcSites\ priority 24\ shortLabel EC51 Site HeLa\ track encodeAffyEc51HeLaC1S3Sites\ gcPercentSmall GC % 100b bed 4 + Percentage GC in 100-Base Windows 0 24 0 0 0 127 127 127 1 0 0 map 1 group map\ longLabel Percentage GC in 100-Base Windows\ priority 24\ shortLabel GC % 100b\ spectrum on\ track gcPercentSmall\ type bed 4 +\ visibility hide\ encodeRikenCageMappedTagsScore Riken CAGE MT bedGraph 4 Riken CAGE Mapped Tags overlap count - TEST TRACK ONLY 0 24 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,
TEST TRACK ONLY
\\ This track shows the number of 5' CAGE tags mapped to the genome on the \ plus and minus strands. At each base in the genome, the count is the number\ of CAGE tags that overlap at this base.\
\ CAGE tags are sequenced 5' ends of full length cDNA (using RIKEN fl \ cDNA technology). Mapping methodology will be described in upcoming \ publications\ \
\ Verification will be described in upcoming publications\ \
\ \ The FANTOM Consortium,\ Riken Genome Science Laboratory and Riken Genome \ Exploration Research Group (Genome Network Project Core Group)\ \ FANTOM Consortium: P. Carninci, T. Kasukawa, S. Katayama, Gough, \ M. Frith, N. Maeda, R. Oyama, T. Ravasi, B. Lenhard, C. Wells, R. \ Kodzius, K. Shimokawa, V. B. Bajic, S. E. Brenner, S. Batalov, A. R. R. \ Forrest, M. Zavolan, M. J. Davis, L. G. Wilming, V. Aidinis, J. Allen, \ A. Ambesi-Impiombato, R. Apweiler, R. N. Aturaliya, T. L. Bailey, M. \ Bansal, K. W. Beisel, T. Bersano, H. Bono, A. M. Chalk, K. P. Chiu, V. \ Choudhary, A. Christoffels, D. R. Clutterbuck, M. L. Crowe, E. Dalla, \ B. P. Dalrymple, B. de Bono, G. Della Gatta, D. di Bernardo, T. Down, \ P. Engstrom, M. Fagiolini, G. Faulkner, C. F. Fletcher, T. Fukushima, \ M. Furuno, S. Futaki, M. Gariboldi, P. Georgii-Hemming, T. R. Gingeras, \ T. Gojobori, R. E. Green, S. Gustincich, M. Harbers, V. Harokopos, Y. \ Hayashi, S. Henning, T. K. Hensch, N. Hirokawa, D. Hill, L. Huminiecki, \ M. Iacono, K. Ikeo, A. Iwama, T. Ishikawa, M. Jakt, A. Kanapin, M. \ Katoh, Y. Kawasawa, J. Kelso, H. Kitamura, H. Kitano, G. Kollias, S. \ P. T. Krishnan, A.F. Kruger, S.K. Kummerfeld, I. V. Kurochkin, \ L. F. Lareau, L. Lipovich, J. Liu, S. Liuni, S. McWilliam, M. Madan \ Babu, M. Madera, L. Marchionni, H. Matsuda, S. Matsuzawa, H. Miki, F. \ Mignone, S. Miyake, K. Morris, S. Mottagui-Tabar, N. Mulder, N. Nakano, \ H. Nakauchi, P. Ng, R. Nilsson, S. Nishiguchi, S. Nishikawa, F. Nori, \ O. Ohara, Y. Okazaki, V. Orlando, K. C. Pang, W. J. Pavan, G. Pavesi, \ G. Pesole, N. Petrovsky, S. Piazza, W. Qu, J. Reed, J. F. Reid, B. Z. \ Ring, M. Ringwald, B. Rost, Y. Ruan, S. Salzberg, A. Sandelin, C. \ Schneider, C. Sch.ANvnbach, K. Sekiguchi, C. A. M. Semple, S. Seno, \ L. Sessa, Y. Sheng, Y. Shibata, H. Shimada, K. Shimada, B. Sinclair, S. \ Sperling, E. Stupka, K. Sugiura, R. Sultana, Y. Takenaka, K. Taki, K. \ Tammoja, S. L. Tan, S. Tang, M. S. Taylor, J. Tegner, S. A. Teichmann, \ H. R. Ueda, E. van Nimwegene, R. Verardo, C. L. Wei, K. Yagi, H. \ Yamanishi, E. Zabarovsky, S. Zhu, A. Zimmer, W. Hide, C. Bult, S. M. \ Grimmond, R. D. Teasdale, E. T. Liu, V. Brusic, J. Quackenbush, C. \ Wahlestedt, J. Mattick, D. Hume. \ \ RIKEN Genome Exploration Research Group: C. Kai, D. Sasaki, Y. \ Tomaru, S. Fukuda, M. Kanamori-Katayama, M. Suzuki, J. Aoki, T. \ Arakawa, J. Iida, K. Imamura, M. Itoh, T. Kato, H. Kawaji, N. \ Kawagashira, T. Kawashima, M. Kojima, S. Kondo, H. Konno, K. Nakano, N. \ Ninomiya, T. Nishio, M. Okada, C. Plessy, K. Shibata, T. Shiraki, S. \ Suzuki, M. Tagami, K Waki, A. Watahiki, Y. Okamura-Oho, H. Suzuki, J. \ Kawai, \ \ General Organizer: Y. Hayashizaki\ \
\ Cap analysis gene expression for high-throughput analysis of \ transcriptional starting point and identification of promoter usage.\ Shiraki T, Kondo S, Katayama S, Waki K, Kasukawa T, Kawaji H, Kodzius \ R, Watahiki A, Nakamura M, Arakawa T, Fukuda S, Sasaki D, Podhajska A, \ Harbers M, Kawai J, Carninci P, Hayashizaki Y.\ Proc Natl Acad Sci U S A. 2003 Dec 23;100(26):15776-81\ encodeTxLevels 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Riken CAGE Mapped Tags overlap count - TEST TRACK ONLY\ maxHeightPixels 128:16:16\ maxLimit 5945\ minLimit 1\ origAssembly hg16\ priority 24.0\ shortLabel Riken CAGE MT\ track encodeRikenCageMappedTagsScore\ type bedGraph 4\ viewLimits 0.0:10.0\ visibility hide\ windowingFunction mean\ encodeAffyChIpHl60PvalH3K27me3Hr00 Affy H3K27me3 RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 0hrs) P-Value 0 25 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 25\ shortLabel Affy H3K27me3 RA 0h\ subGroups factor=H3K27me3 time=0h\ track encodeAffyChIpHl60PvalH3K27me3Hr00\ encodeAffyRnaHl60SitesHr00IntergenicProximal Affy Ig Prx HL60 0h bed 4 . Affymetrix Intergenic Proximal HL60 Transfrags 0 25 143 179 31 199 217 143 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 143,179,31\ longLabel Affymetrix Intergenic Proximal HL60 Transfrags\ parent encodeNoncodingTransFrags\ priority 25\ shortLabel Affy Ig Prx HL60 0h\ subGroups region=intergenicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr00IntergenicProximal\ encodeAffyEc1GM06990Signal EC1 Sgnl GM0699 wig 0 62385 Affy Ext Trans Signal (1-base window) (GM06990) 0 25 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (1-base window) (GM06990)\ parent encodeAffyEcSignal\ priority 25\ shortLabel EC1 Sgnl GM0699\ track encodeAffyEc1GM06990Signal\ encodeAffyEc1GM06990Sites EC1 Sites GM0699 bed 3 . Affy Ext Trans Sites (1-base window) (GM06990) 0 25 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (1-base window) (GM06990)\ parent encodeAffyEcSites\ priority 25\ shortLabel EC1 Sites GM0699\ track encodeAffyEc1GM06990Sites\ GCwiggle GC Samples sample GC Percent Sample Track (every 20,000 bases) 0 25 0 0 0 127 127 127 0 0 1 chr22, map 0 chromosomes chr22,\ group map\ longLabel GC Percent Sample Track (every 20,000 bases)\ priority 25\ shortLabel GC Samples\ track GCwiggle\ type sample\ visibility hide\ encodeRikenCageMappedTags Riken CAGE Tags bed 9 . Riken CAGE Mapped Tags - TEST TRACK ONLY 0 25 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,
\ This track shows 5' CAGE tags mapped to the genome - TEST TRACK ONLY\ \
\ CAGE tags are sequenced 5' ends of full length cDNA (using RIKEN fl \ cDNA technology). Mapping methodology will be described in upcoming \ publications\ \
\ Verification will be described in upcoming publications.
\ \\ These data were contributed by the Functional Annotation of Mouse (FANTOM) \ Consortium, Riken Genome Science Laboratory and \ Riken Genome Exploration Research Group \ (Genome Network Project Core Group).
\\ FANTOM Consortium: P. Carninci, T. Kasukawa, S. Katayama, Gough, \ M. Frith, N. Maeda, R. Oyama, T. Ravasi, B. Lenhard, C. Wells, R. \ Kodzius, K. Shimokawa, V. B. Bajic, S. E. Brenner, S. Batalov, A. R. R. \ Forrest, M. Zavolan, M. J. Davis, L. G. Wilming, V. Aidinis, J. Allen, \ A. Ambesi-Impiombato, R. Apweiler, R. N. Aturaliya, T. L. Bailey, M. \ Bansal, K. W. Beisel, T. Bersano, H. Bono, A. M. Chalk, K. P. Chiu, V. \ Choudhary, A. Christoffels, D. R. Clutterbuck, M. L. Crowe, E. Dalla, \ B. P. Dalrymple, B. de Bono, G. Della Gatta, D. di Bernardo, T. Down, \ P. Engstrom, M. Fagiolini, G. Faulkner, C. F. Fletcher, T. Fukushima, \ M. Furuno, S. Futaki, M. Gariboldi, P. Georgii-Hemming, T. R. Gingeras, \ T. Gojobori, R. E. Green, S. Gustincich, M. Harbers, V. Harokopos, Y. \ Hayashi, S. Henning, T. K. Hensch, N. Hirokawa, D. Hill, L. Huminiecki, \ M. Iacono, K. Ikeo, A. Iwama, T. Ishikawa, M. Jakt, A. Kanapin, M. \ Katoh, Y. Kawasawa, J. Kelso, H. Kitamura, H. Kitano, G. Kollias, S. \ P. T. Krishnan, A.F. Kruger, S.K. Kummerfeld, I. V. Kurochkin, \ L. F. Lareau, L. Lipovich, J. Liu, S. Liuni, S. McWilliam, M. Madan \ Babu, M. Madera, L. Marchionni, H. Matsuda, S. Matsuzawa, H. Miki, F. \ Mignone, S. Miyake, K. Morris, S. Mottagui-Tabar, N. Mulder, N. Nakano, \ H. Nakauchi, P. Ng, R. Nilsson, S. Nishiguchi, S. Nishikawa, F. Nori, \ O. Ohara, Y. Okazaki, V. Orlando, K. C. Pang, W. J. Pavan, G. Pavesi, \ G. Pesole, N. Petrovsky, S. Piazza, W. Qu, J. Reed, J. F. Reid, B. Z. \ Ring, M. Ringwald, B. Rost, Y. Ruan, S. Salzberg, A. Sandelin, C. \ Schneider, C. Sch.ANvnbach, K. Sekiguchi, C. A. M. Semple, S. Seno, \ L. Sessa, Y. Sheng, Y. Shibata, H. Shimada, K. Shimada, B. Sinclair, S. \ Sperling, E. Stupka, K. Sugiura, R. Sultana, Y. Takenaka, K. Taki, K. \ Tammoja, S. L. Tan, S. Tang, M. S. Taylor, J. Tegner, S. A. Teichmann, \ H. R. Ueda, E. van Nimwegene, R. Verardo, C. L. Wei, K. Yagi, H. \ Yamanishi, E. Zabarovsky, S. Zhu, A. Zimmer, W. Hide, C. Bult, S. M. \ Grimmond, R. D. Teasdale, E. T. Liu, V. Brusic, J. Quackenbush, C. \ Wahlestedt, J. Mattick, D. Hume.
\\ RIKEN Genome Exploration Research Group: C. Kai, D. Sasaki, Y. \ Tomaru, S. Fukuda, M. Kanamori-Katayama, M. Suzuki, J. Aoki, T. \ Arakawa, J. Iida, K. Imamura, M. Itoh, T. Kato, H. Kawaji, N. \ Kawagashira, T. Kawashima, M. Kojima, S. Kondo, H. Konno, K. Nakano, N. \ Ninomiya, T. Nishio, M. Okada, C. Plessy, K. Shibata, T. Shiraki, S. \ Suzuki, M. Tagami, K Waki, A. Watahiki, Y. Okamura-Oho, H. Suzuki, J. \ Kawai.
\\ General Organizer: Y. Hayashizaki\ \
\ Shiraki, T., Kondo, S., Katayama, S., Waki, K., Kasukawa, T., Kawaji, H., \ Kodzius, R., Watahiki, A., Nakamura, M. et al.\ Cap analysis gene expression for high-throughput analysis of \ transcriptional starting point and identification of promoter usage.\ Proc Natl Acad Sci U S A. 100(26), 15776-81 (2003).
\ encodeTxLevels 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ itemRgb On\ longLabel Riken CAGE Mapped Tags - TEST TRACK ONLY\ origAssembly hg16\ priority 25.0\ shortLabel Riken CAGE Tags\ track encodeRikenCageMappedTags\ type bed 9 .\ visibility hide\ encodeAffyChIpHl60SitesH3K27me3Hr00 Affy H3K27me3 RA 0h bed 3 . Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 0hrs) Sites 0 26 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 26\ shortLabel Affy H3K27me3 RA 0h\ subGroups factor=H3K27me3 time=0h\ track encodeAffyChIpHl60SitesH3K27me3Hr00\ encodeAffyRnaHl60SitesHr02IntergenicProximal Affy Ig Prx HL60 2h bed 4 . Affymetrix Intergenic Proximal HL60 Retinoic 2hr Transfrags 0 26 149 173 43 202 214 149 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 149,173,43\ longLabel Affymetrix Intergenic Proximal HL60 Retinoic 2hr Transfrags\ parent encodeNoncodingTransFrags\ priority 26\ shortLabel Affy Ig Prx HL60 2h\ subGroups region=intergenicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr02IntergenicProximal\ encodeAffyEc51GM06990Signal EC51 Sgnl GM0699 wig 0 62385 Affy Ext Trans Signal (51-base window) (GM06990) 0 26 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (51-base window) (GM06990)\ parent encodeAffyEcSignal\ priority 26\ shortLabel EC51 Sgnl GM0699\ track encodeAffyEc51GM06990Signal\ encodeAffyEc51GM06990Sites EC51 Site GM0699 bed 3 . Affy Ext Trans Sites (51-base window) (GM06990) 0 26 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (51-base window) (GM06990)\ parent encodeAffyEcSites\ priority 26\ shortLabel EC51 Site GM0699\ track encodeAffyEc51GM06990Sites\ ensemblGeneScaffold Ensembl Assembly bed 6 + Ensembl Gene Scaffold assembly 0 26 0 0 0 127 127 127 1 0 0\ This track shows the mapping of Ensembl gene scaffold coordinates onto\ the underlying assembly contigs or scaffolds. Items are displayed in black \ and gray. Gray items indicate segments of the gene scaffold that do not map \ in linear order to the underlying contig or scaffold.\
\\ These gene scaffolds were generated by Ensembl.
\ \\ This track is created from the Ensembl MySQL tables: assembly.txt\ and seq_region.txt.\
\\ For a description of the methods used in Ensembl gene prediction, refer to \ Hubbard, T. et al. (2002) in the References section below.
\ \ \\ Thanks to Ensembl for providing this annotation.
\ \\ Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J,\ Curwen V, Down T, et al. \ The Ensembl genome database project.\ Nucleic Acids Res. 2002 Jan 1;30(1):38-41.
\ map 1 color 0,0,0\ group map\ longLabel Ensembl Gene Scaffold assembly\ priority 26\ shortLabel Ensembl Assembly\ track ensemblGeneScaffold\ type bed 6 +\ useScore 1\ visibility hide\ pGC GC Samples sample GC Percent Sample Track 0 26 0 0 0 127 127 127 0 0 0 map 0 group map\ longLabel GC Percent Sample Track\ priority 26\ shortLabel GC Samples\ track pGC\ type sample\ visibility hide\ encodeStanfordPromoters Stanf Promoter bed 9 + Stanford Promoter Activity 0 26 0 0 0 127 127 127 0 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays activity levels of 643 putative promoter fragments\ in the ENCODE regions, based on high-throughput transient transfection\ luciferase reporter assays. The activity of each putative promoter is\ indicated by color, ranging from black (no activity) to red (strong\ activity). Each of the fragments was tested in a panel of 16 cell\ lines:\
\
\\ \ \\
\\ Cell Line Classification Isolated From \ AGS gastric adenocarcinoma stomach \ BE(2)-C neuroblastoma brain (metastatic, from bone marrow) \ T98G (CRL-1690) glioblastoma brain \ G-402 renal leiomyoblastoma kidney \ HCT 116 colorectal carcinoma colon \ HMCB melanoma skin \ HT-1080 fibrosarcoma connective tissue \ SK-N-SH (HTB-11) neuroblastoma brain (metastatic, from bone marrow) \ HeLa adenocarcinoma cervix \ HepG2 hepatocellular carcinoma liver \ JEG-3 choriocarcinoma placenta \ MG-63 osteosarcoma bone \ MRC-5 fibroblast lung \ PANC-1 epithelioid carcinoma pancreas (duct) \ SNU-182 hepatocellular carcinoma liver \ U-87 MG glioblastoma-astrocytoma brain
\ Promoters in the ENCODE region were predicted using a variation on methods\ previously described (Trinklein et al., 2003, Trinklein et \ al., 2004). Using BLAT alignments of human cDNAs in Genbank to the \ genome, those with at least one bp of exon overlap were merged,\ generating gene models. The transcription start sites were predicted\ by assigning the 5' end of each gene model as one transcription start\ site and alternative 5' ends that were at least 500 bp downstream and\ supported by full-length cDNAs as other start sites. Promoters were \ defined as the regions approximately 600 bp upstream and 100 bp\ downstream of each transcription start site. \
\\ Primer3 was used to pick primers yielding approximately 500 bp\ amplicons containing the predicted transcription start site. Each\ fragment of DNA represented in this track was cloned into a\ luciferase reporter vector (pGL3-Basic, Promega) using the BD\ Clontech Infusion Cloning System. The Dual Luciferase system\ (Promega) was used to co-transfect the experimental DNA along with a\ control plasmid expressing Renilla - to control for variation in \ transcription efficiency - in 96-well format into one of the sixteen \ cell types using FuGENE Transfection Reagent (Roche). Each\ transfection was done in duplicate. \
\\ Data are reported as normalized and log2 transformed averages of the\ Luciferase/Renilla ratio. This normalization was based on the\ activity of 102 random genomic fragments (negative controls) derived \ from exons and intergenic regions. Such a normalization allows\ for a meaningful comparison between cell types. The average log transformed \ Luciferase/Renilla ratio was scaled linearly to create a score where the\ maximum value is 1000 and the minimum value is 0. This score is arbitrary\ and for visualization purposes only; the raw ratio values should be used\ for all analyses. \
\ \\ Data were verified by repeating the preparation and measurement of\ 48 random fragments. No significant variation between the two\ preparations was detected.\
\\ A spreadsheet containing the negative control data can be downloaded \ here.\
\ \This\ work was done in collaboration at the \ Myers Lab at Stanford University (now at HudsonAlpha Institute for Biotechnology). The following people contributed: Sara J. Cooper, Nathan D. Trinklein, Elizabeth D. Anton, Loan Nguyen, and Richard M. Myers. \
\ \Trinklein ND, Aldred SJ, Saldanha AJ, Myers RM.\ Identification and functional analysis of human transcriptional \ promoters.\ Genome Res. 2003 Feb;13(2):308-12.\
\ \\ Trinklein ND, Aldred SF, Hartman SJ, Schroeder DI, Otillar RP,\ Myers RM.\ An abundance of bidirectional promoters in the human genome.\ Genome Res. 2004 Jan;14(1):62-6. \
\ encodeTxLevels 1 chromosomes chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ exonArrowsDense on\ group encodeTxLevels\ itemRgb on\ longLabel Stanford Promoter Activity\ origAssembly hg16\ priority 26.0\ shortLabel Stanf Promoter\ track encodeStanfordPromoters\ type bed 9 +\ visibility hide\ encodeAffyChIpHl60PvalH3K27me3Hr02 Affy H3K27me3 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 2hrs) P-Value 0 27 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 27\ shortLabel Affy H3K27me3 RA 2h\ subGroups factor=H3K27me3 time=2h\ track encodeAffyChIpHl60PvalH3K27me3Hr02\ encodeAffyRnaHl60SitesHr08IntergenicProximal Affy Ig Prx HL60 8h bed 4 . Affymetrix Intergenic Proximal HL60 Retinoic 8hr Transfrags 0 27 155 167 55 205 211 155 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 155,167,55\ longLabel Affymetrix Intergenic Proximal HL60 Retinoic 8hr Transfrags\ parent encodeNoncodingTransFrags\ priority 27\ shortLabel Affy Ig Prx HL60 8h\ subGroups region=intergenicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr08IntergenicProximal\ encodeAffyEc1HepG2Signal EC1 Sgnl HepG2 wig 0 62385 Affy Ext Trans Signal (1-base window) (HepG2) 0 27 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (1-base window) (HepG2)\ parent encodeAffyEcSignal\ priority 27\ shortLabel EC1 Sgnl HepG2\ track encodeAffyEc1HepG2Signal\ encodeAffyEc1HepG2Sites EC1 Sites HepG2 bed 3 . Affy Ext Trans Sites (1-base window) (HepG2) 0 27 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (1-base window) (HepG2)\ parent encodeAffyEcSites\ priority 27\ shortLabel EC1 Sites HepG2\ track encodeAffyEc1HepG2Sites\ hiSeqDepth Hi Seq Depth bed 3 Regions of Exceptionally High Depth of Aligned Short Reads 0 27 139 69 19 197 162 137 0 0 0\ This track displays regions of the reference genome that have exceptionally high\ sequence depth, inferred from alignments of short-read sequences from the\ 1000 Genomes Project.\ These regions may be caused by collapsed repetitive sequences\ in the reference genome assembly; they also have high read depth in assays such as\ ChIP-seq, and may trigger false positive calls from peak-calling algorithms.\ Excluding these regions from analysis of short-read alignments should reduce\ such false positive calls.\
\ \\ Pickrell et al. downloaded sequencing reads for 57 Yoruba individuals\ from the 1000 Genomes Project's low-coverage pilot data, mapped them to the\ Mar. 2006 human genome assembly (NCBI36/hg18), computed the read depth for\ every base in the genome, and compiled a distribution of read depths.\ They then identified contiguous regions where read depth exceeded thresholds\ corresponding to the top 0.001, 0.005, 0.01, 0.05 and 0.1 of the per-base \ read depths, merging regions which fall within 50 bases of each other.\ The regions are available for download from\ http://eqtl.uchicago.edu/Masking/\ (see the\ readme file).\
\ \\ Thanks to Joseph Pickrell at the University of Chicago for these data.\
\ \\ Pickrell JK, Gaffney DJ, Gilad Y, Pritchard JK.\ \ False positive peaks in ChIP-seq and other sequencing-based\ functional assays caused by unannotated high copy number regions.\ Bioinformatics. 2011 Aug 1;27(15):2144-6. Epub 2011 Jun 19.\
\ map 1 altColor 0,0,0\ color 139,69,19\ compositeTrack on\ group map\ longLabel Regions of Exceptionally High Depth of Aligned Short Reads\ priority 27\ shortLabel Hi Seq Depth\ track hiSeqDepth\ type bed 3\ visibility hide\ ncbiIncidentDb NCBI Incident bigBed 4 + NCBI Incident database 0 27 0 0 0 127 127 127 0 0 0 http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/issue_detail.cgi?id=$$\ This track indicates locations in this genome assembly where assembly\ problems have been noted or resolved. The data is taken directly from the\ Genome Reference Consortium. This track updates\ once a day to catch up with new issues.\
\\ If you would like to report an assembly issue with this genome assembly,\ please use the Genome Reference Consortium\ issue reporting system.\
\\ Data for this track is extracted from the Genome Reference Consortium\ \ incident database. This data is checked once a day for updates.\ The track will include any new updates on a daily basis.\
\The data and presentation of this track were prepared by\ Hiram Clawson.\
\ map 1 group map\ longLabel NCBI Incident database\ nonBedFieldsLabel Summary information from incident database:\ priority 27\ shortLabel NCBI Incident\ track ncbiIncidentDb\ type bigBed 4 +\ url http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/issue_detail.cgi?id=$$\ urlLabel NCBI Incident:\ visibility hide\ encodeStanfordRtPcr Stanf RTPCR bed 5 + Stanford Endogenous Transcript Levels in HCT116 Cells 0 27 0 0 0 127 127 127 1 0 17 chr1,chr11,chr13,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays absolute transcript copy numbers for 136 genes and\ 12 negative control intergenic regions, determined by RTPCR in HCT116 cells.\ \
\ The genomic regions are indicated by solid blocks. The shade of an item gives a \ rough indication of its count, ranging from light gray for zero to black for a \ count of 7000 or greater. To display only those items that exceed a specific \ unnormalized score, enter a minimum score between 0 and 1000 in the text box at \ the top of the track description page.
\ \\ Total RNA was prepared in quadruplicate from HCT116 cells grown in\ culture. cDNA was prepared as described in Trinklein et\ al. (2004). Duplicate primer pairs were designed to each gene, and the\ absolute number of cDNA molecules containing each amplicon were\ determined by real-time PCR. The submitted data are the calculated\ number of molecules of each transcript containing the defined\ amplicon.
\ \\ Four biological replicates were performed, and two primer pairs were\ used to measure the abundance of each transcript.
\ \\ These data were generated in the Richard M. Myers lab \ at Stanford University (now at \ HudsonAlpha Institute for Biotechnology).\ \
\ Trinklein, N.D., Chen, W.C., Kingston, R.E. and Myers, R.M. \ Transcriptional regulation and binding of HSF1 and HSF2\ to 32 human heat shock genes during thermal stress and\ differentiation.\ Cell Stress Chaperones 9(1), 21-28 (2004).
\ encodeTxLevels 1 chromosomes chr1,chr11,chr13,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Stanford Endogenous Transcript Levels in HCT116 Cells\ origAssembly hg16\ priority 27.0\ shortLabel Stanf RTPCR\ track encodeStanfordRtPcr\ type bed 5 +\ useScore 1\ visibility hide\ nextNcbiIncidentDb TBD NCBIIncident bigBed 4 + In Progress NCBI Incident database 0 27 0 0 0 127 127 127 0 0 0 http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/issue_detail.cgi?id=$$ map 1 group map\ longLabel In Progress NCBI Incident database\ nonBedFieldsLabel Summary information from incident database:\ priority 27\ shortLabel TBD NCBIIncident\ track nextNcbiIncidentDb\ type bigBed 4 +\ url http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/issue_detail.cgi?id=$$\ urlLabel NCBI Incident:\ visibility hide\ encodeAffyChIpHl60SitesH3K27me3Hr02 Affy H3K27me3 RA 2h bed 3 . Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 2hrs) Sites 0 28 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 28\ shortLabel Affy H3K27me3 RA 2h\ subGroups factor=H3K27me3 time=2h\ track encodeAffyChIpHl60SitesH3K27me3Hr02\ encodeAffyRnaHl60SitesHr32IntergenicProximal Affy Ig Prx HL60 32h bed 4 . Affymetrix Intergenic Proximal HL60 Retinoic 32hr Transfrags 0 28 161 161 67 208 208 161 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 161,161,67\ longLabel Affymetrix Intergenic Proximal HL60 Retinoic 32hr Transfrags\ parent encodeNoncodingTransFrags\ priority 28\ shortLabel Affy Ig Prx HL60 32h\ subGroups region=intergenicProximal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr32IntergenicProximal\ encodeAffyEc51HepG2Signal EC51 Sgnl HepG2 wig 0 62385 Affy Ext Trans Signal (51-base window) (HepG2) 0 28 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (51-base window) (HepG2)\ parent encodeAffyEcSignal\ priority 28\ shortLabel EC51 Sgnl HepG2\ track encodeAffyEc51HepG2Signal\ encodeAffyEc51HepG2Sites EC51 Site HepG2 bed 3 . Affy Ext Trans Sites (51-base window) (HepG2) 0 28 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (51-base window) (HepG2)\ parent encodeAffyEcSites\ priority 28\ shortLabel EC51 Site HepG2\ track encodeAffyEc51HepG2Sites\ humanParalog Human Paralog bed 5 + Human Paralogs Using Fgenesh++ Gene Predictions 0 28 0 100 0 255 240 200 1 0 0 map 1 altColor 255,240,200\ color 0,100,0\ group map\ longLabel Human Paralogs Using Fgenesh++ Gene Predictions\ priority 28\ shortLabel Human Paralog\ spectrum on\ track humanParalog\ type bed 5 +\ visibility hide\ encodeYaleMASPlacRNATransMap Yale MAS RNA bedGraph 4 Yale Maskless Array Synthesizer, RNA Transcript Map 0 28 0 0 0 127 127 127 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22,\ This track shows the forward (+) and reverse (-) strand transcript map of \ intensity scores (estimating RNA abundance) for human NB4 cell total RNA,\ and human placental Poly(A)+ RNA, hybridized \ to the Yale MAS (Maskless Array Synthesizer) ENCODE oligonucleotide \ microarray, transcription mapping design #1. This array has 36-mer \ oligonucleotide probes approximately every 36 bp (i.e. \ end-to-end) covering all the non-repetitive DNA sequence of the ENCODE \ regions ENm001-ENm012. See NCBI\ GEO GPL2105 for details of this array design.
\\ This transcript map is a combined signal from three biological replicates, \ each with at least two technical replicates. Arrays were hybridized using \ either the standard Nimblegen protocol or the protocol described in Bertone \ et al. (2004). The label of each subtrack in this annotation \ indicates the specific protocol used for that particular data set.
\ \\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ A score was assigned to each oligonucleotide probe position by combining \ two or more technical replicates and by using a sliding window \ approach. Within a sliding window of 160 bp (corresponding to 5 \ oligos), the hybridization intensities for all replicates of each \ oligonucleotide probe were compared to their respective array median \ score. Within the window and across all the replicates, the number of \ probes above and below their respective median were counted. Using the \ sign test, a one-sided P-value was then calculated and a score defined \ as score=-log(P-value) was assigned to the oligo in the center of \ the window.
\\ Three independent biological replicates were generated and each was \ hybridized to at least 2 different arrays (technical replicates).
\ \\ Reasonable correlation coefficients between replicates were ensured. \ Additionally, transcribed regions (TARs/transfrags) were called and \ compared between technical and biological replicates to ensure \ significant overlap.
\ \\ These data were generated and analyzed by the labs of Michael Snyder, \ Mark Gerstein and Sherman Weissman at Yale University.
\ \\ Bertone, P., Stolc, V., Royce, T.E., Rozowsky, J.S., Urban, A.E., Zhu, X., \ Rinn, J.L., Tongprasit, W., Samanta, M. et al.\ Global identification of human transcribed sequences with \ genome tiling arrays. \ Science 306(5705), 2242-6 (2004).
\\ Cheng, J., Kapranov, P., Drenkow, J., Dike, S., Brubaker, S., Patel, S., \ Long, J., Stern, D., Tammana, H. et al.\ Transcriptional maps of 10 human chromosomes at 5-nucleotide \ resolution. \ Science 308(5725), 1149-54 (2005).
\\ Kapranov, P., Cawley, S.E., Drenkow, J., Bekiranov, S., Strausberg, R.L., \ Fodor, S.P. and Gingeras, T.R.\ Large-scale transcriptional activity in chromosomes 21 and \ 22. \ Science 296(5569), 916-9 (2002).
\\ Kluger, Y., Tuck, D.P., Chang, J.T., Nakayama, Y., Poddar, R., Kohya, N., \ Lian, Z., Ben Nasr, A., Halaban, H.R. et al.\ Lineage specificity of gene expression patterns. \ Proc Natl Acad Sci U S A 101(17), 6508-13 (2004).
\\ Rinn, J.L., Euskirchen, G., Bertone, P., Martone, R., Luscombe, N.M., \ Hartman, S., Harrison, P.M., Nelson, F.K., Miller, P. et al.\ The transcriptional activity of human Chromosome 22. \ Genes Dev 17(4), 529-40 (2003).
\ encodeTxLevels 0 autoScale Off\ chromosomes chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Yale Maskless Array Synthesizer, RNA Transcript Map\ maxHeightPixels 128:16:16\ maxLimit 10.536\ minLimit 0\ origAssembly hg16\ priority 28.0\ shortLabel Yale MAS RNA\ superTrack encodeYaleRnaSuper dense\ track encodeYaleMASPlacRNATransMap\ type bedGraph 4\ viewLimits 0:11\ visibility hide\ encodeYaleRnaSuper Yale RNA Yale RNA (Neutrophil, Placenta and NB4 cells) 0 28 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks from Yale Transcript Map analysis.\ These tracks contain transcriptome data from different\ cell lines and biological samples as well as analysis of transcriptionally\ active regions (TARs).\
\ Experiments were performed with Yale MAS (Maskless Array Synthesizer)\ ENCODE oligonucleotide microarray (see NCBI\ GEO GPL2105 for details of this array design) as well as\ the Affymetrix ENCODE oligonucleotide microarray. Multiple biological samples \ were assayed, such as total RNA from human NB4 cells. Experiments also included \ chemical treatments such as retinoic acid (RA) treatments.\ \\ These data were generated and analyzed by the the labs of Michael Snyder,\ Mark Gerstein and Sherman Weissman at Yale University.
\ \\ These data were generated and analyzed by the Yale/Affymetrix\ collaboration among the labs of Michael Snyder, Mark Gerstein and\ Sherman Weissman at Yale University and Tom Gingeras at Affymetrix.
\ \\ These data were generated and analyzed by the lab of Mark Gerstein \ at Yale University.
\ \\ Bertone P, Stolc V, Royce TE, Rozowsky JS, Urban AE, Zhu X,\ Rinn JL, Tongprasit W, Samanta M et al.\ Global identification of human transcribed sequences with\ genome tiling arrays.\ Science. 2004 Dec 24;306(5705):2242-6.
\\ Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S,\ Long J, Stern D, Tammana H et al.\ Transcriptional maps of 10 human chromosomes at 5-nucleotide\ resolution.\ Science. 2005 May 20;308(5725):1149-54.
\\ Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL,\ Fodor SP, Gingeras TR.\ Large-scale transcriptional activity in chromosomes 21 and 22.\ Science. 2002 May 3;296(5569):916-9.
\\ Kluger Y, Tuck DP, Chang JT, Nakayama Y, Poddar R, Kohya N,\ Lian Z, Ben Nasr A, Halaban HR et al.\ Lineage specificity of gene expression patterns.\ Proc Natl Acad Sci U S A. 2004 Apr 27;101(17):6508-13.
\\ Rinn JL, Euskirchen G, Bertone P, Martone R, Luscombe NM,\ Hartman S, Harrison PM, Nelson FK, Miller P et al.\ The transcriptional activity of human Chromosome 22.\ Genes Dev. 2003 Feb 15;17(4):529-40.
\ encodeTxLevels 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeTxLevels\ longLabel Yale RNA (Neutrophil, Placenta and NB4 cells)\ priority 28\ shortLabel Yale RNA\ superTrack on\ track encodeYaleRnaSuper\ encodeAffyChIpHl60PvalH3K27me3Hr08 Affy H3K27me3 RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 8hrs) P-Value 0 29 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 29\ shortLabel Affy H3K27me3 RA 8h\ subGroups factor=H3K27me3 time=8h\ track encodeAffyChIpHl60PvalH3K27me3Hr08\ encodeAffyEc1K562Signal EC1 Sgnl K562 wig 0 62385 Affy Ext Trans Signal (1-base window) (K562) 0 29 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (1-base window) (K562)\ parent encodeAffyEcSignal\ priority 29\ shortLabel EC1 Sgnl K562\ track encodeAffyEc1K562Signal\ encodeAffyEc1K562Sites EC1 Sites K562 bed 3 . Affy Ext Trans Sites (1-base window) (K562) 0 29 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (1-base window) (K562)\ parent encodeAffyEcSites\ priority 29\ shortLabel EC1 Sites K562\ track encodeAffyEc1K562Sites\ celeraCoverage WSSD Coverage bed 4 . Regions Assayed for SDD 0 29 0 0 0 127 127 127 0 0 0\ This track represents coverage of clones that were assayed for \ segmental duplications using high-depth Celera reads. Absent regions were \ not assessed by this version of the Segmental Duplication Database (SDD). \ For a description of the whole-genome shotgun sequence detection (WSSD)\ "fuguization" method, see Bailey, J.A. et al. (2001) in \ the References section below.
\ \\ The data were provided by \ Xinwei She \ and Evan Eichler as part of their\ effort to map human paralogy at the \ University of Washington.
\ \\ Bailey, J.A., et al., \ Recent segmental duplications in the human genome. \ Science 297(5583), 945-7 (2002).
\\ Bailey, J.A., et al., \ Segmental duplications: organization and impact within the \ current human genome project assembly, Genome Res. 11(6), \ 1005-17 (2001).
\\ She, X., et al., \ Shotgun sequence assembly and recent segmental duplications \ within the human genome. Nature 431(7011), 927-30 (2004).\
\ map 1 group map\ longLabel Regions Assayed for SDD\ priority 29\ shortLabel WSSD Coverage\ track celeraCoverage\ type bed 4 .\ visibility hide\ encodeYaleAffyNB4RARNATarsIntergenicProximal Yale Ig Prx NB4 RA bed 4 . Yale Intergenic Proximal NB4 Retinoic TARs 0 29 167 155 79 211 205 167 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 167,155,79\ longLabel Yale Intergenic Proximal NB4 Retinoic TARs\ parent encodeNoncodingTransFrags\ priority 29\ shortLabel Yale Ig Prx NB4 RA\ subGroups region=intergenicProximal celltype=nb4 source=yale\ track encodeYaleAffyNB4RARNATarsIntergenicProximal\ encodeYaleMASPlacRNATars Yale MAS TAR bed 6 . Yale Maskless Array Synthesizer, RNA Transcriptionally Active Regions 0 29 0 0 0 127 127 127 0 0 8 chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22,\ This track shows the locations of forward (+) and reverse (-) strand \ transcriptionally-active regions (TARs)/transcribed fragments \ (transfrags), for human NB4 cell total RNA and for\ human placenta Poly(A)+ RNA, hybridized to the Yale \ Maskless Array Synthesizer (MAS) ENCODE oligonucleotide microarray, \ transcription mapping design #1. This array has 36-mer oligonucleotide probes \ approximately every 36 bp (i.e. end-to-end) covering all the \ non-repetitive DNA sequence of the ENCODE regions ENm001 - ENm012. See \ NCBI GEO accession \ GPL2105 for details of this array design.
\\ These TARs/transfrags are based on a transcript map combining \ hybridization intensities from three biological replicates, each with at \ least two technical replicates. Arrays were hybridized using either\ Nimblegen standard protocol, or the protocol described in Bertone \ et al. (2004). The label of each subtrack in this annotation \ indicates the specific protocol used for that particular data set.
\ \\ A score was assigned to each oligonucleotide probe position by combining \ two or more technical replicates and by using a sliding window \ approach. Within a sliding window of 160 bp (corresponding to 5 \ oligos), the hybridization intensities for all replicates of each \ oligonucleotide probe were compared to their respective array median \ intensity. Within the window and across all the replicates, the number \ of probes above and below their respective median was counted. Using \ the sign test, a one-sided P-value was then calculated and a score \ defined as score=-log(p-value) was assigned to the oligo in the \ center of the window.
\\ Three independent biological replicates were generated, and each was \ hybridized to at least two different arrays (technical replicates). \ Transcribed regions (TARs/transfrags) were then identified using a score \ threshold of 95th percentile as well as a maximum gap of 80 bp and a \ minimum run of 50 bp (between oligonucleotide positions), effectively \ allowing a gap of one oligo and demanding the TAR/transfrag to \ encompass at least 3 oligos.
\ \\ Transcribed regions (TARs/transfrags), as determined by individual biological \ samples, were compared to ensure significant overlap.
\ \\ These data were generated and analyzed by the the labs of Michael Snyder, \ Mark Gerstein and Sherman Weissman at Yale University.
\ \\
Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP, \
Gingeras TR,\
\
Large-scale transcriptional activity in chromosomes 21 and 22, \
Science. 2002 May 3;296(5569):916-9.\
\
\
Rinn JL, Euskirchen G, Bertone P, Martone R, Luscombe NM, Hartman S, \
Harrison PM, Nelson FK, Miller P, Gerstein M, Weissman S, Snyder M, \
\
The transcriptional activity of human Chromosome 22, \
Genes Dev, 2003 Feb 15;17(4):529-40.\
\
\
Bertone P, Stolc V, Royce TE, Rozowsky JS, Urban AE, Zhu X, Rinn JL, \
Tongprasit W, Samanta M, Weissman S, Gerstein M, Snyder M,\
\
Global identification of human transcribed sequences with genome tiling arrays, \
Science. 2004 Dec 24;306(5705):2242-6. Epub 2004 Nov 11.\
\
\
Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S, Long J, \
Stern D, Tammana H, Helt G, Sementchenko V, Piccolboni A, Bekiranov S, \
Bailey DK, Ganesh M, Ghosh S, Bell I, Gerhard DS, Gingeras TR,\
\
Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution, \
Science. 2005 May 20;308(5725):1149-54. Epub 2005 Mar 24.\
encodeTxLevels 1 chromosomes chr5,chr7,chrX,chr11,chr16,chr19,chr21,chr22\
compositeTrack on\
dataVersion ENCODE June 2005 Freeze\
group encodeTxLevels\
longLabel Yale Maskless Array Synthesizer, RNA Transcriptionally Active Regions\
origAssembly hg16\
priority 29.0\
shortLabel Yale MAS TAR\
superTrack encodeYaleRnaSuper dense\
track encodeYaleMASPlacRNATars\
type bed 6 .\
visibility hide\
gad GAD View bed 4 Genetic Association Studies of Complex Diseases and Disorders 0 29.5 200 0 0 227 127 127 0 0 0 http://geneticassociationdb.nih.gov/cgi-bin/tableview.cgi?table=allview&cond=gene=
\ The Genetic Association Database (GAD) is intended for use primarily by \ medical scientists and other professionals concerned with genetic disorders, \ by genetics researchers, and by advanced students in science \ and medicine. While the GAD database is open to the public, \ users seeking information about a personal medical or \ genetic condition are urged to consult with a qualified \ physician for diagnosis and for answers to personal questions.\ These data are provided by the GAD\ and do not represent any additional curation by UCSC.
\ \\ The \ Genetic Association Database is an archive of human genetic \ association studies of complex diseases and disorders. The goal \ of the database is to allow the user to rapidly identify medically \ relevant polymorphism from the large volume of polymorphism and \ mutational data, in the context of standardized nomenclature.\
\If the track is displayed in "pack" or "full" mode, \ mousing over an entry of this track will show a pop-up message listing all \ associated diseases. \ In "full" mode, each feature is labeled with the associated disease \ class code (as defined below). \
Disease Class | Disease Class Code |
AGING | AGE |
CANCER | CAN |
CARDIOVASCULAR | CARD |
CHEMICAL DEPENDENCY | CHEM |
DEVELOPMENTAL | DEV |
HEMATOLOGICAL | HEM |
IMMUNE | IMM |
INFECTION | INF |
METABOLIC | MET |
MITOCHONDRIAL | MITO |
NEUROLOGICAL | NEUR |
NORMAL VARIATION | NV |
OTHER | OTH |
PHARMACOGENOMICS | PHARM |
PSYCHIATRIC | PSY |
RENAL | REN |
REPRODUCTION | REP |
UNKNOWN | UNK |
VISION | VIS |
\ Study data are recorded in the context of official human gene \ nomenclature with additional molecular reference numbers and links. The data\ are gene-centered; that is, each record is based on a gene or marker. \ For example, if a study investigated six genes for a particular disorder, \ there will be six records. Gene information is standardized and annotated with \ molecular information, enabling integration with other molecular and genomic \ data resources.\
\ \\ Data are added to GAD on a periodic\ basis by the curator or investigators. A majority of the records in GAD\ are extracted from the online \ HuGE Navigator \ database, which is sponsored by the Centers for Disease Control and \ Prevention. HuGE Navigator\ provides access to a continuously updated, curated knowledge base of\ gene-disease associations, meta-analyses, and related information on genes\ and diseases extracted from NCBI PubMed. A gene-centered view is available via\ Genopedia, which is also available as \ HuGETrack.
\ \ \\ For more information on this dataset, contact \ Kevin G. Becker, PhD,\ \ Yongqing Zhang, PhD, and John Garner, MS, \ from the DNA Array Unit, NIA, NIH.\
\ \\ Becker KG, Barnes KC, Bright TJ, Wang AS. \ The Genetic Association Database. \ Nature Genetics 2004 May; 36(5):431-432.\
\ phenDis 1 color 200,0,0\ group phenDis\ longLabel Genetic Association Studies of Complex Diseases and Disorders\ priority 29.5\ shortLabel GAD View\ track gad\ type bed 4\ url http://geneticassociationdb.nih.gov/cgi-bin/tableview.cgi?table=allview&cond=gene=\ visibility hide\ decipher DECIPHER bed 4 DECIPHER: Chromosomal Imbalance and Phenotype in Humans 0 29.6 0 0 0 127 127 127 0 0 0NOTE:
\
While the DECIPHER database is \
open to the public, users seeking information about a personal medical or\
genetic condition are urged to consult with a qualified physician for\
diagnosis and for answers to personal questions.\
Because the UCSC Genes mappings are based on associations from RefSeq and\ UniProt, they are dependent on any interpretations from those sources.\ Furthermore, because many DECIPHER records refer to multiple gene names,\ or syndromes not tightly mapped to individual genes, the associations\ in this track should be treated with skepticism and any conclusions\ based on them should be carefully scrutinized using independent\ resources.\
\UCSC is authorized to present these data only via display in the\ Browser, and not for bulk download. Access to bulk data may be obtained\ directly from DECIPHER and is subject to a Data Access Agreement, in which\ the user certifies that no attempt to identify individual patients will\ be undertaken. The same restrictions apply to the public data displayed\ at UCSC: No one is authorized to attempt to identify patients by any\ means.\
\\ The \ DECIPHER database of submicroscopic chromosomal imbalance \ collects clinical information about chromosomal \ microdeletions/duplications/insertions, translocations and inversions, \ and displays this information on the human genome map.\
\ This track shows genomic regions of reported cases and their \ associated phenotype information. All data have passed the strict\ consent requirements of the DECIPHER project and are approved for\ unrestricted public release. Clicking the Patient View ID link\ brings up a more detailed informational page on the patient at the \ DECIPHER web site. \
\ Data provided by the DECIPHER project group are imported and processed \ to create a simple BED track to annotate the genomic regions associated \ with individual patients. \
\\
The entries are colored red for deletions \
(mean log ratio < 0) and blue for duplications \
(mean log ratio > 0).\
Note that the color scheme changed in March, 2011.\
\
\ For more information on DECIPHER, please contact\ \ decipher@sanger.\ ac.\ uk.\
\ \\ Firth HV, Richards SM, Bevan AP, Clayton S, Corpas M, Rajan D, Van Vooren S, Moreau Y, Pettett RM, Carter NP.\ \ DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources.\ Am J Hum Genet. 2009 Apr;84(4):524-33.\ (Cambridge University Department of Medical Genetics, Addenbrooke's Hospital, Cambridge CB2 2QQ, UK.\ \ hvf21@cam.\ ac.\ uk.)\
\ \ phenDis 1 color 0,0,0\ group phenDis\ longLabel DECIPHER: Chromosomal Imbalance and Phenotype in Humans\ nextExonText Right edge\ prevExonText Left edge\ priority 29.6\ shortLabel DECIPHER\ tableBrowser off decipherRaw knownToDecipher knownCanonToDecipher\ track decipher\ type bed 4\ visibility hide\ omimAvSnp OMIM AV SNPs bed 4 OMIM Allelic Variant SNPs 0 29.71 0 80 0 127 167 127 0 0 0 http://www.omim.org/entry/NOTE:
\
OMIM is intended for use primarily by physicians and other\
professionals concerned with genetic disorders, by genetics researchers, and\
by advanced students in science and medicine. While the OMIM database is\
open to the public, users seeking information about a personal medical or\
genetic condition are urged to consult with a qualified physician for\
diagnosis and for answers to personal questions. Further, please be\
sure to click through to omim.org for the very latest, as they are continually \
updating data.
NOTE ABOUT DOWNLOADS:
\
OMIM is the property \
of Johns Hopkins University and is not available for download or mirroring \
by any third party without their permission. Please see \
OMIM\
for downloads.
OMIM is a compendium of human genes and genetic phenotypes. The full-text,\ referenced overviews in OMIM contain information on all known Mendelian\ disorders and over 12,000 genes. OMIM is authored and edited at the\ McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University\ School of Medicine, under the direction of Dr. Ada Hamosh. This database\ was initiated in the early 1960s by Dr. Victor A. McKusick as a catalog\ of Mendelian traits and disorders, entitled Mendelian Inheritance\ in Man (MIM).\
\ \\ The OMIM data are separated into three separate tracks:\
\ \OMIM Allelic Variant SNPs\
Variants in the OMIM database that have associated \
dbSNP identifiers.\
\
OMIM Genes\
The genomic positions of gene entries in the OMIM \
database. The coloring indicates the associated OMIM phenotype map key.\
OMIM Phenotypes - Gene Unknown\
Regions known to be associated with a phenotype, \
but for which no specific gene is known to be causative. This track \
also includes known multi-gene syndromes.\
\ This track shows the allelic variants in the Online Mendelian Inheritance in Man\ (OMIM) database that have associated\ dbSNP identifiers.\
\ \Genomic positions of OMIM allelic variant SNPs are marked by solid blocks, which appear\ as tick marks when zoomed out. \
The details page for each variant displays the allelic variant description, the amino\ acid replacement, and the dbSNP identifier, with a link to that SNP's details page in the\ "All SNPs (132)" track.\
\The descriptions of OMIM entries are shown on the main browser display when Full display\ mode is chosen. In Pack mode, the descriptions are shown when mousing over each entry.\
\ \\ This track was constructed as follows: \
\ Thanks to OMIM and NCBI for the use of their data. This track was constructed by Fan Hsu,\ Robert Kuhn, and Brooke Rhead of the UCSC Genome Bioinformatics Group.
\ \Amberger J, Bocchini CA, Scott AF, Hamosh A.\ McKusick's Online Mendelian Inheritance in Man (OMIM®).\ Nucleic Acids Res. 2009 Jan;37(Database issue):D793-6. Epub 2008 Oct 8.\
\\ Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA.\ Online Mendelian Inheritance in Man (OMIM), a knowledgebase of\ human genes and genetic disorders.\ Nucleic Acids Res. 2005 Jan 1;33(Database issue):D514-7.\
\ phenDis 1 color 0, 80, 0\ group phenDis\ hgsid on\ longLabel OMIM Allelic Variant SNPs\ priority 29.71\ shortLabel OMIM AV SNPs\ tableBrowser off omimAv omimAvRepl\ track omimAvSnp\ type bed 4\ url http://www.omim.org/entry/\ visibility hide\ omimGene2 OMIM Genes bed 4 OMIM Genes - Dark Green Are Disease-causing 0 29.73 0 80 0 127 167 127 0 0 0 http://www.omim.org/entry/NOTE:
\
OMIM is intended for use primarily by physicians and other\
professionals concerned with genetic disorders, by genetics researchers, and\
by advanced students in science and medicine. While the OMIM database is\
open to the public, users seeking information about a personal medical or\
genetic condition are urged to consult with a qualified physician for\
diagnosis and for answers to personal questions. Further, please be\
sure to click through to omim.org for the very latest, as they are continually \
updating data.
NOTE ABOUT DOWNLOADS:
\
OMIM is the property \
of Johns Hopkins University and is not available for download or mirroring \
by any third party without their permission. Please see \
OMIM\
for downloads.
OMIM is a compendium of human genes and genetic phenotypes. The full-text,\ referenced overviews in OMIM contain information on all known Mendelian\ disorders and over 12,000 genes. OMIM is authored and edited at the\ McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University\ School of Medicine, under the direction of Dr. Ada Hamosh. This database\ was initiated in the early 1960s by Dr. Victor A. McKusick as a catalog\ of Mendelian traits and disorders, entitled Mendelian Inheritance\ in Man (MIM).\
\ \\ The OMIM data are separated into three separate tracks:\
\ \OMIM Allelic Variant SNPs\
Variants in the OMIM database that have associated \
dbSNP identifiers.\
\
OMIM Genes\
The genomic positions of gene entries in the OMIM \
database. The coloring indicates the associated OMIM phenotype map key.\
OMIM Phenotypes - Gene Unknown\
Regions known to be associated with a phenotype, \
but for which no specific gene is known to be causative. This track \
also includes known multi-gene syndromes.\
\ This track shows the genomic positions of all gene entries in the Online Mendelian\ Inheritance in Man (OMIM) database.\
\ \Genomic locations of OMIM gene entries are displayed as solid blocks. The entries are colored\ according to the associated OMIM phenotype map key (if any):\
Gene symbol and disease information, when available, are displayed on the details page for an\ item, and links to related RefSeq Genes and UCSC Genes are given.\
\The descriptions of the OMIM entries are shown on the main browser display when Full display\ mode is chosen. In Pack mode, the descriptions are shown when mousing over each entry. Items\ displayed can be filtered according to phenotype map key on the track controls page. \
\ \\ The mappings displayed in this track are based on OMIM gene entries, their Entrez Gene IDs, and\ the corresponding RefSeq Gene locations:\
\ *The locations in the refGene table are from alignments of RefSeq Genes to the reference\ genome using BLAT.\
\ \\ Thanks to OMIM and NCBI for the use of their data. This track was\ constructed by Fan Hsu, Robert Kuhn, and Brooke Rhead of the UCSC Genome Bioinformatics Group.
\ \Amberger J, Bocchini CA, Scott AF, Hamosh A. \ McKusick's Online Mendelian Inheritance in Man (OMIM®). \ Nucleic Acids Res. 2009 Jan;37(Database issue):D793-6. Epub 2008 Oct 8.\
\\ Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA. \ Online Mendelian Inheritance in Man (OMIM), a knowledgebase of \ human genes and genetic disorders. \ Nucleic Acids Res. 2005 Jan 1;33(Database issue):D514-7.\
\ phenDis 1 color 0, 80, 0\ group phenDis\ hgsid on\ longLabel OMIM Genes - Dark Green Are Disease-causing\ priority 29.73\ shortLabel OMIM Genes\ tableBrowser off omimGeneMap omimPhenotype omimGeneSymbol mim2gene\ track omimGene2\ type bed 4\ url http://www.omim.org/entry/\ visibility hide\ omimLocation OMIM Pheno Loci bed 4 OMIM Phenotypes - Gene Unknown 0 29.75 0 80 0 127 167 127 0 0 0 http://www.omim.org/entry/NOTE:
\
OMIM is intended for use primarily by physicians and other\
professionals concerned with genetic disorders, by genetics researchers, and\
by advanced students in science and medicine. While the OMIM database is\
open to the public, users seeking information about a personal medical or\
genetic condition are urged to consult with a qualified physician for\
diagnosis and for answers to personal questions. Further, please be\
sure to click through to omim.org for the very latest, as they are continually \
updating data.
NOTE ABOUT DOWNLOADS:
\
OMIM is the property \
of Johns Hopkins University and is not available for download or mirroring \
by any third party without their permission. Please see \
OMIM\
for downloads.
OMIM is a compendium of human genes and genetic phenotypes. The full-text,\ referenced overviews in OMIM contain information on all known Mendelian\ disorders and over 12,000 genes. OMIM is authored and edited at the\ McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University\ School of Medicine, under the direction of Dr. Ada Hamosh. This database\ was initiated in the early 1960s by Dr. Victor A. McKusick as a catalog\ of Mendelian traits and disorders, entitled Mendelian Inheritance\ in Man (MIM).\
\ \\ The OMIM data are separated into three separate tracks:\
\ \OMIM Allelic Variant SNPs\
Variants in the OMIM database that have associated \
dbSNP identifiers.\
\
OMIM Genes\
The genomic positions of gene entries in the OMIM \
database. The coloring indicates the associated OMIM phenotype map key.\
OMIM Phenotypes - Gene Unknown\
Regions known to be associated with a phenotype, \
but for which no specific gene is known to be causative. This track \
also includes known multi-gene syndromes.\
\ This track shows the cytogenetic locations of phenotype entries in the Online Mendelian\ Inheritance in Man (OMIM) database for which\ the gene is unknown.\
\ \Cytogenetic locations of OMIM entries are displayed as solid\ blocks. The entries are colored according to the OMIM phenotype map key of associated disorders:\ \
Gene symbols and disease information, when available, are displayed on the details pages.\
\The descriptions of OMIM entries are shown on the main browser display when Full display\ mode is chosen. In Pack mode, the descriptions are shown when mousing over each entry. Items\ displayed can be filtered according to phenotype map key on the track controls page.\
\ \\ This track was constructed as follows: \
\ Thanks to OMIM and NCBI for the use of their data. This track was constructed by Fan Hsu,\ Robert Kuhn, and Brooke Rhead of the UCSC Genome Bioinformatics Group.
\ \Amberger J, Bocchini CA, Scott AF, Hamosh A.\ McKusick's Online Mendelian Inheritance in Man (OMIM®).\ Nucleic Acids Res. 2009 Jan;37(Database issue):D793-6. Epub 2008 Oct 8.\
\\ Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA.\ Online Mendelian Inheritance in Man (OMIM), a knowledgebase of\ human genes and genetic disorders.\ Nucleic Acids Res. 2005 Jan 1;33(Database issue):D514-7.\
\ phenDis 1 color 0, 80, 0\ group phenDis\ hgsid on\ longLabel OMIM Phenotypes - Gene Unknown\ priority 29.75\ shortLabel OMIM Pheno Loci\ tableBrowser off\ track omimLocation\ type bed 4\ url http://www.omim.org/entry/\ visibility hide\ cosmic COSMIC bed 4 COSMIC: Catalogue Of Somatic Mutations In Cancer 0 29.8 200 0 0 227 127 127 0 0 0 http://www.sanger.ac.uk/perl/genetics/CGP/cosmic?action=mut_summary phenDis 1 color 200, 0, 0\ group phenDis\ hgsid on\ longLabel COSMIC: Catalogue Of Somatic Mutations In Cancer\ priority 29.80\ shortLabel COSMIC\ track cosmic\ type bed 4\ url http://www.sanger.ac.uk/perl/genetics/CGP/cosmic?action=mut_summary\ visibility hide\ encodeAffyChIpHl60SitesH3K27me3Hr08 Affy H3K27me3 RA 8h bed 3 . Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 8hrs) Sites 0 30 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 30\ shortLabel Affy H3K27me3 RA 8h\ subGroups factor=H3K27me3 time=8h\ track encodeAffyChIpHl60SitesH3K27me3Hr08\ encodeAffyEc51K562Signal EC51 Sgnl K562 wig 0 62385 Affy Ext Trans Signal (51-base window) (K562) 0 30 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (51-base window) (K562)\ parent encodeAffyEcSignal\ priority 30\ shortLabel EC51 Sgnl K562\ track encodeAffyEc51K562Signal\ encodeAffyEc51K562Sites EC51 Site K562 bed 3 . Affy Ext Trans Sites (51-base window) (K562) 0 30 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (51-base window) (K562)\ parent encodeAffyEcSites\ priority 30\ shortLabel EC51 Site K562\ track encodeAffyEc51K562Sites\ celeraDupPositive WSSD Duplication bed 4 + Sequence Identified as Duplicate by High-Depth Celera Reads 0 30 0 0 0 127 127 127 0 0 0\ High-depth sequence reads from the Celera project were used to \ detect paralogy in the human genome reference sequence.\ This track shows confirmed segmental duplications, defined as having \ similarity to sequences in the Segmental Duplication Database (SDD) of\ greater than 90% over more than 250 bp of repeatmasked sequence.\ For a description of the whole-genome shotgun sequence detection (WSSD) \ "fuguization" method, see Bailey, J.A. et al. (2001) in \ the References section below.
\ \\ The data were provided by \ Xinwei She \ and Evan Eichler as part of their\ efforts to map human paralogy at the \ University of Washington.
\ \\ Bailey, J.A., et al., \ Recent segmental duplications in the human genome. \ Science 297(5583), 945-7 (2002).
\\ Bailey, J.A., et al., \ Segmental duplications: organization and impact within the \ current human genome project assembly, Genome Res. 11(6), \ 1005-17 (2001).
\\ She, X., et al., \ Shotgun sequence assembly and recent segmental duplications \ within the human genome. Nature 431(7011), 927-30 (2004).\
\ map 1 group map\ longLabel Sequence Identified as Duplicate by High-Depth Celera Reads\ priority 30\ shortLabel WSSD Duplication\ track celeraDupPositive\ type bed 4 +\ visibility hide\ encodeYaleAffyNB4TPARNATarsIntergenicProximal Yale Ig Prx NB4 TPA bed 4 . Yale Intergenic Proximal NB4 TPA-Treated TARs 0 30 173 149 91 214 202 173 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 173,149,91\ longLabel Yale Intergenic Proximal NB4 TPA-Treated TARs\ parent encodeNoncodingTransFrags\ priority 30\ shortLabel Yale Ig Prx NB4 TPA\ subGroups region=intergenicProximal celltype=nb4 source=yale\ track encodeYaleAffyNB4TPARNATarsIntergenicProximal\ encodeYaleAffyRNATransMap Yale RNA wig -2730 3394 Yale RNA Transcript Map (Neutrophil, Placenta and NB4 cells) 0 30 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows the transcript map of signal intensity (estimating RNA \ abundance) for the following, hybridized to the Affymetrix ENCODE \ oligonucleotide microarray:\
\ The human NB4 cell can be made to differentiate towards either monocytes (by\ treatment with TPA) or neutrophils (by treatment with RA). See Kluger\ et al., 2004 in the References section for more details about the\ differentiation of hematopoietic cells.
\\ This array has 25-mer oligonucleotide probes tiled \ approximately every 22 bp, covering all the non-repetitive DNA sequence \ of the ENCODE regions. The transcript map is a combined signal for both \ strands of DNA. This is derived from the number of different biological \ samples indicated above, each with at least two technical replicates.
\\ See the following NCBI GEO accessions for details of experimental protocols:\
\ \\ This annotation follows the display conventions for composite \ "wiggle" tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing between the different data samples.
\ \\ The data from technical replicates were median-scaled and quantile-normalized \ to each other. Using a 101 bp sliding window centered on \ each oligonucleotide probe, a signal map (estimating RNA abundance) was\ generated by computing the pseudomedian signal of all PM-MM pairs \ (median of pairwise PM-MM averages) within the window, including \ replicates. Biological replicate signal maps were combined by \ quantile-normalizing them between replicates and computing the median signal \ at each oligonucleotide probe location.
\ \\ Independent biological replicates (as indicated above) were generated,\ and each was hybridized to at least two different arrays\ (technical replicates). Transcribed regions were then identified using a\ signal theshold of 90 percentile of signal intensities, as well as a maximum\ gap of 50 bp and a minimum run of 50 bp (between oligonucleotide positions).\ Transcribed regions, as determined by individual biological samples, were\ compared to ensure significant overlap.
\ \\ This data was generated and analyzed by the Yale/Affymetrix \ collaboration between the labs of Michael Snyder, Mark Gerstein and \ Sherman Weissman at Yale University and Tom Gingeras at Affymetrix.
\ \\ Bertone, P., Stolc, V., Royce, T.E., Rozowsky, J.S., Urban, A.E., Zhu, X., \ Rinn, J.L., Tongprasit, W., Samanta, M. et al.\ Global identification of human transcribed sequences with \ genome tiling arrays. \ Science 306(5705), 2242-6 (2004).
\\ Cheng, J., Kapranov, P., Drenkow, J., Dike, S., Brubaker, S., Patel, S., \ Long, J., Stern, D., Tammana, H. et al.\ Transcriptional maps of 10 human chromosomes at 5-nucleotide \ resolution. \ Science 308(5725), 1149-54 (2005).
\\ Kapranov, P., Cawley, S.E., Drenkow, J., Bekiranov, S., Strausberg, R.L., \ Fodor, S.P. and Gingeras, T.R.\ Large-scale transcriptional activity in chromosomes 21 and \ 22. \ Science 296(5569), 916-9 (2002).
\\ Kluger, Y., Tuck, D.P., Chang, J.T., Nakayama, Y., Poddar, R., Kohya, N., \ Lian, Z., Ben Nasr, A., Halaban, H.R. et al.\ Lineage specificity of gene expression patterns. \ Proc Natl Acad Sci U S A 101(17), 6508-13 (2004).
\\ Rinn, J.L., Euskirchen, G., Bertone, P., Martone, R., Luscombe, N.M., \ Hartman, S., Harrison, P.M., Nelson, F.K., Miller, P. et al.\ The transcriptional activity of human Chromosome 22. \ Genes Dev 17(4), 529-40 (2003).
\ encodeTxLevels 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Yale RNA Transcript Map (Neutrophil, Placenta and NB4 cells)\ maxHeightPixels 128:16:16\ priority 30.0\ shortLabel Yale RNA\ spanList 1\ subGroup1 samples Sample summary=Summary samples=samples1-10\ subGroup2 celltype Cell_Type neutro=Neutrophil plac=Placenta nb4=NB4\ track encodeYaleAffyRNATransMap\ type wig -2730 3394\ viewLimits 0:150\ visibility hide\ windowingFunction mean\ celeraOverlay WSSD Overlay bed 4 + Celera WGS Assembly Overlay on Public Assembly 0 30.1 0 0 0 127 127 127 0 0 0\ This track shows regions detected as overlays of Celera\ whole-genome shotgun sequence assembly on the public human \ assembly.
\ \\ The data were provided by \ Xinwei She \ and Evan Eichler \ as part of their effort to map \ human paralogy at the \ University of Washington.
\ \\ Bailey, J.A., et al., \ Recent segmental duplications in the human genome. \ Science 297(5583), 945-7 (2002).
\\ Bailey, J.A., et al., \ Segmental duplications: organization and impact within the \ current human genome project assembly, Genome Res. 11(6), \ 1005-17 (2001).
\\ She, X., et al., \ Shotgun sequence assembly and recent segmental duplications \ within the human genome. Nature 431(7011), 927-30 (2004).\
\ map 1 group map\ longLabel Celera WGS Assembly Overlay on Public Assembly\ priority 30.1\ shortLabel WSSD Overlay\ track celeraOverlay\ type bed 4 +\ visibility hide\ encodeAffyChIpHl60PvalH3K27me3Hr32 Affy H3K27me3 RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 32hrs) P-Value 0 31 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 31\ shortLabel Affy H3K27me3 RA 32h\ subGroups factor=H3K27me3 time=32h\ track encodeAffyChIpHl60PvalH3K27me3Hr32\ genomicDups Duplications bed 6 + Duplications of >1000 Bases Sequence 0 31 170 0 0 160 150 0 0 0 0 This region was detected as a genomic duplication within the golden path. \ Duplications of 99% or greater similarity, which are likely missed overlaps, \ are shown as red. Duplications of 98% - 99% similarity are shown as yellow. \ Duplications of 90% - 98% similarity are shown as shades of gray. Cut off \ values were at least 1 kb of total sequence aligned (containing at least 500 bp \ non-RepeatMasked sequence) and at least 90% sequence identity. For a \ description of the 'fuguization' detection method see \ Bailey, et al (2001) Genome Res 11:1005-17. \ The data were provided by \ Jeff Bailey \ \ and Evan Eichler.\\ This track shows the locations of transcriptionally active regions \ (TARs)/transcribed fragments (transfrags) for the following, hybridized to\ the Affymetrix ENCODE oligonucleotide microarray:\
\ The human NB4 cell can be made to differentiate towards either monocytes (by\ treatment with TPA) or neutrophils (by treatment with RA). See Kluger\ et al., 2004 in the References section for more details about the\ differentiation of hematopoietic cells.
\\ This array has 25-mer oligonucleotide probes tiled \ approximately every 22 bp, covering all the non-repetitive DNA sequence \ of the ENCODE regions. The transcript map is a combined signal for both \ strands of DNA. This is derived from the number of different biological \ samples indicated above, each with at least two technical replicates.
\\ See the following NCBI GEO accessions for details of experimental protocols:\
\ \\ TARs are represented by blocks in the graphical display. This composite \ annotation track consists of several subtracks that are listed at the top of\ the track description page. To display only selected subtracks, uncheck the \ boxes next to the tracks you wish to hide.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for distinguishing between the different data samples.
\ \\ The data from technical replicates were median-scaled and quantile-normalized \ to each other. Using a 101 bp sliding window centered on \ each oligonucleotide probe, a signal map estimating RNA abundance was\ generated by computing the pseudomedian signal of all PM-MM pairs \ (median of pairwise PM-MM averages) within the window, including \ replicates. Biological replicate signal maps were combined by \ quantile-normalizing them between replicates and computing the median signal \ at each oligonucleotide probe location. Independent biological \ replicates (as described above) were generated, and each was hybridized\ to at least two different arrays (technical replicates). Transcribed regions \ (TARs/transfrags) were then identified using a signal theshold of 90 \ percentile of signal intensities, as well as a maximum gap of 50 bp and \ a minimum run of 50 bp (between oligonucleotide positions).
\ \\ Transcribed regions (TARs/transfrags), as determined by individual \ biological samples, were compared to ensure significant overlap.
\ \\ These data were generated and analyzed by the Yale/Affymetrix \ collaboration between the labs of Michael Snyder, Mark Gerstein and \ Sherman Weissman at Yale University and Tom Gingeras at Affymetrix.
\ \\ Bertone, P., Stolc, V., Royce, T.E., Rozowsky, J.S., Urban, A.E., Zhu, X., \ Rinn, J.L., Tongprasit, W., Samanta, M. et al.\ Global identification of human transcribed sequences with \ genome tiling arrays. \ Science 306(5705), 2242-6 (2004).
\\ Cheng, J., Kapranov, P., Drenkow, J., Dike, S., Brubaker, S., Patel, S., \ Long, J., Stern, D., Tammana, H. et al.\ Transcriptional maps of 10 human chromosomes at 5-nucleotide \ resolution. \ Science 308(5725), 1149-54 (2005).
\\ Kapranov, P., Cawley, S.E., Drenkow, J., Bekiranov, S., Strausberg, R.L., \ Fodor, S.P. and Gingeras, T.R.\ Large-scale transcriptional activity in chromosomes 21 and \ 22. \ Science 296(5569), 916-9 (2002).
\\ Kluger, Y., Tuck, D.P., Chang, J.T., Nakayama, Y., Poddar, R., Kohya, N., \ Lian, Z., Ben Nasr, A., Halaban, H.R. et al.\ Lineage specificity of gene expression patterns. \ Proc Natl Acad Sci U S A 101(17), 6508-13 (2004).
\\ Rinn, J.L., Euskirchen, G., Bertone, P., Martone, R., Luscombe, N.M., \ Hartman, S., Harrison, P.M., Nelson, F.K., Miller, P. et al.\ The transcriptional activity of human Chromosome 22. \ Genes Dev 17(4), 529-40 (2003).
\ encodeTxLevels 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Yale RNA Transcriptionally Active Regions (TARs)\ priority 31.0\ shortLabel Yale TAR\ subGroup1 samples Sample summary=Summary samples=samples1-10\ subGroup2 celltype Cell_Type neutro=Neutrophil plac=Placenta nb4=NB4\ track encodeYaleAffyRNATars\ type bed 3 .\ visibility hide\ encodeAffyChIpHl60SitesH3K27me3Hr32 Affy H3K27me3 RA 32h bed 3 . Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 32hrs) Sites 0 32 150 75 0 202 165 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 150,75,0\ longLabel Affymetrix ChIP/Chip (H3K27me3 retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 32\ shortLabel Affy H3K27me3 RA 32h\ subGroups factor=H3K27me3 time=32h\ track encodeAffyChIpHl60SitesH3K27me3Hr32\ dupes Duplications bed 6 . Duplications of >98% Identity >1kb 1 32 0 0 0 127 127 127 0 0 0 map 1 group map\ longLabel Duplications of >98% Identity >1kb\ priority 32\ shortLabel Duplications\ track dupes\ type bed 6 .\ visibility dense\ encodeAffyEc51TertBJSignal EC51 Sgnl TertBJ wig 0 62385 Affy Ext Trans Signal (51-base window) (Tert-BJ) 0 32 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 0 color 0,0,205\ longLabel Affy Ext Trans Signal (51-base window) (Tert-BJ)\ parent encodeAffyEcSignal\ priority 32\ shortLabel EC51 Sgnl TertBJ\ track encodeAffyEc51TertBJSignal\ encodeAffyEc51TertBJSites EC51 Site TertBJ bed 3 . Affy Ext Trans Sites (51-base window) (Tert-BJ) 0 32 0 0 205 127 127 230 0 0 2 chr21,chr22, encodeTxLevels 1 color 0,0,205\ longLabel Affy Ext Trans Sites (51-base window) (Tert-BJ)\ parent encodeAffyEcSites\ priority 32\ shortLabel EC51 Site TertBJ\ track encodeAffyEc51TertBJSites\ encodeYaleAffyNeutRNATarsAllIntergenicProximal Yale Ig Prx Neu bed 4 . Yale Intergenic Proximal Neutrophil TARs 0 32 185 137 115 220 196 185 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 185,137,115\ longLabel Yale Intergenic Proximal Neutrophil TARs\ parent encodeNoncodingTransFrags\ priority 32\ shortLabel Yale Ig Prx Neu\ subGroups region=intergenicProximal celltype=neut source=yale\ track encodeYaleAffyNeutRNATarsAllIntergenicProximal\ Nregion N Regions bed 4 . N Regions 0 32.5 150 100 30 202 177 142 0 0 0\ This track displays contiguous Ns of 1000 or more.\
\ Bolstad BM, Irizarry RA, Astrand M, Speed TP. \ A comparison of normalization methods for high density oligonucleotide\ array data based on variance and bias. \ Bioinformatics. 2003 Jan 22;19(2):185-93.\
\ Cawley S, Bekiranov S, Ng HH, Kapranov P, Sekinger EA,\ Kampa D, Piccolboni A, Sementchenko V, Cheng J, Williams AJ et al.\ Unbiased mapping of transcription factor \ binding sites along human chromosomes 21 and 22 points to widespread \ regulation of noncoding RNAs. Cell. 2004 Feb 20;116(4):499-509.\
\ Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP, \ Gingeras TR. \ Large-scale transcriptional activity in chromosomes 21 and 22. \ Science. 2002 May 3;296(5569):916-9.\ encodeTxLevels 0 chromosomes chr21,chr22\ group encodeTxLevels\ longLabel Affymetrix ENCODE Extension Transcription\ priority 34.0\ shortLabel Affy EC\ superTrack on\ track encodeAffyEcSuper\ encodeAffyChIpHl60SitesH4Kac4Hr00 Affy H4Kac4 RA 0h bed 3 . Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 0hrs) Sites 0 34 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 34\ shortLabel Affy H4Kac4 RA 0h\ subGroups factor=H4Kac4 time=0h\ track encodeAffyChIpHl60SitesH4Kac4Hr00\ encodeAffyRnaGm06990SitesIntergenicDistal Affy Ig Dst GM06990 bed 4 . Affymetrix Intergenic Distal GM06990 Transfrags 0 34 0 0 255 127 127 255 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 0,0,255\ longLabel Affymetrix Intergenic Distal GM06990 Transfrags\ parent encodeNoncodingTransFrags\ priority 34\ shortLabel Affy Ig Dst GM06990\ subGroups region=intergenicDistal celltype=gm06990 source=affy\ track encodeAffyRnaGm06990SitesIntergenicDistal\ knownGene Known Genes genePred knownGenePep knownGeneMrna Known Genes (March 04) Based on SWISS-PROT, TrEMBL, mRNA, and RefSeq 0 34 12 12 120 133 133 187 0 0 0
\ The UCSC Known Genes track shows known protein-coding genes based on \ protein data from SWISS-PROT, TrEMBL, and TrEMBL-NEW and their\ corresponding mRNAs from \ GenBank.
\ \\ This track follows the display conventions for\ gene prediction\ tracks. Black coloring indicates features that have corresponding entries\ in the Protein Databank (PDB). Blue indicates features associated with\ mRNAs from NCBI RefSeq or (dark blue) items having associated proteins in\ the SWISS-PROT database. The variation in blue shading of RefSeq items\ corresponds to the level of review the RefSeq record has undergone:\ predicted (light), provisional (medium), or reviewed (dark).
\\ This track contains an optional codon coloring\ feature that allows users to quickly validate and compare gene predictions.\ To display codon colors, select the genomic codons option from the\ Color track by codons pull-down menu. Click\ here for more\ information about this feature.
\ \\ mRNA sequences were aligned against the human genome using blat. When a \ single mRNA aligned in multiple places, only alignments having at least 98% \ base identity with the genomic sequence were kept. This set of mRNA \ alignments was further reduced by keeping only those mRNAs referenced by a \ protein in SWISS-PROT, TrEMBL, or TrEMBL-NEW.
\\ Among multiple mRNAs referenced by a single protein, the best mRNA was \ selected, based on a quality score derived from its length, the level of the\ match between its translation and the protein sequence, and its release date.\ The resulting mRNA and protein pairs were further filtered by removing \ short invalid entries and consolidating entries with identical CDS regions.\
\\ Finally, RefSeq entries derived from DNA sequences instead of \ mRNA sequences were added to produce the final data set shown in this track. \ Disease annotations were obtained from SWISS-PROT.
\ \\ The Known Genes track was produced at UCSC based primarily on cross-references\ between proteins from \ SWISS-PROT \ (including TrEMBL and TrEMBL-NEW) and mRNAs from \ GenBank\ contributed by scientists worldwide. \ NCBI RefSeq \ data were also included in this track.
\ \\ The UniProt data have the following terms of use, UniProt copyright(c) 2002 - \ 2004 UniProt consortium:
\\ For non-commercial use, all databases and documents in the UniProt FTP\ directory may be copied and redistributed freely, without advance\ permission, provided that this copyright statement is reproduced with\ each copy.
\\ For commercial use, all databases and documents in the UniProt FTP\ directory except the files\
\ From January 1, 2005, all databases and documents in the UniProt FTP\ directory may be copied and redistributed freely by all entities,\ without advance permission, provided that this copyright statement is\ reproduced with each copy.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J,\ Wheeler DL.\ GenBank: update.\ Nucleic Acids Res. 2004 Jan 1;32:D23-6.
\\ Hsu F, Kent WJ, Clawson H, Kuhn RM, Diekhans M, Haussler D.\ The UCSC Known Genes.\ Bioinformatics. 2006 May 1;22(9):1036-46.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ genes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ color 12,12,120\ defaultLinkedTables kgXref\ directUrl /cgi-bin/hgGene?hgg_gene=%s&hgg_chrom=%s&hgg_start=%d&hgg_end=%d&hgg_type=%s&db=%s\ group genes\ hgGene on\ hgsid on\ idXref kgAlias kgID alias\ intronGap 12\ longLabel Known Genes (March 04) Based on SWISS-PROT, TrEMBL, mRNA, and RefSeq\ priority 34\ shortLabel Known Genes\ track knownGene\ type genePred knownGenePep knownGeneMrna\ visibility hide\ encodeAffyEcSites Affy EC Sites bed 3 . Affymetrix ENCODE Extension Transcription Sites 0 34.1 0 0 0 127 127 127 0 0 2 chr21,chr22,\ Data for all biological replicates can be \ downloaded from Affymetrix in wig, BED, and cel formats.\ \
\
\ Bolstad BM, Irizarry RA, Astrand M, Speed TP.\ \ A comparison of normalization methods for high density oligonucleotide\ array data based on variance and bias.\ Bioinformatics. 2003 Jan 22;19(2):185-93.\
\ Cawley S, Bekiranov S, Ng HH, Kapranov P, Sekinger EA,\ Kampa D, Piccolboni A, Sementchenko V, Cheng J, Williams AJ et al.\ Unbiased mapping of transcription factor \ binding sites along human chromosomes 21 and 22 points to widespread \ regulation of noncoding RNAs. Cell. 2004 Feb 20;116(4):499-509.\
\ Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP,\ Gingeras TR. \ Large-scale transcriptional activity in chromosomes 21 and 22.\ Science. 2002 May 3;296(5569):916-9.\ encodeTxLevels 1 chromosomes chr21,chr22\ compositeTrack on\ group encodeTxLevels\ longLabel Affymetrix ENCODE Extension Transcription Sites\ origAssembly hg17\ priority 34.1\ shortLabel Affy EC Sites\ superTrack encodeAffyEcSuper dense\ track encodeAffyEcSites\ type bed 3 .\ visibility hide\ hg17Kg Known Genes II genePred UCSC Known Genes II (June 05, Based on hg17 UCSC Known Genes) 3 34.1 12 12 120 133 133 187 0 0 0 http://genome.ucsc.edu/cgi-bin/hgGene?db=hg17&hgg_gene=$$&hgg_chrom=none&hgg_type=knownGene
\ The Known Genes II track was built based on UCSC Known Genes data set of hg17 (Human May 2004 Assembly).\ Clicking the "Outside Link" entry above will bring you to the gene details page of hg17 (Human May 2004 Assembly). The original "Known Genes" track of hg16 \ (built in March, 2004) \ is somewhat outdated, but still available.\
The hg17 UCSC Known Genes was built by a new process, KG II, \ as described below.\
\ UniProt protein sequences (including alternative splicing isoforms) \ and mRNA sequences from RefSeq and GenBank \ were aligned against the base genome using BLAT. \ RefSeq alignments having a base identity level within 0.1% of the best \ and at least 96% base identity with the genomic sequence were kept. \ GenBank mRNA alignments having a base identity level within 0.2% of \ the best and at least 97% base identity with the genomic sequence were kept. \ Protein alignments having a base identity level within 0.2% of the best and \ at least 80% base identity with the genomic sequence were kept.\
Then the genomic mRNA and protein alignments were compared, \ and protein-mRNA pairings were determined from their overlaps. \ mRNA CDS data were obtained from RefSeq and GenBank data \ and supplemented by CDS structures derived from UCSC protein-mRNA BLAT alignments. \ The initial set of UCSC Known Genes candidates consists of \ all protein-mRNA pairs with valid mRNA CDS structures. \ A gene-check program (similar to the one used for the Consensus CDS (CCDS) project) \ is used to remove questionable candidates, such as those with in-frame stop \ codons, missing start or stop codons, etc.\
From each group of gene candidates that share the same CDS structure, \ the protein-mRNA pair having the best ranking and protein-mRNA alignment score \ is selected as a UCSC Known Gene. \ The ranking of a gene candidate depends on its gene-check quality measures. \ When all else is equal, \ a preference is given to RefSeq mRNAs and next to MGC mRNAs. \ Similarly, preference is given to gene candidates represented by Swiss-Prot \ proteins. \ The protein-mRNA alignment score is calculated based on a protein-to-mRNA \ alignment using TBLASTN, plus weighted sub-scores according \ to the date and length of the mRNA. \
\ The UCSC Known Genes track was produced using protein data from \ UniProt and mRNA \ data from NCBI \ RefSeq\ and GenBank.
\ \\ The UniProt data have the following terms of use, UniProt copyright(c) 2002 - \ 2004 UniProt consortium:
\\ For non-commercial use, all databases and documents in the UniProt FTP\ directory may be copied and redistributed freely, without advance\ permission, provided that this copyright statement is reproduced with\ each copy.
\\ For commercial use, all databases and documents in the UniProt FTP\ directory except the files\
\ From January 1, 2005, all databases and documents in the UniProt FTP\ directory may be copied and redistributed freely by all entities,\ without advance permission, provided that this copyright statement is\ reproduced with each copy.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, \ Wheeler DL. \ GenBank: update. Nucleic Acids Res. \ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ \ genes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ color 12,12,120\ group genes\ longLabel UCSC Known Genes II (June 05, Based on hg17 UCSC Known Genes)\ priority 34.1\ shortLabel Known Genes II\ track hg17Kg\ type genePred\ url http://genome.ucsc.edu/cgi-bin/hgGene?db=hg17&hgg_gene=$$&hgg_chrom=none&hgg_type=knownGene\ visibility pack\ encodeAffyEcSignal Affy EC Signal wig 0 62385 Affymetrix ENCODE Extension Transcription Signal 0 34.2 0 0 0 127 127 127 0 0 2 chr21,chr22,\ Data for all biological replicates can be \ downloaded from Affymetrix in wig, BED, and cel formats.\ \
\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. A\ comparison of normalization methods for high density oligonucleotide\ array data based on variance and bias. Bioinformatics 19(2),\ 185-193 (2003).\
\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger, E. A.,\ Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J., Williams,\ A. J., et al. Unbiased mapping of transcription factor binding sites along human \ chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell \ 116(4), 499-509 (2004).\
\ Kapranov, P., Cawley, S. E., Drenkow, J., Bekiranov, S., Strausberg,\ R. L., Fodor, S. P., and Gingeras, T. R. Large-scale transcriptional activity in chromosomes 21 and \ 22. Science 296(5569), 916-919 (2002). \ encodeTxLevels 0 autoScale Off\ chromosomes chr21,chr22\ compositeTrack on\ group encodeTxLevels\ longLabel Affymetrix ENCODE Extension Transcription Signal\ maxHeightPixels 100:30:10\ origAssembly hg17\ priority 34.2\ shortLabel Affy EC Signal\ superTrack encodeAffyEcSuper dense\ track encodeAffyEcSignal\ type wig 0 62385\ viewLimits 0:5000\ visibility hide\ knownAlt Alt Events bed 6 . Alternative Splicing, Alternative Promoter and Similar Events in UCSC Genes 0 34.2 90 0 150 172 127 202 0 0 0
This track shows various types of alternative splicing and other\ events that result in more than a single transcript from the same\ gene. The label by an item describes the type of event. The events are:
\This track is based on an analysis by the txgAnalyse program of splicing graphs\ produced by the txGraph program. Both of these programs were written by Jim\ Kent at UCSC.
\ genes 1 color 90,0,150\ group genes\ longLabel Alternative Splicing, Alternative Promoter and Similar Events in UCSC Genes\ noScoreFilter .\ priority 34.2\ shortLabel Alt Events\ track knownAlt\ type bed 6 .\ visibility hide\ ccdsGene CCDS genePred Consensus CDS 0 34.5 12 120 12 133 187 133 0 0 0\ This track shows human genome high-confidence gene annotations from the\ Consensus \ Coding Sequence (CCDS) project. This project is a collaborative effort \ to identify a core set of \ human protein-coding regions that are consistently annotated and of high \ quality. The long-term goal is to support convergence towards a standard set \ of gene annotations on the human genome.\
\Collaborators include:\
\ CDS annotations of the human genome were obtained from two sources:\ NCBI \ RefSeq and a union of the gene annotations from \ Ensembl and \ Vega, collectively known \ as Hinxton.
\\ Genes with identical CDS genomic coordinates in both sets become CCDS \ candidates. The genes undergo a quality evaluation, which must be approved by \ all collaborators. The following criteria are currently used to assess each\ gene: \
\ A unique CCDS ID is assigned to the CCDS, which links together all gene \ annotations with the same CDS. CCDS gene annotations are under continuous\ review, with periodic updates to this track.\
\ \\ This track was produced at UCSC from data downloaded from the\ CCDS project \ web site.\
\ \\ Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, Maglott DR, Searle S, \ Farrell CM, Loveland JE, Ruef BJ et al. \ The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes. \ Genome Res. 2009 Jun 4. [Epub ahead of print]\
\ Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T, et al.\ The Ensembl genome database project. \ Nucl. Acids Res. 2002 Jan 1;30(1):38-41.
\\ Pruitt KD, Tatusova T, Maglott DR.\ NCBI Reference Sequence (RefSeq): a curated non-redundant \ sequence database of genomes, transcripts and proteins. \ Nucl. Acids Res. 2005 Jan 1;33(Database Issue):D501-D504. \
\ genes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ color 12,120,12\ group genes\ longLabel Consensus CDS\ priority 34.5\ shortLabel CCDS\ track ccdsGene\ type genePred\ visibility hide\ interPro InterPro bed 4 InterPro Domains 0 34.6 12 12 120 133 133 187 0 0 0\ Description of InterPro goes here.\ \
\ Methods goes here.\
\ Credits goes here.\ \ genes 1 color 12,12,120\ group genes\ longLabel InterPro Domains\ priority 34.6\ shortLabel InterPro\ track interPro\ type bed 4\ visibility hide\ encodeAffyChIpHl60PvalH4Kac4Hr02 Affy H4Kac4 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 2hrs) P-Value 0 35 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 35\ shortLabel Affy H4Kac4 RA 2h\ subGroups factor=H4Kac4 time=2h\ track encodeAffyChIpHl60PvalH4Kac4Hr02\ encodeAffyRnaHeLaSitesIntergenicDistal Affy Ig Dst HeLa bed 4 . Affymetrix Intergenic Distal HeLa Transfrags 0 35 5 0 250 130 127 252 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 5,0,250\ longLabel Affymetrix Intergenic Distal HeLa Transfrags\ parent encodeNoncodingTransFrags\ priority 35\ shortLabel Affy Ig Dst HeLa\ subGroups region=intergenicDistal celltype=hela source=affy\ track encodeAffyRnaHeLaSitesIntergenicDistal\ refGene RefSeq Genes genePred refPep refMrna RefSeq Genes 1 35 12 12 120 133 133 187 0 0 0
\ The RefSeq Genes track shows known human protein-coding and \ non-protein-coding genes taken from the NCBI RNA reference sequences \ collection (RefSeq). The data underlying this track are updated daily.
\ \\ This track follows the display conventions for \ \ gene prediction tracks.\ The color shading indicates the level of review the RefSeq record has \ undergone: predicted (light), provisional (medium), reviewed (dark).
\\ The item labels and display colors of features within this track can be\ configured through the controls at the top of the track description page. \ This page is accessed via the small button to the left of the track's \ graphical display or through the link on the track's control menu. \
\ RefSeq RNAs were aligned against the human genome using blat; those\ with an alignment of less than 15% were discarded. When a single RNA \ aligned in multiple places, the alignment having the highest base identity \ was identified. Only alignments having a base identity level within 0.1% of \ the best and at least 96% base identity with the genomic sequence were kept.\
\ \ \\ This track was produced at UCSC from RNA sequence data\ generated by scientists worldwide and curated by the \ NCBI RefSeq project.
\ \\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ \Pruitt KD, Tatusova T, Maglott DR. \ NCBI Reference Sequence (RefSeq): a curated non-redundant \ sequence database of genomes, transcripts and proteins. Nucleic Acids \ Res. 2005 Jan 1;33(Database issue):D501-4.\
\ genes 1 baseColorUseCds given\ color 12,12,120\ group genes\ idXref refLink mrnaAcc name\ longLabel RefSeq Genes\ priority 35\ shortLabel RefSeq Genes\ track refGene\ type genePred refPep refMrna\ visibility dense\ xenoRefGene Other RefSeq genePred xenoRefPep xenoRefMrna Non-Human RefSeq Genes 0 35.1 12 12 120 133 133 187 0 0 0\ This track shows known protein-coding and non-protein-coding genes \ for organisms other than human, taken from the NCBI RNA reference \ sequences collection (RefSeq). The data underlying this track are \ updated daily.
\ \\ This track follows the display conventions for \ gene prediction \ tracks.\ The color shading indicates the level of review the RefSeq record has \ undergone: predicted (light), provisional (medium), reviewed (dark).
\\ The item labels and display colors of features within this track can be\ configured through the controls at the top of the track description page. \
\ The RNAs were aligned against the human genome using blat; those\ with an alignment of less than 15% were discarded. When a single RNA aligned \ in multiple places, the alignment having the highest base identity was \ identified. Only alignments having a base identity level within 0.5% of \ the best and at least 25% base identity with the genomic sequence were kept.\
\ \\ This track was produced at UCSC from RNA sequence data\ generated by scientists worldwide and curated by the \ NCBI RefSeq project.
\ \\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ genes 1 color 12,12,120\ group genes\ longLabel Non-$Organism RefSeq Genes\ priority 35.1\ shortLabel Other RefSeq\ track xenoRefGene\ type genePred xenoRefPep xenoRefMrna\ visibility hide\ rgdGene RGD Genes genePred Rat Genome Database Curated Genes 1 35.5 12 12 120 133 133 187 0 0 0 http://rgd.mcw.edu/generalSearch/RgdSearch.jsp?quickSearch=1&searchKeyword=\ This track shows RefSeq genes curated by the Rat Genome Database (RGD).\ Coding exons are represented by \ blocks connected by horizontal lines representing introns. The 5' and 3' \ untranslated regions (UTRs) are displayed as thinner blocks on the leading \ and trailing ends of the aligning regions. In full display mode, arrowheads \ on the connecting intron lines indicate the direction of transcription.
\ \\ The annotation data file, \ RGD_curated_genes.gff, was downloaded from the RGD website\ and processed to create this track.
\ \\ Thanks to the RGD for \ providing this annotation. RGD is funded by grant HL64541 entitled "Rat \ Genome Database", awarded to Dr. Howard J Jacob, Medical College of \ Wisconsin, from the National Heart Lung and Blood Institute \ (NHLBI) of the National \ Institutes of Health (NIH).\
\ genes 1 color 12,12,120\ group genes\ longLabel Rat Genome Database Curated Genes\ priority 35.5\ shortLabel RGD Genes\ track rgdGene\ type genePred\ url http://rgd.mcw.edu/generalSearch/RgdSearch.jsp?quickSearch=1&searchKeyword=\ visibility dense\ encodeAffyChIpHl60SitesH4Kac4Hr02 Affy H4Kac4 RA 2h bed 3 . Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 2hrs) Sites 0 36 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 36\ shortLabel Affy H4Kac4 RA 2h\ subGroups factor=H4Kac4 time=2h\ track encodeAffyChIpHl60SitesH4Kac4Hr02\ encodeAffyRnaHl60SitesHr00IntergenicDistal Affy Ig Dst HL60 0h bed 4 . Affymetrix Intergenic Distal HL60 Transfrags 0 36 30 0 225 142 127 240 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 30,0,225\ longLabel Affymetrix Intergenic Distal HL60 Transfrags\ parent encodeNoncodingTransFrags\ priority 36\ shortLabel Affy Ig Dst HL60 0h\ subGroups region=intergenicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr00IntergenicDistal\ mgcFullMrna MGC Genes psl Mammalian Gene Collection Full ORF mRNAs 0 36 0 100 0 127 177 127 0 0 0\ This track shows alignments of human mRNAs from the\ Mammalian Gene Collection \ (MGC) having full-length open reading frames (ORFs) to the genome.\ The goal of the Mammalian Gene Collection is to provide researchers with\ unrestricted access to sequence-validated full-length protein-coding cDNA\ clones for human, mouse, and rat genes.
\ \\ The track follows the display conventions for \ gene prediction \ tracks.
\\ An optional codon coloring feature is available for quick\ validation and comparison of gene predictions.\ To display codon colors, select the genomic codons option from the\ Color track by codons pull-down menu. For more information\ about this feature, go to the \ \ Coloring Gene Predictions and Annotations by Codon page.
\ \\ GenBank human MGC mRNAs identified as having full-length ORFs \ were aligned against the genome using blat. When a single mRNA \ aligned in multiple places, the alignment having the highest base identity was\ found. Only alignments having a base identity level within 1% of\ the best and at least 95% base identity with the genomic sequence \ were kept.
\ \\ The human MGC full-length mRNA track was produced at UCSC from \ mRNA sequence data submitted to \ GenBank by the Mammalian Gene Collection project.
\ \\ Mammalian Gene Collection project references.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ genes 1 baseColorDefault diffCodons\ baseColorUseCds genbank\ baseColorUseSequence genbank\ color 0,100,0\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Mammalian Gene Collection Full ORF mRNAs\ priority 36\ shortLabel MGC Genes\ showCdsAllScales .\ showCdsMaxZoom 10000.0\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ track mgcFullMrna\ type psl\ visibility hide\ orfeomeMrna ORFeome Clones psl ORFeome Collaboration Gene Clones 0 36.1 34 139 34 144 197 144 0 0 0\ This track shows alignments of human clones from the\ ORFeome Collaboration. The project goal is to be an\ "unrestricted source of fully sequence-validated full-ORF human cDNA \ clones in a format allowing easy transfer of the ORF sequences into \ virtually any type of expression vector. A major goal is to provide \ at least one fully sequenced full-ORF clone for each human gene."\ This track is updated automatically as new clones become available.\
\ \\ The track follows the display conventions for \ gene prediction \ tracks.
\ \\ ORFeome human clones were obtained from GenBank and aligned against the\ genome using the blat program. When a single clone aligned in multiple \ places,\ the alignment having the highest base identity was found. Only alignments\ having a base identity level within 0.5% of the best and at least 96% base\ identity with the genomic sequence were kept.\
\ \\ Visit the ORFeome Collaboration \ members page for a list of credits and references.\
\ genes 1 baseColorDefault diffCodons\ baseColorUseCds genbank\ baseColorUseSequence genbank\ color 34,139,34\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel ORFeome Collaboration Gene Clones\ priority 36.1\ shortLabel ORFeome Clones\ showCdsAllScales .\ showCdsMaxZoom 10000.0\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ track orfeomeMrna\ type psl\ visibility hide\ encodeAffyChIpHl60PvalH4Kac4Hr08 Affy H4Kac4 RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 8hrs) P-Value 0 37 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 37\ shortLabel Affy H4Kac4 RA 8h\ subGroups factor=H4Kac4 time=8h\ track encodeAffyChIpHl60PvalH4Kac4Hr08\ encodeAffyRnaHl60SitesHr02IntergenicDistal Affy Ig Dst HL60 2h bed 4 . Affymetrix Intergenic Distal HL60 Retinoic 2hr Transfrags 0 37 55 0 200 155 127 227 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 55,0,200\ longLabel Affymetrix Intergenic Distal HL60 Retinoic 2hr Transfrags\ parent encodeNoncodingTransFrags\ priority 37\ shortLabel Affy Ig Dst HL60 2h\ subGroups region=intergenicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr02IntergenicDistal\ protBlat Protein BLAT psl protein Protein Blatted Against Genome 0 37 0 100 0 255 240 200 0 0 0 genes 1 altColor 255,240,200\ color 0,100,0\ group genes\ longLabel Protein Blatted Against Genome\ priority 37\ shortLabel Protein BLAT\ track protBlat\ type psl protein\ visibility hide\ transMap TransMap TransMap Alignments 0 37.001 0 0 0 127 127 127 0 0 0\ These tracks contain cDNA and gene alignments produced by\ the TransMap cross-species alignment algorithm\ from other vertebrate species in the UCSC Genome Browser.\ For closer evolutionary distances, the alignments are created using\ syntenically filtered BLASTZ alignment chains, resulting in a prediction of the\ orthologous genes in human.\
\ \ TransMap maps genes and related annotations in one species to another\ using synteny-filtered pairwise genome alignments (chains and nets) to\ determine the most likely orthologs. For example, for the mRNA TransMap track\ on the human assembly, more than 400,000 mRNAs from 25 vertebrate species were\ aligned at high stringency to the native assembly using BLAT. The alignments\ were then mapped to the human assembly using the chain and net alignments\ produced using blastz, which has higher sensitivity than BLAT for diverged\ organisms.\\ Compared to translated BLAT, TransMap finds fewer paralogs and aligns more UTR\ bases. For closely related low-coverage assemblies, a reciprocal-best\ relationship is used in the chains and nets to improve the synteny prediction.\
\ \ \\ This track follows the display conventions for \ PSL alignment tracks.
\\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare cDNAs against the genomic sequence. For more \ information about this option, click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\ \ \
\
\ To ensure unique identifiers for each alignment, cDNA and gene accessions were\ made unique by appending a suffix for each location in the source genome and\ again for each mapped location in the destination genome. The format is:\
\ accession.version-srcUniq.destUniq\\ \ Where srcUniq is a number added to make each source alignment unique, and\ destUniq is added to give the subsequent TransMap alignments unique\ identifiers.\ \
\ For example, in the cow genome, there are two alignments of mRNA BC149621.1.\ These are assigned the identifiers BC149621.1-1 and BC149621.1-2.\ When these are mapped to the human genome, BC149621.1-1 maps to a single\ location and is given the identifier BC149621.1-1.1. However, BC149621.1-2\ maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note\ that multiple TransMap mappings are usually the result of tandem duplications, where both\ chains are identified as syntenic.\
\ \ \\ This track was produced by Mark Diekhans at UCSC from cDNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. \ Comparative genomics search for losses of long-established genes \ on the human lineage. \ PLoS Comput Biol. 2007 Dec;3(12):e247.
\\ Stanke M, Diekhans M, Baertsch R, Haussler D.\ Using native and syntenically mapped cDNA alignments to improve \ de novo gene finding.\ Bioinformatics. 2008 Mar 1;24(5):637-44.
\\ Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, \ Ewing B, Oommen S, Lau C et al.\ Targeted discovery of novel human exons by comparative \ genomics.\ Genome Res. 2007 Dec;17(12):1763-73.
\ \ genes 0 group genes\ longLabel TransMap Alignments\ priority 37.001\ shortLabel TransMap\ superTrack on\ track transMap\ transMapAlnUcscGenes TransMap UCSC psl TransMap UCSC Gene Mappings 3 37.002 0 100 0 127 177 127 0 0 0\ This track contains UCSC Gene alignments produced by\ the TransMap cross-species alignment algorithm\ from other vertebrate species in the UCSC Genome Browser.\ For closer evolutionary distances, the alignments are created using\ syntenically filtered BLASTZ alignment chains, resulting in a prediction of the\ orthologous genes in human.\
\ \ \ \ TransMap maps genes and related annotations in one species to another\ using synteny-filtered pairwise genome alignments (chains and nets) to\ determine the most likely orthologs. For example, for the mRNA TransMap track\ on the human assembly, more than 400,000 mRNAs from 23 vertebrate species were\ aligned at high stringency to the native assembly using BLAT. The alignments\ were then mapped to the human assembly using the chain and net alignments\ produced using blastz, which has higher sensitivity than BLAT for diverged\ organisms.\\ Compared to translated BLAT, TransMap finds fewer paralogs and aligns more UTR\ bases. For closely related low-coverage assemblies, a reciprocal-best\ relationship is used in the chains and nets to improve the synteny prediction.\
\ \
\ This track follows the display conventions for \ PSL alignment tracks.
\\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare cDNAs against the genomic sequence. For more \ information about this option, click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\ \ \
\
\ To ensure unique identifiers for each alignment, cDNA and gene accessions were\ made unique by appending a suffix for each location in the source genome and\ again for each mapped location in the destination genome. The format is:\
\ accession.version-srcUniq.destUniq\\ \ Where srcUniq is a number added to make each source alignment unique, and\ destUniq is added to give the subsequent TransMap alignments unique\ identifiers.\ \
\ For example, in the cow genome, there are two alignments of mRNA BC149621.1.\ These are assigned the identifiers BC149621.1-1 and BC149621.1-2.\ When these are mapped to the human genome, BC149621.1-1 maps to a single\ location and is given the identifier BC149621.1-1.1. However, BC149621.1-2\ maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note\ that multiple TransMap mappings are usually the result of tandem duplications, where both\ chains are identified as syntenic.\
\ \ \\ This track was produced by Mark Diekhans at UCSC from cDNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. \ Comparative genomics search for losses of long-established genes \ on the human lineage. \ PLoS Comput Biol. 2007 Dec;3(12):e247.
\\ Stanke M, Diekhans M, Baertsch R, Haussler D.\ Using native and syntenically mapped cDNA alignments to improve \ de novo gene finding.\ Bioinformatics. 2008 Mar 1;24(5):637-44.
\\ Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, \ Ewing B, Oommen S, Lau C et al.\ Targeted discovery of novel human exons by comparative \ genomics.\ Genome Res. 2007 Dec;17(12):1763-73.
\ \ genes 1 baseColorDefault diffCodons\ baseColorUseCds table hgFixed.transMapGeneUcscGenes\ baseColorUseSequence extFile hgFixed.transMapSeqUcscGenes hgFixed.transMapExtFileUcscGenes\ color 0,100,0\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel TransMap UCSC Gene Mappings\ priority 37.002\ shortLabel TransMap UCSC\ showCdsAllScales .\ showCdsMaxZoom 10000.0\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ superTrack transMap pack\ track transMapAlnUcscGenes\ transMapGene hgFixed.transMapGeneUcscGenes\ transMapInfo transMapInfoUcscGenes\ transMapSrc hgFixed.transMapSrcUcscGenes\ transMapTypeDesc UCSC Gene\ type psl\ visibility pack\ transMapAlnRefSeq TransMap RefGene psl TransMap RefSeq Gene Mappings 3 37.003 0 100 0 127 177 127 0 0 0\ This track contains RefSeq Gene alignments produced by\ the TransMap cross-species alignment algorithm\ from other vertebrate species in the UCSC Genome Browser.\ For closer evolutionary distances, the alignments are created using\ syntenically filtered BLASTZ alignment chains, resulting in a prediction of the\ orthologous genes in human.\
\ \ \ \ TransMap maps genes and related annotations in one species to another\ using synteny-filtered pairwise genome alignments (chains and nets) to\ determine the most likely orthologs. For example, for the mRNA TransMap track\ on the human assembly, more than 400,000 mRNAs from 23 vertebrate species were\ aligned at high stringency to the native assembly using BLAT. The alignments\ were then mapped to the human assembly using the chain and net alignments\ produced using blastz, which has higher sensitivity than BLAT for diverged\ organisms.\\ Compared to translated BLAT, TransMap finds fewer paralogs and aligns more UTR\ bases. For closely related low-coverage assemblies, a reciprocal-best\ relationship is used in the chains and nets to improve the synteny prediction.\
\ \
\ This track follows the display conventions for \ PSL alignment tracks.
\\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare cDNAs against the genomic sequence. For more \ information about this option, click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\ \ \
\
\ To ensure unique identifiers for each alignment, cDNA and gene accessions were\ made unique by appending a suffix for each location in the source genome and\ again for each mapped location in the destination genome. The format is:\
\ accession.version-srcUniq.destUniq\\ \ Where srcUniq is a number added to make each source alignment unique, and\ destUniq is added to give the subsequent TransMap alignments unique\ identifiers.\ \
\ For example, in the cow genome, there are two alignments of mRNA BC149621.1.\ These are assigned the identifiers BC149621.1-1 and BC149621.1-2.\ When these are mapped to the human genome, BC149621.1-1 maps to a single\ location and is given the identifier BC149621.1-1.1. However, BC149621.1-2\ maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note\ that multiple TransMap mappings are usually the result of tandem duplications, where both\ chains are identified as syntenic.\
\ \ \\ This track was produced by Mark Diekhans at UCSC from cDNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. \ Comparative genomics search for losses of long-established genes \ on the human lineage. \ PLoS Comput Biol. 2007 Dec;3(12):e247.
\\ Stanke M, Diekhans M, Baertsch R, Haussler D.\ Using native and syntenically mapped cDNA alignments to improve \ de novo gene finding.\ Bioinformatics. 2008 Mar 1;24(5):637-44.
\\ Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, \ Ewing B, Oommen S, Lau C et al.\ Targeted discovery of novel human exons by comparative \ genomics.\ Genome Res. 2007 Dec;17(12):1763-73.
\ \ genes 1 baseColorDefault diffCodons\ baseColorUseCds table hgFixed.transMapGeneRefSeq\ baseColorUseSequence extFile hgFixed.transMapSeqRefSeq hgFixed.transMapExtFileRefSeq\ color 0,100,0\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel TransMap RefSeq Gene Mappings\ priority 37.003\ shortLabel TransMap RefGene\ showCdsAllScales .\ showCdsMaxZoom 10000.0\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ superTrack transMap pack\ track transMapAlnRefSeq\ transMapGene hgFixed.transMapGeneRefSeq\ transMapInfo transMapInfoRefSeq\ transMapSrc hgFixed.transMapSrcRefSeq\ type psl\ visibility pack\ transMapAlnMRna TransMap mRNA psl TransMap GenBank mRNA Mappings 3 37.004 0 100 0 127 177 127 0 0 0\ This track contains GenBank mRNA alignments produced by\ the TransMap cross-species alignment algorithm\ from other vertebrate species in the UCSC Genome Browser.\ For closer evolutionary distances, the alignments are created using\ syntenically filtered BLASTZ alignment chains, resulting in a prediction of the\ orthologous genes in human.\
\ \ \ \ TransMap maps genes and related annotations in one species to another\ using synteny-filtered pairwise genome alignments (chains and nets) to\ determine the most likely orthologs. For example, for the mRNA TransMap track\ on the human assembly, more than 400,000 mRNAs from 23 vertebrate species were\ aligned at high stringency to the native assembly using BLAT. The alignments\ were then mapped to the human assembly using the chain and net alignments\ produced using blastz, which has higher sensitivity than BLAT for diverged\ organisms.\\ Compared to translated BLAT, TransMap finds fewer paralogs and aligns more UTR\ bases. For closely related low-coverage assemblies, a reciprocal-best\ relationship is used in the chains and nets to improve the synteny prediction.\
\ \
\ This track follows the display conventions for \ PSL alignment tracks.
\\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare cDNAs against the genomic sequence. For more \ information about this option, click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\ \ \
\
\ To ensure unique identifiers for each alignment, cDNA and gene accessions were\ made unique by appending a suffix for each location in the source genome and\ again for each mapped location in the destination genome. The format is:\
\ accession.version-srcUniq.destUniq\\ \ Where srcUniq is a number added to make each source alignment unique, and\ destUniq is added to give the subsequent TransMap alignments unique\ identifiers.\ \
\ For example, in the cow genome, there are two alignments of mRNA BC149621.1.\ These are assigned the identifiers BC149621.1-1 and BC149621.1-2.\ When these are mapped to the human genome, BC149621.1-1 maps to a single\ location and is given the identifier BC149621.1-1.1. However, BC149621.1-2\ maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note\ that multiple TransMap mappings are usually the result of tandem duplications, where both\ chains are identified as syntenic.\
\ \ \\ This track was produced by Mark Diekhans at UCSC from cDNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. \ Comparative genomics search for losses of long-established genes \ on the human lineage. \ PLoS Comput Biol. 2007 Dec;3(12):e247.
\\ Stanke M, Diekhans M, Baertsch R, Haussler D.\ Using native and syntenically mapped cDNA alignments to improve \ de novo gene finding.\ Bioinformatics. 2008 Mar 1;24(5):637-44.
\\ Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, \ Ewing B, Oommen S, Lau C et al.\ Targeted discovery of novel human exons by comparative \ genomics.\ Genome Res. 2007 Dec;17(12):1763-73.
\ \ genes 1 baseColorDefault diffCodons\ baseColorUseCds table hgFixed.transMapGeneMRna\ baseColorUseSequence extFile hgFixed.transMapSeqMRna hgFixed.transMapExtFileMRna\ color 0,100,0\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel TransMap GenBank mRNA Mappings\ priority 37.004\ shortLabel TransMap mRNA\ showCdsAllScales .\ showCdsMaxZoom 10000.0\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ superTrack transMap pack\ track transMapAlnMRna\ transMapGene hgFixed.transMapGeneMRna\ transMapInfo transMapInfoMRna\ transMapSrc hgFixed.transMapSrcMRna\ type psl\ visibility pack\ transMapAlnSplicedEst TransMap ESTs psl TransMap Spliced EST Mappings 0 37.005 0 100 0 127 177 127 0 0 0\ This track contains GenBank spliced EST alignments produced by\ the TransMap cross-species alignment algorithm\ from other vertebrate species in the UCSC Genome Browser.\ For closer evolutionary distances, the alignments are created using\ syntenically filtered BLASTZ alignment chains, resulting in a prediction of the\ orthologous genes in human.\
\ \ \ \ TransMap maps genes and related annotations in one species to another\ using synteny-filtered pairwise genome alignments (chains and nets) to\ determine the most likely orthologs. For example, for the mRNA TransMap track\ on the human assembly, more than 400,000 mRNAs from 23 vertebrate species were\ aligned at high stringency to the native assembly using BLAT. The alignments\ were then mapped to the human assembly using the chain and net alignments\ produced using blastz, which has higher sensitivity than BLAT for diverged\ organisms.\\ Compared to translated BLAT, TransMap finds fewer paralogs and aligns more UTR\ bases. For closely related low-coverage assemblies, a reciprocal-best\ relationship is used in the chains and nets to improve the synteny prediction.\
\ \
\ This track follows the display conventions for \ PSL alignment tracks.
\\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare cDNAs against the genomic sequence. For more \ information about this option, click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\ \ \
\
\ To ensure unique identifiers for each alignment, cDNA and gene accessions were\ made unique by appending a suffix for each location in the source genome and\ again for each mapped location in the destination genome. The format is:\
\ accession.version-srcUniq.destUniq\\ \ Where srcUniq is a number added to make each source alignment unique, and\ destUniq is added to give the subsequent TransMap alignments unique\ identifiers.\ \
\ For example, in the cow genome, there are two alignments of mRNA BC149621.1.\ These are assigned the identifiers BC149621.1-1 and BC149621.1-2.\ When these are mapped to the human genome, BC149621.1-1 maps to a single\ location and is given the identifier BC149621.1-1.1. However, BC149621.1-2\ maps to two locations, resulting in BC149621.1-2.1 and BC149621.1-2.2. Note\ that multiple TransMap mappings are usually the result of tandem duplications, where both\ chains are identified as syntenic.\
\ \ \\ This track was produced by Mark Diekhans at UCSC from cDNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Zhu J, Sanborn JZ, Diekhans M, Lowe CB, Pringle TH, Haussler D. \ Comparative genomics search for losses of long-established genes \ on the human lineage. \ PLoS Comput Biol. 2007 Dec;3(12):e247.
\\ Stanke M, Diekhans M, Baertsch R, Haussler D.\ Using native and syntenically mapped cDNA alignments to improve \ de novo gene finding.\ Bioinformatics. 2008 Mar 1;24(5):637-44.
\\ Siepel A, Diekhans M, Brejová B, Langton L, Stevens M, Comstock CL, Davis C, \ Ewing B, Oommen S, Lau C et al.\ Targeted discovery of novel human exons by comparative \ genomics.\ Genome Res. 2007 Dec;17(12):1763-73.
\ \ genes 1 baseColorDefault none\ baseColorUseSequence extFile hgFixed.transMapSeqSplicedEst hgFixed.transMapExtFileSplicedEst\ color 0,100,0\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel TransMap Spliced EST Mappings\ priority 37.005\ shortLabel TransMap ESTs\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ superTrack transMap hide\ track transMapAlnSplicedEst\ transMapInfo transMapInfoSplicedEst\ transMapSrc hgFixed.transMapSrcSplicedEst\ type psl\ visibility hide\ vegaGene Vega Genes genePred vegaPep Vega Annotations 0 37.1 0 100 180 127 177 217 0 0 14 chr1,chr16,chr18,chr19,chr6,chr7,chr9,chr10,chr13,chr14,chr20,chr22,chrX,chrY, http://vega.sanger.ac.uk/Homo_sapiens/geneview?transcript=$$\ This track shows gene annotations from the Vertebrate Genome Annotation (Vega)\ database.
\\ The following information is excerpted from the\ Vertebrate Genome Annotation\ home page:
\\ "The Vega database\ is designed to be a central repository for high-quality, frequently updated\ manual annotation of different vertebrate finished genome sequence.\ Vega attempts to present consistent high-quality curation of the published\ chromosome sequences. Finished genomic sequence is analysed on a\ clone-by-clone basis using\ a combination of similarity searches against DNA and protein databases\ as well as a series of ab initio gene predictions (GENSCAN, Fgenes).\ The annotation is based on supporting evidence only."
\\ "In addition, comparative analysis using vertebrate datasets such as\ the Riken mouse cDNAs and Genoscope Tetraodon nigroviridis Ecores\ (Evolutionary Conserved Regions) are used for novel gene discovery."
\\ NOTE: VEGA annotations do not appear on every chromosome in this assembly.
\ \\ This track follows the display conventions for\ gene prediction\ tracks using the following color scheme to indicate the status of the gene\ annotation:\
\ The details pages show the only the Vega gene type and not the transcript type.\ A single gene can have more than one transcript which can belong to\ different classes, so the gene as a whole is classified according to the\ transcript with the "highest" level of classification. Transcript\ type (and other details) may be found by clicking on the transcript\ identifier which forms the outside link to the Vega transcript details page.\ Further information on the gene and transcript classification may be found\ here.\
\ \\ Thanks to Steve Searle at the\ Sanger Institute \ for providing the GTF and FASTA files for the Vega annotations. Vega gene annotations are \ generated by manual annotation from the following groups:\
\
Chromosome 6:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Mungall AJ et al.,\
The DNA sequence and analysis of human \
\ chromosome 6. \
Nature. 2003 Oct 23;425:805-11.
\
Chromosome 7:\
\ Hillier et al., \
\ The Genome Center at Washington University
\
\ Relevant publication: Hillier LW et al., \
The DNA sequence of human \
\ chromosome 7. \
Nature. 2003 Jul 10;424:157-64.
\
Chromosome 9:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
Relevant publication: Humphray SJ et al., \
The DNA sequence and analysis of human chromosome 9. \
Nature. 2004 May 27;429;369-74.
\
Chromosome 10:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Deloukas P et al., \
The DNA sequence and comparative analysis of human chromosome 10. \
Nature. 2004 May 27;429:375-81.
\
Chromosome 13:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Dunham A et al., \
The DNA sequence and analysis of human chromosome 13. \
Nature. 2001 Apr 1;428:522-8.
\
Chromosome 14: \
\ \
\ Genoscope
\
\ Relevant publication: Heilig R et al., \
The DNA sequence and analysis of \
\ human chromosome 14. \
Nature. 2003 Feb 6;421:601-7.
\
Chromosome 20: \
\ The HAVANA Group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Deloukas P et al., \
The DNA sequence and \
\ comparative analysis of human chromosome 20. \
Nature. 2001 Dec 20;414:865-71.
\
Chromosome 22: Chromosome 22 Group,\
\ \
\ Wellcome Trust Sanger Institute
\
\ Relevant publications:
\
\ — Collins JE et al., \
Reevaluating Human Gene Annotation: \
\ A Second-Generation Analysis of Chromosome 22. \
Genome Research. 2003 Jan;13(1):27-36.
\
\ — Dawson E et al., \
A \
\ first-generation linkage disequilibrium map of human chromosome 22. \
Nature. 2002 Aug 1;418:544-8.
\
\ — Dunham I, et al., \
The DNA sequence of human chromosome 22. \
Nature. 1999 Dec 2;402:489-95.
\
Chromosome X: \
\ The HAVANA Group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Ross MT et al., \
The DNA sequence and \
\ comparative analysis of human chromosome X. \
Nature 2005 Mar 17;434:325-37.
\ This track shows pseudogene annotations from the Vertebrate Genome Annotation \ (Vega) database.
\\ The following information is excerpted from the\ Vertebrate Genome Annotation\ home page:
\\ "The Vega database\ is designed to be a central repository for high-quality, frequently updated\ manual annotation of different vertebrate finished genome sequence.\ Vega attempts to present consistent high-quality curation of the published\ chromosome sequences. Finished genomic sequence is analysed on a\ clone-by-clone basis using\ a combination of similarity searches against DNA and protein databases\ as well as a series of ab initio gene predictions (GENSCAN, Fgenes).\ The annotation is based on supporting evidence only."
\\ "In addition, comparative analysis using vertebrate datasets such as\ the Riken mouse cDNAs and Genoscope Tetraodon nigroviridis Ecores\ (Evolutionary Conserved Regions) are used for novel gene discovery."
\\ NOTE: VEGA annotations do not appear on every chromosome in this assembly.
\ \\ This track follows the display conventions for\ gene prediction\ tracks using the following color scheme to indicate the status of the gene\ annotation:\
\ The details pages show the only the Vega gene type and not the transcript type.\ A single gene can have more than one transcript which can belong to\ different classes, so the gene as a whole is classified according to the\ transcript with the "highest" level of classification. Transcript\ type (and other details) may be found by clicking on the transcript\ identifier which forms the outside link to the Vega transcript details page.\ Further information on the gene and transcript classification may be found\ here.\
\ \\ Thanks to Steve Searle at the\ Sanger Institute \ for providing the GTF and FASTA files for the Vega annotations. Vega gene annotations are \ generated by manual annotation from the following groups:\
\
Chromosome 6:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Mungall AJ et al.,\
The DNA sequence and analysis of human \
\ chromosome 6. \
Nature. 2003 Oct 23;425:805-11.
\
Chromosome 7:\
\ Hillier et al., \
\ The Genome Institute at Washington University
\
\ Relevant publication: Hillier LW et al., \
The DNA sequence of human \
\ chromosome 7. \
Nature. 2003 Jul 10;424:157-64.
\
Chromosome 9:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
Relevant publication: Humphray SJ et al., \
The DNA sequence and analysis of human chromosome 9. \
Nature. 2004 May 27;429;369-74.
\
Chromosome 10:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Deloukas P et al., \
The DNA sequence and comparative analysis of human chromosome 10. \
Nature. 2004 May 27;429:375-81.
\
Chromosome 13:\
\ The HAVANA group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Dunham A et al., \
The DNA sequence and analysis of human chromosome 13. \
Nature. 2001 Apr 1;428:522-8.
\
Chromosome 14: \
\ \
\ Genoscope
\
\ Relevant publication: Heilig R et al., \
The DNA sequence and analysis of \
\ human chromosome 14. \
Nature. 2003 Feb 6;421:601-7.
\
Chromosome 20: \
\ The HAVANA Group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Deloukas P et al., \
The DNA sequence and \
\ comparative analysis of human chromosome 20. \
Nature. 2001 Dec 20;414:865-71.
\
Chromosome 22: Chromosome 22 Group,\
\ \
\ Wellcome Trust Sanger Institute
\
\ Relevant publications:
\
\ — Collins JE et al., \
Reevaluating Human Gene Annotation: \
\ A Second-Generation Analysis of Chromosome 22. \
Genome Research. 2003 Jan;13(1):27-36.
\
\ — Dawson E et al., \
A \
\ first-generation linkage disequilibrium map of human chromosome 22. \
Nature. 2002 Aug 1;418:544-8.
\
\ — Dunham I, et al., \
The DNA sequence of human chromosome 22. \
Nature. 1999 Dec 2;402:489-95.
\
Chromosome X: \
\ The HAVANA Group, \
\ Wellcome Trust Sanger Institute
\
\ Relevant publication: Ross MT et al., \
The DNA sequence and \
\ comparative analysis of human chromosome X. \
Nature 2005 Mar 17;434:325-37.
\ This track shows sequence annotation curated at \ The Welcome Trust Sanger \ Institute.
\\ Over 10% of the human genome, including two complete chromosomes — \ 20\ and 22 —\ have been annotated by the sequence annotation team in collaboration with \ the individual \ chromosome \ project teams.
\\ NOTE: Sanger20 annotations appear only on chromosome 20.
\ \\ Finished genomic sequence is analyzed on a clone by clone basis using a\ combination of similarity searches against DNA and protein databases as\ well as a series of ab initio gene predictions (Genscan, Fgenesh). \ Gene structures are annotated on the basis of human interpretation of the\ combined supportive evidence generated during sequence analysis. In\ parallel, experimental methods are applied to extend incomplete\ gene structures and discover new genes. The latter is initiated by\ comparative analysis of the finished sequence with vertebrate datasets\ such as the Riken mouse cDNAs, mouse whole-genome shotgun data and\ GenescopeTetraodon Ecores.
\ \\ Thanks to the Sanger Institute for providing this data set. Email inquiries\ may be sent to humquery@sanger.ac.uk.
\ \ genes 1 chromosomes chr20,\ color 0,100,180\ group genes\ longLabel Sanger Institute Chromosome 20 Genes\ priority 37.2\ shortLabel Sanger 20\ track sanger20\ type genePred\ visibility pack\ sanger22 Sanger 22 genePred Sanger Institute Chromosome 22 Genes 3 37.3 0 100 180 127 177 217 0 0 1 chr22,\ This track contains curated annotations of chromosome 22 produced by the\ Chromosome \ 22 Group at the Sanger Institute. They are described in the \ paper Collins, J.E., et al. \ Reevaluating human gene annotation: a second generation \ analysis of human chromosome 22. Genome Res. 13(1), 27-36\ (2003).
\\ Over 10% of the human genome, including two complete\ chromosomes — 20\ and 22 —\ have been annotated by the Sanger Institute Sequence\ Annotation Team in collaboration with the individual \ chromosome \ project teams. \
\ NOTE: Sanger22 annotations appear only on chromosome 22 in the Genome Browser.\
\ \\ Finished genomic sequence is analyzed on a clone by clone basis using a\ combination of similarity searches against DNA and protein databases as\ well as a series of ab initio gene predictions (Genscan, Fgenesh). \ Gene structures are annotated on the basis of human interpretation of the\ combined supportive evidence generated during sequence analysis. In\ parallel, experimental methods are applied to extend incomplete\ gene structures and discover new genes. The latter is initiated by\ comparative analysis of the finished sequence with vertebrate datasets\ such as the Riken mouse cDNAs, mouse whole-genome shotgun data and\ GenescopeTetraodon Ecores.
\ \\ These annotations were obtained from the\ Internet at http://www.sanger.ac.uk/HGP/Chr22. \ Thanks to the Sanger Institute for providing this data set. Email inquiries\ may be sent to humquery@sanger.ac.uk.\ \ genes 1 chromosomes chr22,\ color 0,100,180\ group genes\ longLabel Sanger Institute Chromosome 22 Genes\ priority 37.3\ shortLabel Sanger 22\ track sanger22\ type genePred\ visibility pack\ sanger22pseudo Sanger 22 Pseudo genePred Sanger Center Chromosome 22 Pseudogenes 0 37.4 30 130 210 142 192 232 0 0 1 chr22,
\ This track contains curated annotations of chromosome 22 produced by the\ Chromosome \ 22 Group at the Sanger Institute. They are described in the \ paper Collins, J.E. et al.\ Reevaluating human gene annotation: a second generation \ analysis of human chromosome 22. Genome Res. 13(1),\ 27-36 (2003).
\\ Over 10% of the human genome, including two complete\ chromosomes — 20 and 22 — have been annotated by the Sanger Institute Sequence\ Annotation Team in collaboration with the individual \ chromosome \ project teams.
\\ NOTE: Sanger22 annotations appear only on chromosome 22 in the Genome Browser.\
\ \\ Finished genomic sequence is analyzed on a clone by clone basis using a\ combination of similarity searches against DNA and protein databases as\ well as a series of ab initio gene predictions (Genscan, Fgenesh). \ Gene structures are annotated on the basis of human interpretation of the\ combined supportive evidence generated during sequence analysis. In\ parallel, experimental methods are applied to extend incomplete\ gene structures and discover new genes. The latter is initiated by\ comparative analysis of the finished sequence with vertebrate datasets\ such as the Riken mouse cDNAs, mouse whole-genome shotgun data and\ GenescopeTetraodon Ecores.
\ \\ These annotations were obtained from the\ Internet at http://www.sanger.ac.uk/HGP/Chr22. \ Thanks to the Sanger Institute for providing this data set. Email inquiries\ may be sent to humquery@sanger.ac.uk.
\ \ genes 1 chromosomes chr22,\ color 30,130,210\ group genes\ longLabel Sanger Center Chromosome 22 Pseudogenes\ priority 37.4\ shortLabel Sanger 22 Pseudo\ track sanger22pseudo\ type genePred\ visibility hide\ encodeAffyChIpHl60SitesH4Kac4Hr08 Affy H4Kac4 RA 8h bed 3 . Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 8hrs) Sites 0 38 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 38\ shortLabel Affy H4Kac4 RA 8h\ subGroups factor=H4Kac4 time=8h\ track encodeAffyChIpHl60SitesH4Kac4Hr08\ encodeAffyRnaHl60SitesHr08IntergenicDistal Affy Ig Dst HL60 8h bed 4 . Affymetrix Intergenic Distal HL60 Retinoic 8hr Transfrags 0 38 80 0 175 167 127 215 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 80,0,175\ longLabel Affymetrix Intergenic Distal HL60 Retinoic 8hr Transfrags\ parent encodeNoncodingTransFrags\ priority 38\ shortLabel Affy Ig Dst HL60 8h\ subGroups region=intergenicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr08IntergenicDistal\ reconTransMap Recon TransMap Reconstruction TransMap Alignments 0 38.001 0 0 0 127 127 127 0 0 0\ TransMap cross-species alignment tracks for debugging the ancestral\ reconstruction.\
\ genes 0 group genes\ longLabel Reconstruction TransMap Alignments\ priority 38.001\ shortLabel Recon TransMap\ superTrack on\ track reconTransMap\ reconTransMapAlnRefSeq Recon TransMap RefGene psl Reconstruction TransMap RefSeq Gene Mappings 3 38.003 0 100 0 127 177 127 0 0 0\ TransMap cross-species alignment tracks of RefSeq mRNA alignments for\ debugging the ancestral reconstruction.\
\ genes 1 baseColorDefault diffCodons\ baseColorUseCds table hgFixed.reconTransMapGeneRefSeq\ baseColorUseSequence extFile hgFixed.reconTransMapSeqRefSeq hgFixed.reconTransMapExtFileRefSeq\ color 0,100,0\ group genes\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Reconstruction TransMap RefSeq Gene Mappings\ priority 38.003\ shortLabel Recon TransMap RefGene\ showCdsAllScales .\ showCdsMaxZoom 10000.0\ showDiffBasesAllScales .\ showDiffBasesMaxZoom 10000.0\ superTrack reconTransMap pack\ track reconTransMapAlnRefSeq\ transMapGene hgFixed.reconTransMapGeneRefSeq\ transMapInfo reconTransMapInfoRefSeq\ transMapSrc hgFixed.reconTransMapSrcRefSeq\ type psl\ visibility pack\ encodeAffyChIpHl60PvalH4Kac4Hr32 Affy H4Kac4 RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 32hrs) P-Value 0 39 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 39\ shortLabel Affy H4Kac4 RA 32h\ subGroups factor=H4Kac4 time=32h\ track encodeAffyChIpHl60PvalH4Kac4Hr32\ encodeAffyRnaHl60SitesHr32IntergenicDistal Affy Ig Dst HL60 32h bed 4 . Affymetrix Intergenic Distal HL60 Retinoic 32hr Transfrags 0 39 105 0 150 180 127 202 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 105,0,150\ longLabel Affymetrix Intergenic Distal HL60 Retinoic 32hr Transfrags\ parent encodeNoncodingTransFrags\ priority 39\ shortLabel Affy Ig Dst HL60 32h\ subGroups region=intergenicDistal celltype=hl60 source=affy\ track encodeAffyRnaHl60SitesHr32IntergenicDistal\ genieAlt AltGenie genePred genieAltPep Genie Gene Predictions from Affymetrix 1 39 125 0 150 190 127 202 0 0 0Genie predictions are based on \ Affymetrix's \ Genie gene finding software. Genie is a generalized HMM \ which accepts constraints based on mRNA and EST data.
\ genes 1 color 125,0,150\ group genes\ longLabel Genie Gene Predictions from Affymetrix\ priority 39\ shortLabel AltGenie\ track genieAlt\ type genePred genieAltPep\ visibility dense\ encodeAffyChIpHl60SitesH4Kac4Hr32 Affy H4Kac4 RA 32h bed 3 . Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 32hrs) Sites 0 40 125 100 0 190 177 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 125,100,0\ longLabel Affymetrix ChIP/Chip (H4Kac4 retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 40\ shortLabel Affy H4Kac4 RA 32h\ subGroups factor=H4Kac4 time=32h\ track encodeAffyChIpHl60SitesH4Kac4Hr32\ ensGene Ensembl Genes genePred ensPep Ensembl Genes 0 40 150 0 0 202 127 127 0 0 0\ These gene predictions were generated by Ensembl.
\ \\ For a description of the methods used in Ensembl gene prediction, refer to \ Hubbard, T. et al. (2002) in the References section below.
\ \\ Thanks to Ensembl for providing this annotation.\ A description of the process \ by which it was produced can be found on the Ensembl site.\
\ \\ Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J,\ Curwen V, Down T, et al. \ The Ensembl genome database project.\ Nucleic Acids Res. 2002 Jan 1;30(1):38-41.
\ \ genes 1 color 150,0,0\ group genes\ longLabel Ensembl Genes\ priority 40\ shortLabel Ensembl Genes\ track ensGene\ type genePred ensPep\ visibility hide\ encodeYaleAffyNB4RARNATarsIntergenicDistal Yale Ig Dst NB4 RA bed 4 . Yale Intergenic Distal NB4 Retinoic TARs 0 40 130 0 125 192 127 190 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 130,0,125\ longLabel Yale Intergenic Distal NB4 Retinoic TARs\ parent encodeNoncodingTransFrags\ priority 40\ shortLabel Yale Ig Dst NB4 RA\ subGroups region=intergenicDistal celltype=nb4 source=yale\ track encodeYaleAffyNB4RARNATarsIntergenicDistal\ ensEstGene Ensembl EST Genes genePred ensEstPep Ensembl EST Gene Predictions 0 40.5 150 0 0 202 127 127 0 0 0 http://www.ensembl.org/perl/geneview?db=estgene&transcript=$$\ Gene predictions from Ensembl based on ESTs.
\ \\ ESTs were mapped onto the genome using a combination of Exonerate, Blast \ and Est_Genome, with a threshold defined as an overall percentage identity \ of 90% and at least one exon having a percentage identity of 97% or higher. \ The results were processed by merging the redundant ESTs and setting \ splice sites to the most common ends, resulting in alternative spliced \ forms. This evidence was processed by Genomewise, which finds the longest \ ORF and assigns 5' and 3' UTRs.
\ \\ This track has an optional codon coloring feature that allows users to \ quickly validate and compare gene predictions. To display codon colors, \ select the genomic codons option from the Color track by \ codons pull-down menu at the top of the track description page.\ This page is accessed via the small button to the left of the track's\ graphical display or through the link on the track's control menu. Click \ here for more information about this feature.\
\\ After you have made your configuration selections, click the \ Submit button to return to the tracks display page.
\ \\ Thanks to Ensembl \ for providing this annotation.
\ \ genes 1 color 150,0,0\ group genes\ longLabel Ensembl EST Gene Predictions\ priority 40.5\ shortLabel Ensembl EST Genes\ track ensEstGene\ type genePred ensEstPep\ url http://www.ensembl.org/perl/geneview?db=estgene&transcript=$$\ visibility hide\ acembly Acembly Genes genePred acemblyPep acemblyMrna AceView Gene Models With Alt-Splicing 1 41 155 0 125 205 127 190 0 0 0 http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=hg16&l=$$\ This track shows AceView gene models constructed from\ mRNA, EST and genomic evidence by Danielle and Jean Thierry-Mieg\ and Vahan Simonyan using the \ Acembly program.
\ \\ This track follows the display conventions for \ gene prediction \ tracks. Gene models that fall into the "main" prediction class\ are displayed in purple; "putative" \ genes are displayed in pink.
\\ The track description page offers the following filter and configuration\ options:\
\ AceView attempts to find the best alignment of each mRNA/EST against the\ genome, and clusters the alignments into the least possible number of\ alternatively spliced transcripts. The reconstructed transcripts are then\ clustered into genes by simple transitive contact. To see the evidence that \ supports each transcript, click the "Outside Link" on an individual \ transcript's details page to access the NCBI AceView web site.
\\ Each AceView transcript model has a gene cluster designation\ (alternate name) that is categorized into a prediction class\ of either main or \ putative.
\\
Prediction Class: main \
Class of genes that includes the protein coding genes (defined\
here by CDS > 100 amino acids) and all genes with at least one\
well-defined standard intron, i.e., an intron with a GT-AG or GC-AG\
boundary, supported by at least one clone matching exactly, with\
no ambiguous bases, and the 8 bases on either side of the intron \
identical to the genome. Genes with a CDS smaller than 100 amino acids are\
included in this class if they meet one of the following conditions: they \
have a NCBI RefSeq sequence (NM_#) or an OMIM identifier, or they encode a \
protein with BlastP homology (< 1e-3) to a cDNA-supported nematode AceView \
protein.
\
Prediction Class: putative\
Class of genes that have no standard intron and do not\
encode CDS of more than 100 amino acids, yet may be sufficiently useful to \
justify not disregarding them completely. Putative genes may be of two\
types: either those supported by more than six cDNA clones or those that\
encode a putative protein with an interesting annotation. Examples include\
a PFAM motif, a BlastP hit to a species other than itself (< 1e-3), \
a transmembrane domain or other rare and meaningful domains\
identified by Psort2, or a highly probable localization in a cell\
compartment (excluding cytoplasm and nucleus).
\ Thanks to Danielle and Jean \ Thierry-Mieg at NIH for providing this track.
\ \\ Thierry-Mieg D, Thierry-Mieg J. \ AceView: a comprehensive cDNA-supported gene and transcripts \ annotation.\ Genome Biol. 2006;7 Suppl 1:S12.1-14.
\ genes 1 color 155,0,125\ group genes\ longLabel AceView Gene Models With Alt-Splicing\ priority 41\ shortLabel Acembly Genes\ track acembly\ type genePred acemblyPep acemblyMrna\ url http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?db=hg16&l=$$\ urlLabel AceView Gene Summary:\ visibility dense\ encodeAffyEncode25bpProbes Affy 25bp Probes bed 4 . Affymetrix 25 bp Probe Locations 0 41 0 0 0 127 127 127 0 0 0\ This track shows the locations of the 25 bp probes on the Affymetrix ENCODE \ tiling array, which are spaced every 22 bp on average. The \ chip, which was produced using Affymetrix GeneChip technology, has 737,680 \ probes representing all the non-repetitive DNA sequence of the ENCODE \ regions. This chip was designed for high throughput experiments to explore the \ human transcriptome at high resolution. The probes represent all the \ transcribed regions, including mRNAs as well as non-coding RNAs that are used \ both structurally and in the regulation of gene expression. Disruption of these\ structures or changes in the levels of transcription or translation may play a \ role in disease pathogenesis; therefore, this array is a valuable tool for the \ discovery and elucidation of disease processes.
\ \\ Probe locations are indicated by solid blocks in the graphical display.
\ \\ Probe positions were provided by Affymetrix, and the sequence was verified upon\ mapping to the genome. The array can be utilized to study transcribed\ regions (see Affy RNA Signal and Affy Transfrags tracks), transcription factor \ binding sites (Affy pVal and Affy Sites tracks), sites of chromatin \ modification, sites for DNA methylation and chromosomal origins of replication.\
\ \\ This chip was generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and the \ Kevin Struhl group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison of normalization methods for high density \ oligonucleotide array data based on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of transcription factor binding sites along \ human chromosomes 21 and 22 points to widespread regulation of noncoding \ RNAs. \ Cell 116(4), 499-509 (2004).
\\ Kapranov, P., Cawley, S. E., Drenkow, J., Bekiranov, S., Strausberg,\ R. L., Fodor, S. P., and Gingeras, T. R. \ Large-scale transcriptional activity in chromosomes 21 and \ 22. \ Science 296(5569), 916-919 (2002).
\ encodeTxLevels 1 dataVersion ENCODE June 2005 Freeze\ group encodeTxLevels\ longLabel Affymetrix 25 bp Probe Locations\ origAssembly hg16\ priority 41\ shortLabel Affy 25bp Probes\ track encodeAffyEncode25bpProbes\ type bed 4 .\ visibility hide\ encodeAffyChIpHl60PvalP300Hr00 Affy P300 RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 0hrs) P-Value 0 41 100 125 0 177 190 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 100,125,0\ longLabel Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 41\ shortLabel Affy P300 RA 0h\ subGroups factor=P300 time=0h\ track encodeAffyChIpHl60PvalP300Hr00\ encodeYaleAffyNB4TPARNATarsIntergenicDistal Yale Ig Dst NB4 TPA bed 4 . Yale Intergenic Distal TPA-Treated NB4 TARs 0 41 155 0 100 205 127 177 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 155,0,100\ longLabel Yale Intergenic Distal TPA-Treated NB4 TARs\ parent encodeNoncodingTransFrags\ priority 41\ shortLabel Yale Ig Dst NB4 TPA\ subGroups region=intergenicDistal celltype=nb4 source=yale\ track encodeYaleAffyNB4TPARNATarsIntergenicDistal\ sibGene SIB Genes genePred Swiss Institute of Bioinformatics Gene Predictions from mRNA and ESTs 0 41.4 195 90 0 225 172 127 0 0 0 http://www.isrec.isb-sib.ch/cgi-bin/tromer/tromer_quick_search.pl?query_str=$$\ The SIB Genes track shows gene predictions based on data\ from RefSeq and EMBL/GenBank. This is transcript-based set of\ predictions. Genes all have the support of at least one GenBank full length RNA sequence,\ one RefSeq RNA, or one spliced EST. The track includes both protein-coding\ and non-coding transcripts. The coding regions are predicted using\ ESTScan.
\ \\ This track in general follows the display conventions for\ gene prediction\ tracks. The exons for putative noncoding genes and untranslated regions \ are represented by relatively thin blocks, while those for coding open \ reading frames are thicker.
\\ This track contains an optional codon coloring\ feature that allows users to quickly validate and compare gene predictions.\ To display codon colors, select the genomic codons option from the\ Color track by codons pull-down menu. Click\ here for more\ information about this feature.
\Further information on the predicted transcripts can be found on the\ Transcriptome Web\ interface.
\ \ \\ The SIB Genes are built using a multi-step pipeline: \
\ The SIB Genes track was produced on the Vital-IT high-performance \ computing platform\ using a computational pipeline developed by Christian Iseli with help from\ colleagues at the Ludwig institute \ for Cancer\ Research and the Swiss Institute \ of Bioinformatics. It is based on data from NCBI RefSeq and GenBank/EMBL. Our\ thanks to the people running these databases and to the scientists worldwide\ who have made contributions to them.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J,\ Wheeler DL. \ GenBank: update. \ Nucleic Acids Res. 2004 Jan 1;32:D23-6.
\ genes 1 color 195,90,0\ group genes\ longLabel Swiss Institute of Bioinformatics Gene Predictions from mRNA and ESTs\ priority 41.4\ shortLabel SIB Genes\ track sibGene\ type genePred\ url http://www.isrec.isb-sib.ch/cgi-bin/tromer/tromer_quick_search.pl?query_str=$$\ urlLabel SIB link:\ visibility hide\ ECgene ECgene Genes genePred ECgenePep ECgene Gene Predictions with Alt-Splicing 0 41.5 155 0 125 205 127 190 0 0 0\ ECgene (gene prediction by EST clustering) predicts genes by combining \ genome-based EST clustering and transcript \ assembly methods. The EST clustering is based on genomic alignment of mRNA \ and ESTs similar to that of NCBI's UniGene for the human genome. The \ transcript assembly procedure yields gene models for each cluster that \ include alternative splicing variants. This algorithm was developed by Prof. \ Sanghyuk Lee's Lab of Bioinformatics at Ewha Womans University in Seoul, \ Korea.
\\ For more detailed information, see the \ ECgene website.\
\ \\ This track follows the display conventions for \ gene prediction \ tracks.
\ \\ The predictions for this track were produced by Namshin Kim and Sanghyuk Lee \ at Ewha Womans Univeristy, Seoul, KOREA.\ genes 1 color 155,0,125\ group genes\ longLabel ECgene Gene Predictions with Alt-Splicing\ priority 41.5\ shortLabel ECgene Genes\ track ECgene\ type genePred ECgenePep\ visibility hide\ encodeAffyChIpHl60SitesP300Hr00 Affy P300 RA 0h bed 3 . Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 0hrs) Sites 0 42 100 125 0 177 190 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 100,125,0\ longLabel Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 42\ shortLabel Affy P300 RA 0h\ subGroups factor=P300 time=0h\ track encodeAffyChIpHl60SitesP300Hr00\ ensEst Ensembl ESTs genePred ensEstPep Human ESTs From Ensembl 0 42 175 20 125 215 137 190 0 0 0
\ Gene predictions from Ensembl based on expressed sequence tags (ESTs).
\ \\ For a description of the methods used, refer to \ Hubbard, T. et al. (2002) in the References section below.
\ \\ This track has an optional codon coloring feature that allows users to \ quickly validate and compare gene predictions. To display codon colors, \ select the genomic codons option from the Color track by \ codons pull-down menu at the top of the track description page.\ This page is accessed via the small button to the left of the track's\ graphical display or through the link on the track's control menu. Click \ here for more information about this feature.
\\ After you have made your configuration selections, click the \ Submit button to return to the tracks display page.
\ \\ Thanks to Ensembl \ for providing this annotation.
\ \\ Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J,\ Curwen V, Down T et al..\ The Ensembl genome database project.\ Nucleic Acids Research. 2002 Jan 1;30(1):38-41.
\ genes 1 color 175,20,125\ group genes\ longLabel $Organism ESTs From Ensembl\ priority 42\ shortLabel Ensembl ESTs\ track ensEst\ type genePred ensEstPep\ visibility hide\ encodeYaleAffyNB4UntrRNATarsIntergenicDistal Yale Ig Dst NB4 Un bed 4 . Yale Intergenic Distal Untreated NB4 TARs 0 42 180 0 75 217 127 165 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeAnalysis 1 color 180,0,75\ longLabel Yale Intergenic Distal Untreated NB4 TARs\ parent encodeNoncodingTransFrags\ priority 42\ shortLabel Yale Ig Dst NB4 Un\ subGroups region=intergenicDistal celltype=nb4 source=yale\ track encodeYaleAffyNB4UntrRNATarsIntergenicDistal\ encodeAffyChipSuper Affy ChIP Affymetrix ChIP-chip 0 43 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks of ChIP-chip data\ generated by the Affymetrix/Harvard ENCODE collaboration.\ ChIP-chip, also known as genome-wide location analysis, is a technique for\ isolation and identification of DNA sequences bound by specific proteins in\ cells.\
\\ These tracks contain ChIP-chip data of multiple transcription\ factors, RNA polymerase II and histones, in multiple cell lines,\ including HL-60 (leukemia) and ME-180 (cervical carcinoma),\ and at different time points after drug cell treatment.\ Binding was assayed on Affymetrix ENCODE tiling arrays.\ Data are displayed as signals, \ median p-values, "strict" p-values and sites.\
\ \\ These data were generated and analyzed by collaboration of\ the Tom Gingeras group at\ Affymetrix and the\ Kevin Struhl lab\ at Harvard Medical School.
\ \\ Please see the Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad BM, Irizarry RA, Astrand M, and Speed TP.\ A comparison\ of normalization methods for high density oligonucleotide array data based\ on variance and bias.\ Bioinformatics. 2003 Jan 22;19(2):185-93.
\\ Cawley S, Bekiranov S, Ng HH, Kapranov P, Sekinger EA,\ Kampa D, Piccolboni A, Sementchenko V, Cheng J,\ Williams AJ et al.\ Unbiased mapping of\ transcription factor binding sites along human chromosomes 21 and 22 points\ to widespread regulation of noncoding RNAs.\ Cell. 2004 Feb 20;116(4):499-509.\ \ encodeChip 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeChip\ longLabel Affymetrix ChIP-chip\ priority 43.0\ shortLabel Affy ChIP\ superTrack on\ track encodeAffyChipSuper\ encodeAffyChIpHl60PvalP300Hr02 Affy P300 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 2hrs) P-Value 0 43 100 125 0 177 190 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 100,125,0\ longLabel Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 43\ shortLabel Affy P300 RA 2h\ subGroups factor=P300 time=2h\ track encodeAffyChIpHl60PvalP300Hr02\ encodeAffyChIpHl60Pval Affy pVal wig 0.0 534.54 Affymetrix ChIP/Chip (retinoic acid-treated HL-60 cells) P-Values 0 43 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,
\ This track shows regions that co-precipitate with antibodies against\ each of ten factors in all ENCODE regions, in retinoic-acid stimulated\ HL-60 cells harvested after 0, 2, 8, and 32 hours. Median P-values are shown in\ separate subtracks for each of the ten antibodies:\
\ Only median P-values are displayed; data for all biological replicates \ can be downloaded from Affymetrix in \ wiggle,\ cel, and\ soft formats.
\ \\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options for the subtracks \ are shown at the top of the track description page, followed by a list of \ subtracks. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for finding the same antibody in different timepoint tracks.
\ \\ The data from replicate arrays were quantile-normalized (Bolstad et\ al., 2003) and all arrays were scaled to a median array intensity of\ 22. Within a sliding 1001 bp window centered on each probe, a signal\ estimator S = ln[max(PM - MM, 1)] (where PM is perfect match and MM is\ mismatch) was computed for each biological replicate treatment- and\ all replicate control-probe pairs. An estimate of the significance of\ the enrichment of treatment signal for each replicate over control\ signal in each window was given by the P-value computed using the\ Wilcoxon Rank Sum test over each biological replicate treatment and\ all control signal estimates in that window. The median of the log\ transformed P-value (-10 log[10] P) across processed replicate data is\ displayed.
\\ Several independent biological replicates (four each for Brg1, CEBPe,\ CTCF, PU1, and SIRT1; five each for H3K27me3, H4Kac4, P300, Pol2 and\ RARA) were generated and hybridized\ to duplicate arrays (two technical replicates). Reproducible enriched\ regions were generated from the signal by first applying a cutoff of\ 20 to the log transformed P-values, a maxGap and minRun of 500 and 0\ basepairs respectively, to each biological replicate. Since each\ region or site may be comprised of more than one probe, a median\ based on the distribution of log transformed P-values was computed per\ site for each of the respective replicates. These seed sites were then\ ranked individually within each of the replicates. If a site was\ absent in a replicate, the maximum or worst rank of the distribution\ was assigned to it.
\\ The following three values were computed for each\ site by combining data from all biological replicates: \
\ The final sites were selected when all of the\ above three metrics were relatively low, where "low" corresponds to \ the top 25 percentile of the distribution.
\ \\ Using the P-values from the biological replicates, all pairwise\ rank correlation coefficients were computed among biological\ replicates. Data sets showing both consistent pairwise correlation\ coefficients and at least weak positive correlation across all pairs\ were considered reproducible.
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and \ Kevin Struhl's group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison \ of normalization methods for high density oligonucleotide array data based \ on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of \ transcription factor binding sites along human chromosomes 21 and 22 points \ to widespread regulation of noncoding RNAs. \ Cell 116(4), 499-509 (2004).
\ encodeChip 0 autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Affymetrix ChIP/Chip (retinoic acid-treated HL-60 cells) P-Values\ maxHeightPixels 128:16:16\ origAssembly hg16\ priority 43.0\ shortLabel Affy pVal\ spanList 1\ subGroup1 time Timepoint 0h=0hrs 2h=2hrs 8h=8hrs 32h=32hrs\ subGroup2 factor Factor Brg1=Brg1 CEBPe=CEBPe CTCF=CTCF H3K27me3=H3K27me3 H4Kac4=H4Kac4 P300=P300 PU1=PU1 RARA=RARA Pol2=Pol2 SIRT1=SIRT1 TFIIB=TFIIB\ track encodeAffyChIpHl60Pval\ type wig 0.0 534.54\ viewLimits 0:100\ visibility hide\ ncbiGenes NCBI Gene Models genePred ncbiPep Human Gene Models from NCBI 0 43 0 0 0 127 127 127 0 0 0\ This track shows regions that co-precipitate with antibodies against\ each of ten factors in all ENCODE regions, in retinoic-acid stimulated\ HL-60 cells harvested after 0, 2, 8, and 32 hours. Clustered sites are shown in\ separate subtracks for each of the ten antibodies:\
\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options for the subtracks \ are shown at the top of the track description page, followed by a list of \ subtracks. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for finding the same antibody in different timepoint tracks.
\ \\ The data from replicate arrays were quantile-normalized (Bolstad et\ al., 2003) and all arrays were scaled to a median array intensity of\ 22. Within a sliding 1001 bp window centered on each probe, a signal\ estimator S = ln[max(PM - MM, 1)] (where PM is perfect match and MM is\ mismatch) was computed for each biological replicate treatment- and\ all replicate control-probe pairs. An estimate of the significance of\ the enrichment of treatment signal for each replicate over control\ signal in each window was given by the P-value computed using the\ Wilcoxon Rank Sum test over each biological replicate treatment and\ all control signal estimates in that window. The median of the log\ transformed P-value (-10 log10 P) across processed replicate \ data is displayed.
\\ Several independent biological replicates (four each for Brg1, CEBPe,\ CTCF, PU1, and SIRT1; five each for H3K27me3, H4Kac4, P300, Pol2 and\ RARA) were generated and hybridized\ to duplicate arrays (two technical replicates). Reproducible enriched\ regions were generated from the signal by first applying a cutoff of\ 20 to the log transformed P-values, a maxGap and minRun of 500 and 0\ basepairs respectively, to each biological replicate. Since each\ region or site may be comprised of more than one probe, a median\ based on the distribution of log transformed P-values was computed per\ site for each of the respective replicates. These seed sites were then\ ranked individually within each of the replicates. If a site was\ absent in a replicate, the maximum or worst rank of the distribution\ was assigned to it.
\\ The following three values were computed for each\ site by combining data from all biological replicates: \
\ The final sites were selected when all of the\ above three metrics were relatively low, where "low" corresponds to \ the top 25 percentile of the distribution.
\ \\ Using the P-values from the biological replicates, all pairwise\ rank correlation coefficients were computed among biological\ replicates. Data sets showing both consistent pairwise correlation\ coefficients and at least weak positive correlation across all pairs\ were considered reproducible.
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and \ Kevin Struhl's group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison \ of normalization methods for high density oligonucleotide array data based \ on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of \ transcription factor binding sites along human chromosomes 21 and 22 points \ to widespread regulation of noncoding RNAs. \ Cell 116(4), 499-509 (2004).
\ encodeChip 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Affymetrix ChIP/Chip (retinoic acid-treated HL-60 cells) Sites\ origAssembly hg16\ priority 43.1\ shortLabel Affy Sites\ subGroup1 time Timepoint 0h=0hrs 2h=2hrs 8h=8hrs 32h=32hrs\ subGroup2 factor Factor Brg1=Brg1 CEBPe=CEBPe CTCF=CTCF H3K27me3=H3K27me3 H4Kac4=H4Kac4 P300=P300 PU1=PU1 RARA=RARA Pol2=Pol2 SIRT1=SIRT1 TFIIB=TFIIB\ track encodeAffyChIpHl60Sites\ type bed 3 .\ visibility hide\ encodeAffyChIpHl60PvalStrict Affy Strict pVal wig 0 696.62 Affymetrix ChIP-chip (HL-60 and ME-180 cells) Strict P-Value 1 43.7 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows regions that co-precipitate with antibodies against\ each of 4 factors in all ENCODE regions, in retinoic-acid stimulated\ \ HL-60\ (leukemia) cells harvested after 0, 2, 8, and 32 hours, and in a fifth\ factor tested in \ \ ME-180 cervical carcinoma cells.\ Median of the transformed P-value (-10 log[10] P) across\ processed replicate data is displayed as \ separate subtracks for each antibody:\
\ Only the median of the transformed P-value (-10 log[10] P) is displayed; \ data for all biological replicates \ can be downloaded from Affymetrix in \ wiggle,\ cel, and\ soft formats.
\ \\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options for the subtracks \ are shown at the top of the track description page, followed by a list of \ subtracks. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for finding the same antibody in different timepoint tracks.
\ \\ The data from replicate arrays were quantile-normalized (Bolstad et\ al., 2003) and all arrays were scaled to a median array intensity of\ 22. Within a sliding 1001 bp window centered on each probe, a signal\ estimator S = ln[max(PM - MM, 1)] (where PM is perfect match and MM is\ mismatch) was computed for each biological replicate treatment- and\ all replicate control-probe pairs. An estimate of the significance of\ the enrichment of treatment signal for each replicate over control\ signal in each window was given by the P-value computed using the\ Wilcoxon Rank Sum test over each biological replicate treatment and\ all control signal estimates in that window. The median of the \ transformed P-value (-10 log[10] P) across processed replicate data is\ displayed.
\ \\ Using the P-values from the biological replicates, all pairwise\ rank correlation coefficients were computed among biological\ replicates. Data sets showing both consistent pairwise correlation\ coefficients and at least weak positive correlation across all pairs\ were considered reproducible.
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and \ Kevin Struhl's group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison \ of normalization methods for high density oligonucleotide array data based \ on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of \ transcription factor binding sites along human chromosomes 21 and 22 points \ to widespread regulation of noncoding RNAs. \ Cell 116(4), 499-509 (2004).
\\ Yang A, Zhu Z, Kapranov P, McKeon F, Church GM, Gingeras TR, Struhl K.\ \ Relationships between p63 binding, DNA sequence, transcription\ activity, and biological function in human cells. \ Mol. Cell. 24(4), 593-602 (2006).
\ \ encodeChip 0 autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChip\ longLabel Affymetrix ChIP-chip (HL-60 and ME-180 cells) Strict P-Value\ maxHeightPixels 128:16:16\ origAssembly hg17\ priority 43.7\ shortLabel Affy Strict pVal\ spanList 1\ subGroup1 factor Factor H3K9K14ac2=H3K9K14ac2 H4Kac4=H4Kac4 Pol2=Pol2 actd=p63_ActD mactd=p63_mActD\ subGroup2 time Timepoint 0h=0hrs 2h=2hrs 8h=8hrs 32h=32hrs\ superTrack encodeAffyChipSuper dense\ track encodeAffyChIpHl60PvalStrict\ type wig 0 696.62\ viewLimits 0:250\ encodeAffyChIpHl60SignalStrict Affy Strict Sig wig -2.78 3.97 Affymetrix ChIP-chip (HL-60 and ME-180 cells) Strict Signal 0 43.8 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows regions that co-precipitate with antibodies against\ each of 4 factors in all ENCODE regions, in retinoic-acid stimulated\ \ HL-60\ (leukemia) cells harvested after 0, 2, 8, and 32 hours, and in a fifth\ factor tested in \ \ ME-180 cervical carcinoma cells.\ \ Median of the signal estimate across processed replicate data is displayed as \ separate subtracks for each antibody:\
\ Only the median of the signal estimate across processed replicate data\ is displayed; data for all biological replicates \ can be downloaded from Affymetrix in \ wiggle,\ cel, and\ soft formats.
\ \\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options for the subtracks \ are shown at the top of the track description page, followed by a list of \ subtracks. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for finding the same antibody in different timepoint tracks.
\ \\ The data from replicate arrays were quantile-normalized (Bolstad et\ al., 2003) and all arrays were scaled to a median array intensity of\ 22. Within a sliding 1001 bp window centered on each probe, a signal\ estimator S = ln[max(PM - MM, 1)] (where PM is perfect match and MM is\ mismatch) was computed for each biological replicate treatment- and\ all replicate control-probe pairs. An estimate of the significance of\ the enrichment of treatment signal for each replicate over control\ signal in each window was given by the P-value computed using the\ Wilcoxon Rank Sum test over each biological replicate treatment and\ all control signal estimates in that window. The median of the \ signal estimate across processed replicate data is\ displayed.
\ \\ Using the P-values from the biological replicates, all pairwise\ rank correlation coefficients were computed among biological\ replicates. Data sets showing both consistent pairwise correlation\ coefficients and at least weak positive correlation across all pairs\ were considered reproducible.
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and \ Kevin Struhl's group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison \ of normalization methods for high density oligonucleotide array data based \ on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of \ transcription factor binding sites along human chromosomes 21 and 22 points \ to widespread regulation of noncoding RNAs. \ Cell 116(4), 499-509 (2004).
\\ Yang A, Zhu Z, Kapranov P, McKeon F, Church GM, Gingeras TR, Struhl K.\ \ Relationships between p63 binding, DNA sequence, transcription\ activity, and biological function in human cells. \ Mol. Cell. 24(4), 593-602 (2006).
\ \ encodeChip 0 autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChip\ longLabel Affymetrix ChIP-chip (HL-60 and ME-180 cells) Strict Signal\ maxHeightPixels 128:16:16\ origAssembly hg17\ priority 43.8\ shortLabel Affy Strict Sig\ spanList 1\ subGroup1 factor Factor H3K9K14ac2=H3K9ac2 H4Kac4=H4Kac4 Pol2=Pol2 actd=p63_ActD mactd=p63_mActD\ subGroup2 time Timepoint 0h=0hrs 2h=2hrs 8h=8hrs 32h=32hrs\ superTrack encodeAffyChipSuper dense\ track encodeAffyChIpHl60SignalStrict\ type wig -2.78 3.97\ viewLimits 0:2.0\ visibility hide\ encodeAffyChIpHl60SitesStrict Affy Strict Sites bed 3 . Affymetrix ChIP-chip (HL-60 and ME-180 cells) Strict Sites 1 43.9 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows regions that co-precipitate with antibodies against\ each of 4 factors in all ENCODE regions, in retinoic-acid stimulated\ \ HL-60\ (leukemia) cells harvested after 0, 2, 8, and 32 hours, and in a fifth\ factor tested in \ \ ME-180 cervical carcinoma cells.\ \ Clustered sites are shown in separate subtracks for each antibody:\
\ Data for all biological replicates \ can be downloaded from Affymetrix in \ wiggle,\ cel, and\ soft formats.
\\
\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options for the subtracks \ are shown at the top of the track description page, followed by a list of \ subtracks. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary. They provide a\ visual cue for finding the same antibody in different timepoint tracks.
\ \\ Three independent biological replicates were generated and hybridized\ to duplicate arrays (two technical replicates). Reproducible enriched regions\ were generated from the signal, by first applying a cutoff \ of 0.693(ln(2)=0.693) to the signal estimate, a maxgap and minrun of 500 \ and 0 basepairs respectively, to each biological replicate. \ Since each region or site can comprise of more than a single probe, \ a median based on the distribution of log transformed P-values \ was computed per site for each of the respective replicates. \ These seed sites were then ranked individually within each of the replicates. \ If a site was absent in a replicate the maximum or worst rank of the \ distribution was assigned to it. \
\ The following three values were computed for each\ site by combining data from all biological replicates: \
\ A final signal estimate based filter was applied, where sites with median\ signal estimate of at least 0.693/(total number of individual replcates) \ were considered. This was to ensure that if a site was not \ detected consistently in all replicates but was detected at a significant\ signal level in a subset of the replicates its detection level would\ be weighted accordingly in the final selection of sites. \ The final sites were selected when all of the above three metrics were\ relatively low, where "low" corresponds to the top 25 \ percentile of the distribution.\ \
\ Using the P-values from the biological replicates, all pairwise\ rank correlation coefficients were computed among biological\ replicates. Data sets showing both consistent pairwise correlation\ coefficients and at least weak positive correlation across all pairs\ were considered reproducible.
\ \\ These data were generated and analyzed by the Gingeras/Struhl\ collaboration with the Tom Gingeras group at \ Affymetrix and \ Kevin Struhl's group at Harvard Medical School.
\ \\ Please see the \ Affymetrix Transcriptome site for a project overview and\ additional references to Affymetrix tiling array publications.
\\ Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. \ A comparison \ of normalization methods for high density oligonucleotide array data based \ on variance and bias. \ Bioinformatics 19(2), 185-193 (2003).
\\ Cawley, S., Bekiranov, S., Ng, H. H., Kapranov, P., Sekinger,\ E. A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J.,\ Williams, A. J., et al. \ Unbiased mapping of \ transcription factor binding sites along human chromosomes 21 and 22 points \ to widespread regulation of noncoding RNAs. \ Cell 116(4), 499-509 (2004).
\\ Yang A, Zhu Z, Kapranov P, McKeon F, Church GM, Gingeras TR, Struhl K.\ \ Relationships between p63 binding, DNA sequence, transcription\ activity, and biological function in human cells. \ Mol. Cell. 24(4), 593-602 (2006).
\ \ encodeChip 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChip\ longLabel Affymetrix ChIP-chip (HL-60 and ME-180 cells) Strict Sites\ origAssembly hg17\ priority 43.9\ shortLabel Affy Strict Sites\ subGroup1 factor Factor H3K9K14ac2=H3K9K14ac2 H4Kac4=H4Kac4 Pol2=Pol2 actd=p63_ActD mactd=p63_mActD\ subGroup2 time Timepoint 0h=0hrs 2h=2hrs 8h=8hrs 32h=32hrs\ superTrack encodeAffyChipSuper dense\ track encodeAffyChIpHl60SitesStrict\ type bed 3 .\ encodeAffyChIpHl60SitesP300Hr02 Affy P300 RA 2h bed 3 . Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 2hrs) Sites 0 44 100 125 0 177 190 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 100,125,0\ longLabel Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 44\ shortLabel Affy P300 RA 2h\ subGroups factor=P300 time=2h\ track encodeAffyChIpHl60SitesP300Hr02\ npredGene NCBI Prediction genePred npredPep NCBI Gene Predictions 0 44 170 100 0 212 177 127 0 0 0 http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=nucleotide&cmd=search&term=$$This track shows predictions from NCBI Genome\ Assembly/Annotation Projects.\ \
\ The Twinscan program predicts genes in a manner similar to Genscan, except \ that Twinscan takes advantage of genome comparisons to improve gene prediction\ accuracy. In the version of Twinscan used to generate this track, intronless \ copies of known genes are masked out before gene prediction, reducing \ the number of non-processed pseudogenes in gene models. More information and a\ web server can be found at http://mblab.wustl.edu/.
\ \\ This track follows the display conventions for\ gene prediction\ tracks.
\\ The track description page offers the following filter and configuration\ options:\
\ The Twinscan algorithm is described in Korf, I. et al. 2001 in the\ References section below.
\ \\ Thanks to Michael Brent's Computational Genomics Group at Washington\ University St. Louis for providing these data.
\ \\ Korf I, Flicek P, Duan D, Brent MR.\ Integrating genomic homology into gene structure prediction.\ Bioinformatics. 2001 Jun 1;17(90001)S140-8.
\ genes 1 color 0,100,100\ group genes\ longLabel Twinscan Gene Predictions Using Mouse/Human Homology\ priority 45\ shortLabel Twinscan\ track twinscan\ type genePred twinscanPep\ visibility hide\ ucscFromMouse UCSC Mm3 genePred UCSC Gene Predictions from Known Mouse Genes Mapped to Human 0 45 0 100 100 0 50 50 0 0 0 genes 1 altColor 0,50,50\ color 0,100,100\ group genes\ longLabel UCSC Gene Predictions from Known Mouse Genes Mapped to Human\ priority 45\ shortLabel UCSC Mm3\ track ucscFromMouse\ type genePred\ visibility hide\ nscanGene N-SCAN genePred nscanPep N-SCAN Gene Predictions 0 45.1 34 139 34 144 197 144 0 0 0\ This track shows gene predictions using the N-SCAN gene structure prediction\ software provided by the Computational Genomics Lab at Washington University \ in St. Louis, MO, USA.\
\ \\ N-SCAN combines biological-signal modeling in the target genome sequence along\ with information from a multiple-genome alignment to generate de novo gene\ predictions. It extends the TWINSCAN target-informant genome pair to allow for\ an arbitrary number of informant sequences as well as richer models of\ sequence evolution. N-SCAN models the phylogenetic relationships between the\ aligned genome sequences, context-dependent substitution rates, insertions,\ and deletions.\
\BUG: INFORMANT DESCRIPTION NOT SET IN TRACK DB FILE
\ \\ Thanks to Michael Brent's Computational Genomics Group at Washington \ University St. Louis for providing this data.\
\\ Special thanks for this implementation of N-SCAN to Aaron Tenney in\ the Brent lab, and Robert Zimmermann, currently at Max F. Perutz\ Laboratories in Vienna, Austria.\
\ \\ Gross SS, Brent MR.\ Using\ multiple alignments to improve gene prediction. In\ Proc. 9th Int'l Conf. on Research in Computational Molecular Biology\ (RECOMB '05):374-388 and J Comput Biol. 2006 Mar;13(2):379-93.\
\\ Korf I, Flicek P, Duan D, Brent MR.\ Integrating genomic homology into gene structure prediction.\ Bioinformatics. 2001 Jun 1;17(90001):S140-8.
\\ van Baren MJ, Brent MR.\ Iterative gene prediction and pseudogene removal improves\ genome annotation.\ Genome Res. 2006 May;16(5):678-85.
\\ Haas BJ, Delcher AL, Mount SM, Wortman JR, Smith RK Jr, Hannick LI, Maiti R, Ronning CM,\ Rusch DB, Town CD et al.\ \ Improving the Arabidopsis genome annotation using maximal transcript \ alignment assemblies.\ Nucleic Acids Res 2003 Oct 1;31(19):5654-66.
\ genes 1 baseColorDefault genomicCodons\ baseColorUseCds given\ color 34,139,34\ group genes\ informant BUG: INFORMANT DESCRIPTION NOT SET IN TRACK DB FILE\ longLabel N-SCAN Gene Predictions\ priority 45.1\ shortLabel N-SCAN\ track nscanGene\ type genePred nscanPep\ visibility hide\ contrastGene CONTRAST coloredExon CONTRAST Gene Predictions 0 45.2 34 34 139 144 144 197 0 0 0\ This track shows protein-coding gene predictions generated by \ CONTRAST. \ Each predicted exon is colored according to confidence level: green (high\ confidence), orange (medium confidence), or red (low confidence).\
\ \\ CONTRAST predicts protein-coding genes from a multiple genomic\ alignment using a combination of discriminative machine learning techniques. A two-stage approach is used, in which output from local classifiers is combined with a global model of gene structure. CONTRAST is trained using a novel procedure designed to\ maximize expected coding region boundary detection accuracy.\ \
\ Please see the \ CONTRAST web site for details on how these predictions were generated and an estimate of accuracy. \ \
\ Thanks to Samuel Gross of the Batzoglou lab at Stanford University for providing these predictions.\ \
\ Gross SS, Do CB, Sirota M, Batzoglou S.\ CONTRAST: A Discriminative, Phylogeny-free Approach to Multiple Informant De Novo Gene Prediction.\ Genome Biology. 2007 December;8(12):R269.\
\ genes 0 color 34,34,139\ group genes\ longLabel CONTRAST Gene Predictions\ priority 45.2\ shortLabel CONTRAST\ track contrastGene\ type coloredExon\ visibility hide\ slamMouse Slam Mouse genePred Slam Gene Predictions Using Human/Mouse Homology 0 45.5 100 50 0 175 150 128 0 0 0\ Slam predicts coding exons and conserved noncoding regions in a pair of \ homologous DNA sequences, incorporating both statistical sequence properties \ and degree of conservation in making the predictions. The model is symmetric \ and the same gene structure (with possibly different exon lengths) is \ predicted in both sequences.
\\ The symmetry of the model gives it a higher degree of accuracy for regions \ where the true underlying gene structures contain the same number of coding \ exons, in cases where this is not true, or when one of the sequences is of \ lower quality and contains in-frame stop codons, the resulting predictions \ tend to have lower accuracy.
\\ More information and a web server can be found on the \ Slam website.
\ \\ Alexandersson, M., Cawley, S., and Pachter, L. \ SLAM - Cross-species gene finding and alignment with a \ generalized pair hidden Markov model. \ Genome Res. 13(3), 496-502.
\\ Cawley, S., Pachter, L., and Alexandersson, M. \ SLAM web server for comparative gene finding and alignment.\ Nucleic Acids Res. 31(13), 3507-3509 (2003).
\\ Pachter, L., Alexandersson, M., and Cawley, S. \ Applications of generalized pair hidden Markov models to \ alignment and gene finding problems. \ J Comput Biol. 9(2), 389-99 (2002).
\\ Pachter, L., Alexandersson, M., and Cawley, S. Applications of generalized \ pair hidden Markov models to alignment and gene finding problems. \ Proceedings of the Fifth Annual International Conference on Computational \ Molecular Biology (RECOMB 2001) (2001).
\ \ genes 1 altColor 175,150,128\ color 100,50,0\ group genes\ longLabel Slam Gene Predictions Using Human/Mouse Homology\ priority 45.5\ shortLabel Slam Mouse\ track slamMouse\ type genePred\ visibility hide\ slamRat Slam Rat genePred Slam Gene Predictions Using Human/Rat Homology 0 45.6 100 50 0 175 150 128 0 0 0\ Slam predicts coding exons and conserved noncoding regions in a pair of \ homologous DNA sequences, incorporating both statistical sequence properties \ and degree of conservation in making the predictions. The model is symmetric \ and the same gene structure (with possibly different exon lengths) is \ predicted in both sequences.
\\ The symmetry of the model gives it a higher degree of accuracy for regions \ where the true underlying gene structures contain the same number of coding \ exons, in cases where this is not true, or when one of the sequences is of \ lower quality and contains in-frame stop codons, the resulting predictions \ tend to have lower accuracy.
\\ More information and a web server can be found on the \ Slam website.
\ \\ Alexandersson, M., Cawley, S., and Pachter, L. \ SLAM - Cross-species gene finding and alignment with a \ generalized pair hidden Markov model. \ Genome Res. 13(3), 496-502.
\\ Cawley, S., Pachter, L., and Alexandersson, M. \ SLAM web server for comparative gene finding and alignment.\ Nucleic Acids Res. 31(13), 3507-3509 (2003).
\\ Pachter, L., Alexandersson, M., and Cawley, S. \ Applications of generalized pair hidden Markov models to \ alignment and gene finding problems. \ J Comput Biol. 9(2), 389-99 (2002).
\\ Pachter, L., Alexandersson, M., and Cawley, S. Applications of generalized \ pair hidden Markov models to alignment and gene finding problems. \ Proceedings of the Fifth Annual International Conference on Computational \ Molecular Biology (RECOMB 2001) (2001).
\ \ genes 1 altColor 175,150,128\ color 100,50,0\ group genes\ longLabel Slam Gene Predictions Using Human/Rat Homology\ priority 45.6\ shortLabel Slam Rat\ track slamRat\ type genePred\ visibility hide\ encodeAffyChIpHl60SitesP300Hr08 Affy P300 RA 8h bed 3 . Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 8hrs) Sites 0 46 100 125 0 177 190 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 100,125,0\ longLabel Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 46\ shortLabel Affy P300 RA 8h\ subGroups factor=P300 time=8h\ track encodeAffyChIpHl60SitesP300Hr08\ genomeScan NCBI GenomeScan genePred genomeScanPep Human GenomeScan Models from NCBI 0 46 0 0 0 127 127 127 0 0 0\ This track shows genome-wide p53 binding sites as determined by chromatin\ immunoprecipitation (ChIP) and paired-end di-tag (PET) sequencing. \ The p53 protein is a transcription factor involved in the control \ of cell growth that is often expressed at high levels in cancer cells. \ See the Methods section below for more information about ChIP and\ PET.
\\ The PET sequences in this track are derived from 65,572\ individual p53 ChIP fragments of 5-fluorouracil (5FU) stimulated\ HCT116 cells. More datasets will be submitted in the future, including\ STAT1, TAF250, and E2F1.
\ \\ In the graphical display, PET sequences are shown as two blocks,\ representing the ends of the pair, connected by a thin arrowed\ line. Overlapping PET clusters (PET fragments that overlap one\ another) originating from the ChIP enrichment process define the\ genomic loci that are potential transcription factor binding sites (TFBSs). \ PET singletons, from non-specific ChIP fragments that did not cluster, are \ not shown.
\\ In full and packed display modes, the arrowheads on the horizontal line \ represent the orientation of the PET sequence, and an ID of the format \ XXXXX-M is shown to the left of each PET,\ where X is the unique ID for each PET \ and M is the number of PET sequences at this location.\ The track coloring reflects the value of M: \ light gray indicates one or two sequences (score = 333), dark gray is used for \ three sequences (score = 800) and black indicates four or more PET sequences \ (score = 1000) at the location.
\ \\ HCT116 cells were treated with 5FU for six hours. The cross-linked\ chromatin was sheared and precipitated with a high affinity\ antibody. The DNA fragments were end-polished and cloned into the plasmid\ vector, pGIS3. pGIS3 contains two MmeI recognition sites that\ flank the cloning site, which were used to produce a 36 bp\ PET from the original ChIP DNA fragments (18 bp from each of the 5' and 3' \ ends). Multiple 36 bp PETs were concatenated and cloned into pZero-1 for \ sequencing, where each sequence read can generate 10-15 PETs. The PET \ sequences were extracted from raw sequence reads and mapped to the genome, \ defining the boundaries of each ChIP DNA fragment. The following specific\ mapping criteria were used:\
\ Due to the known possibility of MmeI slippage (+/- 1 bp) that\ leads to ambiguities at the PET signature boundaries, a minimal 17 bp\ match was set for each 18 bp signature. The total count of PET sequences\ mapped to the same locus but with slight nucleotide differences may reflect\ the expression level of the transcripts. Only PETs with specific mapping \ (one location) to the genome were considered. PETs that mapped to multiple \ locations may represent low complexity or repetitive sequences, and therefore \ were not included for further analysis.
\ \\ Statistical and experimental verification exercises have shown that\ the overlapping PET clusters result from ChIP enrichment events.\
\ Monte Carlo simulation using the p53 ChIP-PET data estimated that\ about 27% of PET-2 clusters (PET clusters with two overlapping\ members), 3% of the PET clusters with 3 overlapping members (PET-3\ clusters), and less than 0.0001% of PET clusters with more than 3\ overlapping members were due to random chance. This suggests that the\ PET clusters most likely represent the real enrichment events by ChIP\ and that a higher number of overlapping fragments correlates to a\ higher probability of a real ChIP enrichment event. Furthermore, based\ on goodness-of-fit analysis for assessing the reliability of PET\ clusters, it was estimated that less than 36% of the PET-2 clusters\ and over 99% of the PET-3+ clusters (clusters with three or more\ overlapping members) are true enrichment ChIP sites. Thus, the\ verification rate is nearly 100% for PET-3+ ChIP clusters, and the\ PET-2 clusters contain significant noise.
\\ In addition to these statistical analyses, 40 genomic locations\ identified by PET-3+ clusters were randomly selected and analyzed by\ quantitative real-time PCR. The relative enrichment of candidate\ regions compared to control GST ChIP DNA was determined and all 40\ regions (100%) were confirmed to have significant enrichment of p53\ ChIP clusters.
\ \\ The p53 ChIP-PET library and sequence data were produced at the \ Genome Institute of Singapore. The data were mapped\ and analyzed by scientists from the Genome Institute of Singapore, the\ Bioinformatics Institute, Singapore, and Boston University.
\ \\ Ng, P. et al. Gene identification signature (GIS) analysis for\ transcriptome characterization and genome annotation. Nature\ Methods 2, 105-111 (2005).
\ \ encodeChip 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX,chrY\ color 158,35,135\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel GIS ChIP-PET: p53 Ab on 5FU treated HCT116 cells\ origAssembly hg16\ priority 47.0\ shortLabel GIS p53 5FU HCT116\ track encodeGisChipPet\ type bed 12\ useScore 1\ visibility hide\ sgpGene SGP Genes genePred sgpPep SGP Gene Predictions Using Mouse/Human Homology 0 47 0 90 100 127 172 177 0 0 0\ This track shows gene predictions from the SGP program, developed at \ the Genome Bionformatics \ Laboratory (GBL), which is part of the Grup de Recerca en Informàtica Biomèdica (GRIB) at Institut \ Municipal d'Investigació Mèdica (IMIM) / Centre de Regulació Genòmica (CGR) in \ Barcelona. To predict genes in a genomic query, SGP combines geneid predictions \ with tblastx comparisons of the genomic query against other genomic sequences.\
\\ Thanks to GBL for providing these gene predictions.\
\ \ \ \ genes 1 color 0,90,100\ group genes\ longLabel SGP Gene Predictions Using Mouse/Human Homology\ priority 47\ shortLabel SGP Genes\ track sgpGene\ type genePred sgpPep\ visibility hide\ encodeGisChipPetStat1 GIS STAT1 HeLa bed 12 GIS ChIP-PET: STAT1 Ab on (+/-)gIF HeLa cells 0 47.1 125 140 35 67 79 35 1 0 24 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX,chrY,\ This track shows STAT1 binding sites as determined by chromatin \ immunoprecipitation (ChIP) and paired-end di-tag (PET) sequencing. \
\ The PET sequences in this track are derived from\ 327,838 STAT1 ChIP fragments of interferon gamma-stimulated HeLa cells and\ 263,901 STAT1 ChIP fragments of non-stimulated HeLa cells. \ Of these individual ChIP fragments, 3,180 of the PETs from the stimulated cells and\ 4,007 PETs from unstimulated cells were mapped to the ENCODE regions. \ The data from the unstimulated cells were used as the negative control.
\\ Only PETs mapped to the ENCODE regions are shown in this track. \ \
\ In the graphical display, PET sequences are shown as two blocks,\ representing the ends of the pair, connected by a thin, arrowed\ line. Overlapping PET clusters (PET fragments that overlap one\ another) originating from the ChIP enrichment process define the\ genomic loci that are potential transcription factor binding sites (TFBSs).\ PET singletons, from non-specific ChIP fragments that did not cluster, are\ not shown.
\\ In full and packed display modes, the arrowheads on the horizontal line\ represent the orientation of the PET sequence, and an ID of the format\ XXXXX-M is shown to the left of each PET,\ where X is the unique ID for each PET\ and M is the number of PET sequences at this location.\ The track coloring reflects the value of M:\ light gray indicates one or two sequences (score = 333), dark gray is used for\ three sequences (score = 800) and black indicates four or more PET sequences\ (score = 1000) at the location.
\ \\ The STAT1 chromatin immuno-precipitated DNA fragments from stimulated and \ non-stimulated control cells were end-polished and cloned into the plasmid \ vector, pGIS3. pGIS3 contains two MmeI recognition sites that flank \ the cloning site, which were used to produce a 36 bp PET from the original \ ChIP DNA fragments (18 bp from each of the 5' and 3' ends). Multiple 36 bp \ PETs were concatenated and cloned into pZero-1 for sequencing, where each \ sequence read can generate 10-15 PETs. The PET sequences were extracted from \ raw sequence reads and mapped to the genome, defining the boundaries of each \ ChIP DNA fragment. The following specific mapping criteria were used: \
\ Due to the known possibility of MmeI slippage (+/- 1 bp) that leads to \ ambiguities at the PET signature boundaries, a minimal 17 bp match was set \ for each 18 bp signature. Only PETs with specific mapping (one location) to \ the genome were considered. PETs that mapped to multiple locations may \ represent low complexity or repetitive sequences, and therefore were not \ included for further analysis.
\ \\ Statistical and experimental verification exercises have shown that the \ overlapping PET clusters result from ChIP enrichment events.
\\ Monte Carlo simulation using the STAT1 ChIP-PET data from interferon \ gamma-stimulated dataset estimated that random chance accounted for about \ 58% of PET-3 clusters (maximal numbers of PETs within the overlap region \ of any cluster), 21% of the PET clusters with 4 overlapping members (PET-4 \ clusters), and less than 0.5% of PET clusters with more than 5 overlapping \ members. This suggests that the PET-5+ clusters represent the real enrichment \ events by ChIP and that a higher number of overlapping fragments correlates \ to a higher probability of a real ChIP enrichment event. Furthermore, based \ on goodness-of-fit analysis for assessing the reliability of PET clusters, it \ was estimated that less than 30% of the PET-4 clusters and over 90% of the \ PET-5+ clusters (clusters with five or more overlapping members) are true \ enrichment ChIP sites.
\\ In addition to these statistical analyses, 9 out of 14 genomic locations (64%) \ identified by PET-5+ clusters in the ENCODE regions were supported by ChIP-chip \ data from Yale using the same ChIP DNA as hybridization material.
\ \\ The ChIP fragment prep was provided by Ghia Euskirchen from Michael Snyder's \ lab at Yale. The ChIP-PET library and sequence data were produced at the\ Genome Institute of Singapore. The data were mapped\ and analyzed by scientists from the Genome Institute of Singapore and\ the Bioinformatics \ Institute, Singapore.
\ \\ Ng, P. et al. \ Gene identification signature (GIS) analysis for\ transcriptome characterization and genome annotation. Nature\ Methods 2, 105-111 (2005).
\ encodeChip 1 altColor 67,79,35\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX,chrY\ color 125,140,35\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChip\ longLabel GIS ChIP-PET: STAT1 Ab on (+/-)gIF HeLa cells\ priority 47.1\ shortLabel GIS STAT1 HeLa\ track encodeGisChipPetStat1\ type bed 12\ useScore 1\ visibility hide\ encodeAffyChIpHl60SitesP300Hr32 Affy P300 RA 32h bed 3 . Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 32hrs) Sites 0 48 100 125 0 177 190 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 100,125,0\ longLabel Affymetrix ChIP/Chip (P300 retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 48\ shortLabel Affy P300 RA 32h\ subGroups factor=P300 time=32h\ track encodeAffyChIpHl60SitesP300Hr32\ softberryGene Fgenesh++ Genes genePred softberryPep Fgenesh++ Gene Predictions 0 48 0 100 0 127 177 127 0 0 0\ Fgenesh++ predictions are based on Softberry's gene-finding software.
\ \\ Fgenesh++ uses both hidden Markov models (HMMs) and protein similarity to \ find genes in a completely automated manner. For more information, see \ Solovyev, V.V. (2001) in the References section below.
\ \\ The Fgenesh++ gene predictions were produced by \ Softberry Inc. \ Commercial use of these predictions is restricted to viewing in \ this browser. Please contact Softberry Inc. to make arrangements for further \ commercial access.
\ \\ Solovyev, V.V. \ "Statistical approaches in Eukaryotic gene prediction" in the \ Handbook of Statistical Genetics (ed. Balding, D. et al.), \ 83-127. John Wiley & Sons, Ltd. (2001).
\ genes 1 color 0,100,0\ group genes\ longLabel Fgenesh++ Gene Predictions\ priority 48\ shortLabel Fgenesh++ Genes\ track softberryGene\ type genePred softberryPep\ visibility hide\ encodeLIChIP LI ChIP Various bedGraph 4 Ludwig Institute/UCSD ChIP-chip: Pol2 8WG16, TAF1, H3ac, H3K4me2, H3K27me3 antibodies 0 48 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ ENCODE region-wide location analyses were conducted of binding to the \ initiation-complex form of RNA polymerase II (Pol2), \ TATA-associated factor (TAF1), \ acetylated histone H3 (H3ac), \ lysine-4-dimethylated H3 (H3K4me2), \ suppressor of zeste 12 protein homolog (SUZ12), and \ lysine-27-tri-methylated H3 (H3K27me3).\ The analyses used chromatin extracted from \ IMR90 (lung fibroblast),\ HCT116 (colon epithelial carcinoma),\ HeLa (cervix epithelial adenocarcinoma), and\ THP1 (blood monocyte leukemia) cells.\ The initiation-complex form of Pol2 is associated with the transcription\ start site, as is TAF1. Both H3ac and H3K4me2 are associated with\ transcriptionally-active "open" chromatin.
\ \\ This annotation follows the display conventions for composite tracks. \ Data for each antibody/cell line pair is displayed in a separate \ subtrack. See the top of the track description page for a complete\ list of the subtracks available for this annotation. The subtracks \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by the list of subtracks. To display \ only selected subtracks, uncheck the boxes next to the tracks you wish to \ hide. For more information about the graphical configuration options, click \ the \ Graph\ configuration help link.
\ \\ Chromatin from each of the four cell lines was separately cross-linked, \ precipitated with antibody to one of the six proteins, sheared, amplified and\ hybridized to a PCR DNA tiling array produced at the Ren Lab at UC San Diego. \ The array was composed of 24,537 non-repetitive sequences within the 44 \ ENCODE regions.
\\ For each marker, there were three biological replicates. Each experiment was \ normalized using the median values. The P-value and R-value were \ calculated using the modified single array error model \ (Li, Z. et al., 2003).\ The P-value and R-value were then derived from the weighted average \ results of the replicates.
\\ The displayed values were scaled to 0 - 16, corresponding to negative log \ base 10 of the P-value.
\ \\ Each of the experiments has three biological replicates. The \ array platform, the \ raw and normalized data for each experiment, and the \ image files have all been deposited at the NCBI \ GEO Microarray \ Database.
\ \\ The data for this track were generated at the \ Ren Lab, Ludwig \ Institute for Cancer Research at UC San Diego.
\ \\ Kim, T., Barrera, L.O., Qu, C., van Calcar, S., Trinklein, N.,\ Cooper, S., Luna, R., Glass, C.K., Rosenfeld, M.G., \ Myers, R., Ren, B. \ \ Direct isolation and identification of promoters in the human genome.\ Genome Research 15,830-839 (2005).
\\ Li, Z., Van Calcar, S., Qu, C., Cavenee, W.K., Zhang, M.Z., and Ren, B. \ A global transcriptional regulatory role for c-Myc in \ Burkitt's lymphoma cells.\ Proc. Natl. Acad. Sci. 100(14), 8164-8169 (2003).\
\ Ren, B., Robert, F., Wyrick, J. W., Aparicio, O., Jennings, E.\ G., Simon, I., Zeitlinger, J., Schreiber, J., Hannett, N., Kanin, E.,\ Volkert , T. L., Wilson, C., Bell, S. P. and Young, R. A. \ Genome-wide location and function of DNA-associated proteins\ Science 290(5500), 2306-2309 (2000).
\ \ encodeChip 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Ludwig Institute/UCSD ChIP-chip: Pol2 8WG16, TAF1, H3ac, H3K4me2, H3K27me3 antibodies\ maxHeightPixels 128:16:16\ maxLimit 16\ minLimit 0\ origAssembly hg16\ priority 48.0\ shortLabel LI ChIP Various\ superTrack encodeUcsdChipSuper dense\ track encodeLIChIP\ type bedGraph 4\ viewLimits 0:14\ visibility hide\ encodeUcsdChipSuper LI/UCSD ChIP Ludwig Institute/UC San Diego ChIP-chip 0 48 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks of ChIP-chip data generated by\ the Ludwig Institute/UCSD ENCODE group.\ ChIP-chip, also known as genome-wide location analysis, is a technique for\ isolation and identification of DNA sequences bound by specific proteins in\ cells, including histones. Histone methylation and acetylation serves as a\ stable genomic imprint\ that regulates gene expression and other epigenetic\ phenomena. These histones are found in transcriptionally active domains\ called euchromatin.
\\ These tracks contain ChIP-chip data for transcription initiation\ complex (such as Pol2 and TAF1) and H3, H4 histones in multiple\ cell lines, including HeLa (cervical carcinoma), IMR90 (human fibroblast), \ and HCT116 (colon epithelial carcinoma), with some experiments\ including interferon-gamma induction.\
\ \\ The data for this track were generated at the\ Ren Lab, Ludwig\ Institute for Cancer Research at UC San Diego.
\ \\ Kim TH, Barrera LO, Qu C, Van Calcar S, Trinklein ND, Cooper SJ, Luna RM,\ Glass CK, Rosenfeld MG, Myers RM, Ren B.\ Direct isolation and identification of promoters in the human\ genome.\ Genome Res. 2005 Jun;15(6):830-9.
\\ Li Z, Van Calcar S, Qu C, Cavenee WK, Zhang MQ, Ren B.\ A global transcriptional regulatory role for c-Myc in Burkitt's\ lymphoma cells.\ Proc Natl Acad Sci U S A. 2003 Jul 8;100(14):8164-9.
\\ Ren B, Robert F, Wyrick JJ, Aparicio O, Jennings EG, Simon I, Zeitlinger J,\ Schreiber J, Hannett N, Kanin E et al.\ Genome-wide location and function\ of DNA-associated proteins.\ Science. 2000 Dec 22;290(5500):2306-9.
\\ Kim TH, Barrera LO, Zheng M, Qu C, Singer MA, Richmond TA, Wu Y, Green RD,\ Ren B.\ A high-resolution map of active promoters in the human genome.\ Nature. 2005 Aug 11;436(7052):876-80.
\ encodeChip 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeChip\ longLabel Ludwig Institute/UC San Diego ChIP-chip\ priority 48\ shortLabel LI/UCSD ChIP\ superTrack on\ track encodeUcsdChipSuper\ encodeAffyChIpHl60PvalPu1Hr00 Affy PU1 RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 0hrs) P-Value 0 49 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 49\ shortLabel Affy PU1 RA 0h\ subGroups factor=PU1 time=0h\ track encodeAffyChIpHl60PvalPu1Hr00\ geneid Geneid Genes genePred geneidPep Geneid Gene Predictions 0 49 0 90 100 127 172 177 0 0 0\ This track shows gene predictions from the geneid program developed at the \ Genome Bionformatics \ Laboratory (GBL), which is part of the \ Grup de Recerca\ en Informàtica Biomèdica (GRIB) at the Institut Municipal d'Investigació \ Mèdica (IMIM) / Centre de Regulació Genòmica (CRG) in Barcelona."\ \ \
\\ Geneid is a program to predict genes in anonymous genomic sequences designed \ with a hierarchical structure. In the first step, splice sites, start and stop \ codons are predicted and scored along the sequence using Position Weight Arrays \ (PWAs). Next, exons are built from the sites. Exons are scored as the sum of the \ scores of the defining sites, plus the the log-likelihood ratio of a \ Markov Model for coding DNA. Finally, from the set of predicted exons, the gene \ structure is assembled, maximizing the sum of the scores of the assembled exons. \
\\ Thanks to GBL for providing these data.\
\ genes 1 color 0,90,100\ group genes\ longLabel Geneid Gene Predictions\ priority 49\ shortLabel Geneid Genes\ track geneid\ type genePred geneidPep\ visibility hide\ encodeLIChIPgIF LI gIF ChIP bedGraph 4 Ludwig Institute/UCSD ChIP-chip - Gamma Interferon Experiments 0 49 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ ENCODE region-wide location analysis of histones H3 and H4 with antibodies\ H3K4me2, H3K4me3, H3ac, H4ac, STAT1, RNA polymerase II and TAF1 \ was conducted with ChIP-chip, using chromatin extracted from HeLa\ cells induced for 30 min with interferon-gamma as well as uninduced cells.\ The H3K4me2, H3K4me3, H3ac form of histone H3, and H4ac form of \ histone H4 are associated with up-regulation of gene expression. STAT1\ (signal transducer and activator of transcription) binds to DNA and activates\ transcription in response to various cytokines, including interferon-gamma.\
\ \\ This annotation follows the display conventions for composite \ "wiggle" tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ Chromatin from both induced and uninduced cells was separately cross-linked,\ precipitated with the antibodies, sheared, amplified and hybridized\ to a PCR DNA tiling array produced at the Ren Lab at UC San Diego. The array\ was composed of 24,537 non-repetitive sequences within the 44 ENCODE regions.\
\\ Each state had three or more biological replicates. Each \ experiment was loess-normalized using R. The P-value and R-value were \ calculated using the modified single array error model (Li, Z. et \ al., 2003). The P-value and R-value were then derived from the weighted \ average results of the replicates.
\\ The displayed values were scaled to 0 - 16, corresponding to negative log base\ 10 of the P-value.
\ \\ Each of the two experiments has three biological replicates. The \ array platform, the \ raw and normalized data for each experiment, and the\ image files have all been deposited at the NCBI \ GEO Microarray \ Database (pending approval).
\ \\ The data for this track were generated at the \ Ren Lab, Ludwig \ Institute for Cancer Research at UC San Diego.
\ \\ Kim, T., Barrera, L.O., Qu, C., van Calcar, S., Trinklein, N.,\ Cooper, S., Luna, R., Glass, C.K., Rosenfeld, M.G.,\ Myers, R., Ren, B.\ \ Direct isolation and identification of promoters in the human genome.\ Genome Research 15,830-839 (2005).
\\ Li, Z., Van Calcar, S., Qu, C., Cavenee, W.K., Zhang, M.Z., and Ren, B. \ A global transcriptional regulatory role for c-Myc in Burkitt's \ lymphoma cells.\ Proc. Natl. Acad. Sci. 100(14), 8164-8169 (2003).
\\ Ren, B., Robert, F., Wyrick, J. W., Aparicio, O., Jennings, E.\ G., Simon, I., Zeitlinger, J., Schreiber, J., Hannett, N., Kanin, E.,\ Volkert , T. L., Wilson, C., Bell, S. P. and Young, R. A. \ Genome-wide location and function of DNA-associated proteins\ Science 290(5500), 2306-2309 (2000).
\ encodeChip 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Ludwig Institute/UCSD ChIP-chip - Gamma Interferon Experiments\ maxHeightPixels 128:16:16\ maxLimit 16\ minLimit 0\ origAssembly hg16\ priority 49.0\ shortLabel LI gIF ChIP\ superTrack encodeUcsdChipSuper dense\ track encodeLIChIPgIF\ type bedGraph 4\ viewLimits 0:12\ visibility hide\ encodeGisRnaPet GIS-PET RNA bed 12 Gene Identification Signature Paired-End Tags of PolyA+ RNA 0 49.2 0 0 0 127 127 127 0 0 23 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows the starts and ends of mRNA transcripts \ determined by paired-end ditag (PET) sequencing. PETs are composed of 18 \ bases from either end of a cDNA; 36 bp PETs from many clones were \ concatenated together and cloned into pZero-1 for efficient sequencing. See \ the Methods and References sections below for more details on PET sequencing.\
\\ The PET sequences in this track are full-length transcripts\ derived from two cell lines and mapped on whole genome:\
\ In the graphical display, \ the ends are represented by blocks connected by a horizontal line. In full\ and packed display modes, the arrowheads on the horizontal line represent the \ direction of transcription, and an ID of the format XXXXX-N-M is \ shown to the left of each PET, where X is the unique ID for each\ PET, N indicates the number of mapping locations in the genome \ (1 for a single mapping location, 2 for two mapping locations, and so forth),\ and M is the number of PET sequences at this location. The total \ count of PET sequences mapped to the same locus but with slight nucleotide\ differences may reflect the expression level of the transcripts. PETs that \ mapped to multiple locations may represent low complexity or repetitive \ sequences.
\\ The graphical display also uses color coding to reflect the uniqueness\ and expression level of each PET:
\Color | Mapping | PETS observed at location |
---|---|---|
dark blue | unique | 2 or more |
light blue | unique | 1 |
medium brown | multiple | 2 or more |
light brown | multiple | 1 |
\ PolyA+ RNA was isolated from the cells. A full-length cDNA library was\ constructed and converted into a PET library for Gene\ Identification Signature analysis (Ng et al., 2005). Generation of \ PET sequences involved cloning of cDNA sequences into the plasmid vector, \ pGIS3. pGIS3 contains two MmeI recognition sites that\ flank the cloning site, which were used to produce a 36 bp PET. Each 36 bp PET \ sequence contains 18 bp from each of the 5' and 3' ends of the original \ full-length cDNA clone. The 18 bp 3' signature contains 16 bp 3'-specific \ nucleotides and an AA residual of the polyA tail to indicate the sequence \ orientation. PET sequences were mapped to the genome using the following \ specific criteria:\
\ Most of the PET sequences (more than 90%) were mapped to specific locations\ (single mapping loci). PETs mapping to 2 - 10 locations are\ also included and may represent duplicated genes or pseudogenes in\ the genome.
\ \\ To assess overall PET quality and mapping specificity, the top ten most \ abundant PET clusters that mapped to well-characterized known genes were\ examined. Over 99% of the PETs represented full-length transcripts, and the \ majority fell within ten bp of the known 5' and 3' boundaries of these \ transcripts. The PET mapping was further verified by confirming the existence \ of physical cDNA clones represented by the ditags. PCR primers were designed\ based on the PET sequences and amplified the corresponding cDNA inserts from \ the parental GIS flcDNA library for sequencing analysis. In a set of 86\ arbitrarily-selected PETs representing a wide range of annotation\ categories — including known genes (38 PETs), predicted genes (2 PETs),\ and novel transcripts (46 PETs) — 84 (97.7%) confirmed\ the existence of bona fide transcripts.
\ \\ The GIS-PET libraries and sequence data for transcriptome analysis were \ produced at the \ Genome Institute of Singapore. The data were\ mapped and analyzed by scientists from the Genome Institute of\ Singapore and the \ Bioinformatics Institute of \ Singapore.
\ \\ Ng, P. et al.\ Gene identification signature (GIS) analysis for transcriptome \ characterization and genome annotation.\ Nat. Methods 2(2), 105-11 (2005).
\ encodeGenes 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr17,chr18,chr19,chr2,chr20,chr21,chr22,chr3,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ itemRgb on\ longLabel Gene Identification Signature Paired-End Tags of PolyA+ RNA\ priority 49.2\ shortLabel GIS-PET RNA\ track encodeGisRnaPet\ type bed 12\ visibility hide\ encodeAffyChIpHl60SitesPu1Hr00 Affy PU1 RA 0h bed 3 . Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 0hrs) Sites 0 50 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 50\ shortLabel Affy PU1 RA 0h\ subGroups factor=PU1 time=0h\ track encodeAffyChIpHl60SitesPu1Hr00\ genscan Genscan Genes genePred genscanPep Genscan Gene Predictions 0 50 170 100 0 212 177 127 0 0 0\ This track shows predictions from the \ Genscan program \ written by Chris Burge.\ The predictions are based on transcriptional, \ translational and donor/acceptor splicing signals as well as the length \ and compositional distributions of exons, introns and intergenic regions.
\ \\ This track follows the display conventions for \ gene prediction \ tracks. \
\ The track description page offers the following filter and configuration\ options:\
\ For a description of the Genscan program and the model that underlies it, \ refer to Burge and Karlin (1997) in the References section below. \ The splice site models used are described in more detail in Burge (1998)\ below.
\ \\ Burge C. \ Modeling Dependencies in Pre-mRNA Splicing Signals. \ In: Salzberg S, Searls D, Kasif S, editors. \ Computational Methods in Molecular Biology. \ Amsterdam: Elsevier Science; 1998. p. 127-163.
\\ Burge C, Karlin S. \ Prediction of complete gene structures in human genomic DNA.\ J. Mol. Biol. 1997 Apr 25;268(1):78-94.
\ genes 1 color 170,100,0\ group genes\ longLabel Genscan Gene Predictions\ priority 50\ shortLabel Genscan Genes\ track genscan\ type genePred genscanPep\ visibility hide\ encodeUcsdNgGif LI Ng gIF ChIP bedGraph 4 Ludwig Institute/UCSD ChIP/Chip NimbleGen - Gamma Interferon Experiments 0 50 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays results of the following ChIP-chip (NimbleGen) \ gamma interferon experiments on HeLa cells:\
\ ENCODE region-wide location analysis of trimethylated K4 histone H3 (H3K4me3,\ or triMeH3K4) and RNA polymerase II was conducted with ChIP-chip\ using chromatin extracted from HeLa cells induced for 30 minutes with\ gamma interferon as well as uninduced cells.
\ \\ Chromatin from both induced and uninduced HeLa cells was separately\ cross-linked, precipitated with different antibodies, sheared,\ amplified and hybridized to an oligonucleotide tiling array produced by \ NimbleGen Systems. \ The array includes non-repetitive sequences within the 44 ENCODE\ regions tiled from NCBI Build 35 (UCSC hg17) with 50-mer probes at 38\ bp interval. Resulting genomic coordinates were translated to NCBI\ Build 34 (UCSC hg16).
\\ Intensity values for biological replicate arrays were combined after\ quantile normalization using \ R. \ The averages of the quantile\ normalized intensity values for each probe were then median-scaled and\ loess-normalized using R to obtain the adjusted log R values.
\ \\ Three biological replicates were used to generate the track for each\ factor at each time point with the exception of RNA Pol2 uninduced,\ for which only two biological replicates were used.
\ \\ The data for this track were generated at the \ Ren Lab, \ Ludwig Institute for Cancer Research at UC San Diego.
\ \ encodeChip 0 autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Ludwig Institute/UCSD ChIP/Chip NimbleGen - Gamma Interferon Experiments\ maxHeightPixels 128:16:16\ maxLimit 5\ minLimit -5\ origAssembly hg17\ priority 50.0\ shortLabel LI Ng gIF ChIP\ track encodeUcsdNgGif\ type bedGraph 4\ viewLimits -1:3\ windowingFunction mean\ caseControl Case Control chromGraph Case Control Consortium 0 50.2 0 0 0 127 127 127 0 0 0\ This track displays the trend p-values (-log10) of the seven \ diseases reported by The Wellcome Trust Case Control Consortium (see \ References section below). The diseases studied are:\
\ Reported p-values were taken for each of the Illumina550 probes on the genome.\ For visualization purposes, these p-values were logged and negated.
\\ Most of the data for chromosome X is omitted from this track \ because of the difference in statistical power and the difference in \ control sets used. \ The trend p-value captures the majority of the associated signals, therefore\ it was used to create this track instead of the genotypic p-value.\
\\ For the complete dataset, visit \ The Wellcome\ Trust Case Control Consortium Web Site.\
\ \\ When making use of these data, please cite:
\\ The Wellcome Trust Case Control Consortium.\ Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.\ Nature. 2007 Jun 7;447(7145):661-78.
\ phenDis 0 compositeTrack on\ group phenDis\ linesAt 5,10\ longLabel Case Control Consortium\ minMax 0,15\ priority 50.2\ shortLabel Case Control\ track caseControl\ type chromGraph\ visibility hide\ nimhBipolar NIMH Bipolar chromGraph NIMH Bipolar Disease 0 50.3 0 0 0 127 127 127 0 0 0\ This track displays the p-values (-log10) of the bipolar disorder \ pooled data as reported by the NIMH Genetics Initiative Bipolar Disorder \ Consortium (see References below). \ The Consortium performed a genome-wide asociation study on two populations:\
\ Reported p-values were taken for each of the Illumina550 probes on the genome. \ For visualization purposes, these p-values were logged and negated. \ These p-values have not been Bonferroni adjusted.
\\ All of the people in the US population had an affected sibling. 96% of the\ people in the German\ sample reported that both parents and grandparents were born in Germany.\ The German sample was used to replicate the US sample. The following\ conditions were applied to the US sample:\
\ When individually genotyped, 76% remained significant in the US sample,\ and 36% of those remained significant in the German sample. Combined p-values\ were also calculated for the SNPs yielding the most significant find\ rs1012053 p-value = 1.5e-8 for the gene DGKH diacylglyceral kinase,\ which is still significant after Bonferroni adjustment. DGKH\ protein is key in the lithium-sensitive phosphatidyl inositol pathway.\ Several other risk SNPs were identified of small effect.\
\\ For the complete dataset, visit The National Instututes of Mental Health \ (NIMH) \ MAP Genetics Web Site.\
\ \\ Baum AE, Akula N, Cabanero M, Cardona I, Corona W, Klemens B, Schulze TG, \ Cichon S, Rietschel M, Nothen MM et al.\ \ A genome-wide association study implicates diacylglycerol kinase eta \ (DGKH) and several other genes in the etiology of bipolar disorder.\ Molecular Psychiatry. 2007 May 8;:1-11.
\\ In addition, those using data from the NIMH sample should cite \ the NIMH Genetics Initiative Bipolar Disorder Consortium by \ use of the language specified \ in the Distribution Agreement from the \ NIMH Center \ for Collaborative Genetic Studies.\
\ \ phenDis 0 compositeTrack on\ group phenDis\ linesAt 1,3\ longLabel NIMH Bipolar Disease\ minMax 0,5\ priority 50.3\ shortLabel NIMH Bipolar\ track nimhBipolar\ type chromGraph\ visibility hide\ exoniphy Exoniphy genePred Exoniphy Human/Mouse/Rat/Dog 0 50.9 173 17 162 214 136 208 0 0 0\ The exoniphy program identifies evolutionarily conserved protein-coding exons in\ multiple, aligned sequences using a phylogenetic hidden Markov model\ (phylo-HMM), a kind of statistical model that simultaneously describes exon\ structure and exon evolution. This track shows exoniphy predictions\ for the human Jul. 2003 (hg16), mouse Feb. 2003 (mm3), and rat Jun. 2003\ (rn3) genomes, as aligned by the multiz program.\
\\ Exoniphy is described in Siepel A & Haussler D (2004), "Computational\ identification of evolutionarily conserved exons," RECOMB '04.\ Multiz is described in Blanchette M et al. (2004), "Aligning\ multiple genomic sequences with the threaded blockset aligner,"\ Genome Res. 14:708-175.\ genes 1 color 173,17,162\ group genes\ longLabel Exoniphy Human/Mouse/Rat/Dog\ priority 50.9\ shortLabel Exoniphy\ track exoniphy\ type genePred\ visibility hide\ exoniphyGene Exoniphy Genes genePred Predicted Genes and Gene Fragments from Exoniphy Exons (Human/Mouse/Rat) 0 50.9 173 17 162 214 136 208 0 0 0 genes 1 color 173,17,162\ group genes\ longLabel Predicted Genes and Gene Fragments from Exoniphy Exons (Human/Mouse/Rat)\ priority 50.9\ shortLabel Exoniphy Genes\ track exoniphyGene\ type genePred\ visibility hide\ encodeAffyChIpHl60PvalPu1Hr02 Affy PU1 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 2hrs) P-Value 0 51 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 51\ shortLabel Affy PU1 RA 2h\ subGroups factor=PU1 time=2h\ track encodeAffyChIpHl60PvalPu1Hr02\ genscanExtra Genscan Extra bed 6 . Genscan Extra (Suboptimal) Exon Predictions 0 51 180 90 0 217 172 127 0 0 1 chr22, genes 1 chromosomes chr22,\ color 180,90,0\ group genes\ longLabel Genscan Extra (Suboptimal) Exon Predictions\ priority 51\ shortLabel Genscan Extra\ track genscanExtra\ type bed 6 .\ visibility hide\ gwasCatalog GWAS Catalog bed 4 + NHGRI Catalog of Published Genome-Wide Association Studies 0 51 0 90 0 127 172 127 0 0 0 http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?type=rs&rs=$$
\ This track displays single nucleotide polymorphisms (SNPs) identified by published \ Genome-Wide Association Studies (GWAS), collected in the \ Catalog of Published \ Genome-Wide Association Studies (www.genome.gov/gwastudies) at the \ National Human Genome Research \ Institute (NHGRI).\ Some abbreviations\ are used above.\
\\ From http://www.genome.gov/gwastudies:\
\ The genome-wide association study (GWAS) publications listed here\ include only those attempting to assay at least 100,000 single\ nucleotide polymorphisms (SNPs) in the initial stage. Publications are\ organized from most to least recent date of publication, indexing from\ online publication if available. Studies focusing only on candidate\ genes are excluded from this catalog. Studies are identified through\ weekly PubMed literature searches, daily NIH-distributed compilations\ of news and media reports and occasional comparisons with an existing\ database of GWAS literature \ (HuGE Navigator).\\ \ \
\ From http://www.genome.gov/gwastudies:\
\ SNP-trait associations listed here are limited to those with p-values\ < 1.0 x 10-5 (see full methods for additional details). Multipliers of\ powers of 10 in p-values are rounded to the nearest single digit; odds\ ratios and allele frequencies are rounded to two decimals. Standard\ errors are converted to 95 percent confidence intervals where\ applicable. Allele frequencies, p-values and odds ratios derived from\ the largest sample size, typically a combined analysis (initial plus\ replication studies), are recorded below if reported; otherwise,\ statistics from the initial study sample are recorded. For\ quantitative traits, information on % variance explained, SD\ increment, or unit difference is reported where available. Odds ratios (OR)\ < 1 in the original paper are converted to OR > 1 for the alternate\ allele. Where results from multiple genetic models are available, we\ prioritized effect sizes (ORs or beta-coefficients) as follows: 1)\ genotypic model, per-allele estimate; 2) genotypic model, heterozygote\ estimate, 3) allelic model, allelic estimate.\\ \
\ Gene regions corresponding to SNPs were identified from the UCSC\ Genome Browser. Gene names and risk alleles are those reported by the\ authors in the original paper. Only one SNP within a gene or region of\ high linkage disequilibrium is recorded unless there was evidence of\ independent association.\
\ Occasionally the term "pending" is used to denote one or more studies\ that we identified as an eligible GWAS, but for which SNP information\ has not yet been extracted; studies of CNVs are also noted as pending.\
\ Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, Collins FS, Manolio TA.\ Potential etiologic and functional implications of genome-wide association \ loci for human diseases and traits. PNAS 2009 Jun 9;106(23):9362-7.\ phenDis 1 color 0,90,0\ group phenDis\ longLabel NHGRI Catalog of Published Genome-Wide Association Studies\ priority 51\ shortLabel GWAS Catalog\ snpTable snp130\ snpVersion 130\ track gwasCatalog\ type bed 4 +\ url http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?type=rs&rs=$$\ urlLabel dbSNP:\ visibility hide\ augustus Augustus genePred Augustus Gene Predictions 0 51.7 180 0 0 217 127 127 0 0 0
\ This track shows predictions of AUGUSTUS.\ AUGUSTUS is available through the GOBICS web\ server.
\ \\ This annotation follows the display conventions for composite tracks.\ To display only selected subtracks, uncheck the boxes next to the tracks \ you wish to hide. This track also follows the display conventions for \ gene prediction \ tracks.
\\ This track contains an optional codon coloring feature that allows users to quickly validate and compare gene predictions.\ To display codon colors, select the genomic codons option from the\ Color track by codons pull-down menu. Click the\ Help on codon coloring \ link for more information about this feature.
\ \\ AUGUSTUS uses a generalized hidden Markov model (GHMM) that models coding and \ non-coding sequence, splice sites, the branch point region, translation start \ and end, and lengths of exons and introns. This version has been trained on a \ set of 1284 human genes.\ \
\ This subtrack was made using hints from several other tracks:\
This subtrack was made using only the target genome sequence and \ evolutionary conservation. The conservation information was extracted \ from the Exoniphy track and the PhastCons Conserved Elements track. \ Further, hints about retroposed genes were used, that are based only \ on previous de novo predictions of AUGUSTUS. No transcribed \ sequences were used for this track.
\ \\ The Augustus subtracks were created by Mario Stanke. The TransMap track \ was created by Mark Diekhans, the Retroposed Genes tracks by Robert \ Baertsch, and the Exoniphy and PhastCons Conserved Elements tracks by \ Adam Siepel's group.
\ \\ Stanke M. \ Gene prediction with a hidden Markov model.\ Ph.D. thesis. Universität Göttingen, Germany. 2004.
\ \\ Stanke M, Steinkamp R, Waack S, Morgenstern B. \ AUGUSTUS: a web server for gene finding in eukaryotes. \ Nucl Acids Res. 2004 Jul 1;32(Web Server Issue):W309-12.
\ \\ Stanke M, Tzvetkova A, Morgenstern B. \ \ AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved \ gene prediction in the human genome. \ Genome Biology. 2006;7(Suppl 1):S11.
\ \\ Stanke M, Waack S. \ Gene prediction with a hidden Markov model and a new intron\ submodel.\ Bioinformatics. 2003 Sep;19(Suppl. 2):ii215-25.
\ genes 1 color 180,0,0\ group genes\ longLabel Augustus Gene Predictions\ priority 51.7\ shortLabel Augustus\ track augustus\ type genePred\ visibility hide\ encodeAffyChIpHl60SitesPu1Hr02 Affy PU1 RA 2h bed 3 . Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 2hrs) Sites 0 52 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 52\ shortLabel Affy PU1 RA 2h\ subGroups factor=PU1 time=2h\ track encodeAffyChIpHl60SitesPu1Hr02\ encodeRna Known+Pred RNA bed 6 + Known and Predicted RNA Transcription in the ENCODE Regions 0 52 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows the locations of known and predicted non-protein-coding RNA \ genes and pseudogenes that fall within the ENCODE regions. It contains all \ information in Sean Eddy's RNA Genes track for these regions, combined with \ computational predictions generated by Jakob Skou Pedersen's EvoFold algorithm. \ In addition to the fields contained in the RNA Genes track, this track also \ includes ENCODE-related fields describing overlap with transcribed regions and \ repeats.
\\ Feature types in this annotation include:\
\ The locations of the RNA genes and pseudogenes are represented by blocks in the \ graphical display, color-coded as follows: \
\ The display may be filtered to show only those items \ with unnormalized scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page.
\ \\ The RNA Genes track was supplemented with EvoFold predictions and filtered to \ include only those items that lie within the ENCODE regions. \ Regions that are at least 10 percent Repeatmasked are flagged because no \ transcriptional data is available for them. A region is considered transcribed \ if at least 10 percent overlaps with any Affymetrix transcribed fragment \ (transfrag), derived from six microarray experiments, or Yale \ transcriptionally-active region (TAR), derived from 15 microarray experiments. \ In these cases, each array from which the overlapped transfrags and TARs were \ derived is listed.
\\ EvoFold is a comparative method that exploits the evolutionary signal\ of genomic multiple-sequence alignments for identifying conserved\ functional RNA structures. The method makes use of phylogenetic\ stochastic context-free grammars (phylo-SCFGs), which are combined\ probabilistic models of RNA secondary structure and primary sequence\ evolution. The predictions consist both of a specific RNA secondary\ structure and an overall score. The overall score is essentially a\ log-odd score phylo-SCFG modeling the constrained evolution of\ stem-pairing regions and one which only models unpaired regions.
\\ Two sets of EvoFold predictions are included in this track. The first,\ labeled EvoFold, contains predictions based on the conserved elements of an \ 8-way vertebrate alignment of the human, chimpanzee, mouse, rat, dog, chicken, \ zebrafish, and Fugu assemblies. The second set of predictions, TBA23_EvoFold, \ was based on the conserved elements of the 23-way TBA alignments present in the \ ENCODE regions. When a pair of these predictions overlap, only the EvoFold \ prediction is shown.
\ \\ These data were kindly provided by Sean Eddy at Washington University,\ Jakob Skou Pedersen at UC Santa Cruz, and The Encode Consortium.
\\ This annotation track was generated by Matt Weirauch.
\ \\ Knudsen, B. and J.J. Hein. \ RNA secondary structure prediction using stochastic context-free \ grammars and evolutionary history.\ Bioinformatics 15(6), 446-54 (1999).
\\ Pedersen, J.S., Bejerano, G. and Haussler, D. Identification and\ classification of conserved RNA secondary structures in the human\ genome. (In preparation).\ encodeGenes 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ longLabel Known and Predicted RNA Transcription in the ENCODE Regions\ origAssembly hg16\ priority 52\ shortLabel Known+Pred RNA\ track encodeRna\ type bed 6 +\ visibility hide\ rnaGene RNA Genes bed 6 + Non-coding RNA Genes (dark) and Pseudogenes (light) 0 52 170 80 0 230 180 130 0 0 0
\ This track shows the location of non-protein coding RNA genes and\ pseudogenes. \
\ Feature types include:\
\ \
\
Eddy-tRNAscanSE (tRNA genes, Sean Eddy):
\
tRNAscan-SE 1.23 with default parameters.\
Score field contains tRNAscan-SE bit score; >20 is good, >50 is great.
\
Eddy-BLAST-tRNAlib (tRNA pseudogenes, Sean Eddy):
\
Wublast 2.0, with options "-kap wordmask=seg B=50000 \
W=8 cpus=1".\
Score field contains % identity in blast-aligned region.\
Used each of 602 tRNAs and pseudogenes predicted by tRNAscan-SE\
in the human oo27 assembly as queries. Kept all nonoverlapping\
regions that hit one or more of these with P <= 0.001.
\
Eddy-BLAST-snornalib (known snoRNAs and snoRNA pseudogenes, Steve Johnson):
\
Wublastn 2.0, with options "-V=25 -hspmax=5000 -kap wordmask=seg \
B=5000 W=8 cpus=1".\
Score field contains blast score.\
Used each of 104 unique snoRNAs in snorna.lib as a query.\
Any hit >=95% full length and >=90% identity is annotated as a\
"true gene".\
Any other hit with P <= 0.001 is annotated as a "related \
sequence" and interpreted as a putative pseudogene.
\
Eddy-BLAST-otherrnalib \
(non-tRNA, non-snoRNA noncoding RNAs with GenBank entries\
for the human gene.):
\
Wublastn 2.0 [15 Apr 2002]\
with options: "-kap -cpus=1 -wordmask=seg -W=8 -E=0.01 -hspmax=0\
-B=50000 -Z=3000000000". Exceptions to this are:\
\ The score field contains the blastn score.\ 41 unique miRNAs and 29 other ncRNAs were used as queries.\ Any hit >=95% full length and >=95% identity is annotated as a\ "true gene".\ Any other hit with P <= 0.001 and >= 65% identity is annotated\ as a "related sequence". There is an exception to this:\ all miRNAs consist of 16-26 bp sequences in GenBank\ and are annotated only if they are 100% full length and have\ 100% identity. The set of miRNAs used consists of Let-7 from\ Pasquinelli et al. (2000) and 40 miRNAs from Mourelatos et al. (2002),\ as mentioned in the references section below.\ \
\ These data were kindly provided by Sean Eddy at Washington University.
\ \\ Pasquinelli AE, Reinhart BJ, Slack F, Martindale MQ, Kuroda MI, Maller B,\ Hayward DC, Ball EE, Degnan B, Müller P, et al.\ \ Conservation of the sequence and temporal expression of let-7 \ heterochronic regulatory RNA. Nature.\ 2000 Nov 2;408(6808):86-9.
\\ Mourelatos Z, Dostie J, Paushkin S, Sharma A, Charroux B, Abel L,\ Rappsilber J, Mann M, Dreyfuss G.\ \ miRNPs: a novel class of ribonucleoproteins containing numerous microRNAs.\ Genes Dev. 2002 Mar 15;16(6):720-8.
\ \ genes 1 altColor 230,180,130\ color 170,80,0\ group genes\ longLabel Non-coding RNA Genes (dark) and Pseudogenes (light)\ priority 52\ shortLabel RNA Genes\ track rnaGene\ type bed 6 +\ visibility hide\ encodeAffyChIpHl60PvalPu1Hr08 Affy PU1 RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 8hrs) P-Value 0 53 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 53\ shortLabel Affy PU1 RA 8h\ subGroups factor=PU1 time=8h\ track encodeAffyChIpHl60PvalPu1Hr08\ superfamily Superfamily bed 4 + Superfamily/SCOP: Proteins Having Homologs with Known Structure/Function 0 53 150 0 0 202 127 127 0 0 0 http://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY/cgi-bin/gene.cgi?genome=\ The \ Superfamily \ track shows proteins having homologs with known structures or functions.
\\ Each entry on the track shows the coding region of a gene (based on Ensembl gene predictions).\ In full display mode, the label for an entry consists of the names of \ all known protein domains encoded by this gene. This \ usually contains structural and/or functional descriptions that provide valuable \ information to help users get a quick grasp of the biological significance of the \ gene.
\ \\ Data are downloaded from the Superfamily server.\ Using the cross-reference between Superfamily entries and Ensembl gene prediction \ entries and their alignment to the appropriate genome, the associated data are \ processed to generate a simple BED format track.
\\ Superfamily was developed by\ Julian\ Gough at the MRC Laboratory\ of Molecular Biology, Cambridge.
\\ Gough, J., Karplus, K., Hughey, R. and\ Chothia, C. (2001). "Assignment of Homology to Genome Sequences using a\ Library of Hidden Markov Models that Represent all Proteins of Known Structure". \ J. Mol. Biol., 313(4), 903-919.
\ \ genes 1 color 150,0,0\ group genes\ longLabel Superfamily/SCOP: Proteins Having Homologs with Known Structure/Function\ priority 53\ shortLabel Superfamily\ track superfamily\ type bed 4 +\ url http://supfam.mrc-lmb.cam.ac.uk/SUPERFAMILY/cgi-bin/gene.cgi?genome=\ visibility hide\ pseudoYale Yale Pseudo genePred Yale Pseudogenes. 0 53 100 50 0 255 240 200 1 0 0 http://www.pseudogene.org/cgi-bin/display-by-acc.cgi?id=$$\ This track shows identified pseudogenes as recorded in the Yale\ Pseudogene Database. For information on how these pseudogenes were\ identified and access to the database, see http://www.pseudogene.org. \ genes 1 altColor 255,240,200\ autoTranslate 0\ color 100,50,0\ group genes\ longLabel Yale Pseudogenes.\ priority 53\ shortLabel Yale Pseudo\ spectrum on\ track pseudoYale\ type genePred\ url http://www.pseudogene.org/cgi-bin/display-by-acc.cgi?id=$$\ visibility hide\ pseudoYale60 Yale Pseudo60 genePred Yale Pseudogenes based on Ensembl Release 60 0 53 0 0 0 127 127 127 1 0 0 http://tables.pseudogene.org/index.cgi?table=Human60&value=$$ genes 1 autoTranslate 0\ dataVersion December 2010\ gClass_Ambiguous 100,91,191\ gClass_Duplicated 100,50,0\ gClass_Processed 180,0,0\ geneClasses Processed Duplicated Ambiguous\ group genes\ itemClassTbl pseudoYale60Class\ longLabel Yale Pseudogenes based on Ensembl Release 60\ priority 53\ shortLabel Yale Pseudo60\ spectrum on\ track pseudoYale60\ type genePred\ url http://tables.pseudogene.org/index.cgi?table=Human60&value=$$\ urlLabel Yale pseudogene.org link:\ visibility hide\ mrna Human mRNAs psl . Human mRNAs from GenBank 1 54 0 0 0 127 127 127 1 0 0
\ The mRNA track shows alignments between human mRNAs\ in GenBank and the genome.
\ \\ This track follows the display conventions for \ PSL alignment tracks. In dense display mode, the items that\ are more darkly shaded indicate matches of better quality.
\\ The description page for this track has a filter that can be used to change \ the display mode, alter the color, and include/exclude a subset of items \ within the track. This may be helpful when many items are shown in the track \ display, especially when only some are relevant to the current task.
\\ To use the filter:\
\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare mRNAs against the genomic sequence. For more \ information about this option, go to the\ \ Codon and Base Coloring for Alignment Tracks page.\ Several types of alignment gap may also be colored; \ for more information, go to the\ \ Alignment Insertion/Deletion Display Options page.\
\ \\ GenBank human mRNAs were aligned against the genome using the \ blat program. When a single mRNA aligned in multiple places, \ the alignment having the highest base identity was found. \ Only alignments having a base identity level within 0.5% of\ the best and at least 96% base identity with the genomic sequence were kept.\
\ \\ The mRNA track was produced at UCSC from mRNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J,\ Wheeler DL.\ GenBank: update. Nucleic Acids Res.\ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ rna 1 baseColorDefault diffCodons\ baseColorUseCds genbank\ baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelPolyA on\ indelQueryInsert on\ longLabel $Organism mRNAs from GenBank\ priority 54\ shortLabel $Organism mRNAs\ showDiffBasesAllScales .\ spectrum on\ table all_mrna\ track mrna\ type psl .\ visibility dense\ encodeAffyChIpHl60SitesPu1Hr08 Affy PU1 RA 8h bed 3 . Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 8hrs) Sites 0 54 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 54\ shortLabel Affy PU1 RA 8h\ subGroups factor=PU1 time=8h\ track encodeAffyChIpHl60SitesPu1Hr08\ encodeSangerChipH3H4 Sanger ChIP bedGraph 4 Sanger ChIP/Chip (histones H3,H4 antibodies in GM06990, K562 cells) 0 54 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ ENCODE region-wide location analysis of H3 and H4 histones\ was conducted employing ChIP-chip using chromatin extracted from GM06990\ (lymphoblastoid) and K562 (myeloid leukemia-derived) cells.\ Histone methylation and acetylation serves as a stable genomic imprint\ that regulates gene expression and other epigenetic \ phenomena. These histones are found in transcriptionally active domains \ called euchromatin.
\\
\
|
\ This annotation follows the display conventions for composite \ "wiggle" tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ Chromatin from the cell line was cross-linked with 1% formaldehyde,\ precipitated with antibody binding to the histone, and sheared and hybridized to\ a DNA array. DNA was not amplified prior to hybridization.
\\ The raw and transformed data files reflect fold enrichment over background,\ averaged over six replicates.
\ \\ There are six replicates: two technical replicates (immunoprecipitations)\ for each of the three biological replicates (cell cultures).
\\ Raw and transformed (averaged) data can be downloaded from the Wellcome\ Trust Sanger Institute FTP site as indicated in the table above.
\ \\ The data for this track were generated by the \ ENCODE investigators at the \ Wellcome Trust Sanger \ Institute, Hinxton, UK.
\ encodeChip 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Sanger ChIP/Chip (histones H3,H4 antibodies in GM06990, K562 cells)\ maxHeightPixels 128:40:16\ maxLimit 75\ minLimit -46.1\ priority 54.0\ shortLabel Sanger ChIP\ track encodeSangerChipH3H4\ type bedGraph 4\ viewLimits 0:10\ visibility hide\ luNega UCSC Pseudo bed 12 UCSC Pseudogenes 0 54 0 0 0 127 127 127 0 0 0 genes 1 group genes\ longLabel UCSC Pseudogenes\ priority 54\ shortLabel UCSC Pseudo\ track luNega\ type bed 12\ visibility hide\ acescan ACEScan genePred ACEScan alternative conserved Human-Mouse exon predictions 0 55 125 38 205 190 146 230 0 0 0\ \ This track identifies predicted Alternative Conserved Exons (human-mouse \ conservation), as predicted by ACEScan. These are exons that are \ present in some transcripts, but skipped by alternative splicing in \ other transcripts in both human and mouse. Alternate use of skipped \ exons has important consequences during gene expression and in disease.\ \
\ \ Putative alternative conserved exons on mRNAs were identified using a \ machine-learning algorithm, Regularized Least-Squares Classification. \ Characteristics of known exons that have been skipped in both human \ and mouse mRNAs were determined by considering factors such as\ exon and intron length, splice-site strength, sequence conservation, \ and region-specific oligonucleotide composition.
\ \A training set was made by comparing known exons that are skipped \ in some transcripts to exons never skipped.\ These characteristics were then applied to the whole genome to predict\ skipped exons in other transcripts. This track displays exons with \ positive ACEScan scores.
\ \For further details of the method used to generate this annotation, \ please refer to Yeo et al. (2005). \ \
\ \ Thanks to Gene Yeo at the Crick-Jacobs Center, Salk Institute and \ Christopher Burge, MIT, for providing this annotation. For additional \ information on ACEscan predictions please contact \ geneyeo@salk.edu\ \ or \ cburge@mit.edu.\ \ \
\ \ Yeo GW, Van Nostrand E, Holste D, Poggio T, Burge CB (2005), \ \ Identification and analysis of alternative splicing events conserved \ in human and mouse. \ Proc Natl Acad Sci U S A. 2005 \ Feb 22;102(8):2850-5.\ \
\ genes 1 color 125,38,205\ group genes\ longLabel ACEScan alternative conserved Human-Mouse exon predictions\ priority 55\ shortLabel ACEScan\ track acescan\ type genePred\ visibility hide\ encodeAffyChIpHl60PvalPu1Hr32 Affy PU1 RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 32hrs) P-Value 0 55 75 150 0 165 202 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 75,150,0\ longLabel Affymetrix ChIP/Chip (PU1 retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 55\ shortLabel Affy PU1 RA 32h\ subGroups factor=PU1 time=32h\ track encodeAffyChIpHl60PvalPu1Hr32\ encodeStanfordChipSuper Stanf ChIP Stanford ChIP-chip 0 55 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks of ChIP-chip data \ generated by the Stanford ENCODE group.\ ChIP-chip, also known as genome-wide location analysis, is a technique for\ isolation and identification of DNA sequences bound by specific proteins in\ cells. \
\ These tracks contain data for the Sp1 and Sp3 \ transcription factors in multiple cell lines,\ including HCT116 (colon epithelial carcinoma), \ Jurkat (T-cell lymphoblast), and K562 (myeloid leukemia).\ \
\ The Sp1 and Sp3 data were generated in the\ Richard M. Myers lab at Stanford University (now at HudsonAlpha Institute for Biotechnology).
\ \\ Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P,\ Brockman W, Kim TK, Koche RP et al.\ Genome-wide maps of chromatin state in pluripotent and \ lineage-committed cells.\ Nature. 2007 Aug 2;448, 553-60.
\\ Trinklein ND, Murray JI, Hartman SJ, Botstein D, Myers RM.\ The role of heat shock transcription factor 1 in the\ genome-wide regulation of the mammalian heat shock response.\ Mol. Biol. Cell. 2004 Mar;15(3):1254-61.
\ encodeChip 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeChip\ longLabel Stanford ChIP-chip\ priority 55.0\ shortLabel Stanf ChIP\ superTrack on\ track encodeStanfordChipSuper\ encodeStanfordChip Stanf ChIP bedGraph 4 Stanford ChIP-chip (HCT116, Jurkat, K562 cells; Sp1, Sp3 ChIP) 0 55 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays regions bound by Sp1 and Sp3, in the following \ three cell lines, \ assayed by ChIP and microarray hybridization:\
\\
\\ \ \\
\\ Cell Line Classification Isolated From \ HCT 116 colorectal carcinoma colon \ Jurkat, Clone E6-1 acute T cell leukemia T lymphocyte \ K-562 chronic myelogenous leukemia (CML) bone marrow
\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ Chromatin IP was performed as described in Trinklein et\ al. (2004). Amplified and labeled ChIP DNA was hybridized to\ oligo tiling arrays produced by NimbleGen, along with a total genomic\ reference sample. The data for each array were median subtracted (log\ 2 ratios) and normalized (divided by the standard deviation). \ The value given for each probe is \ the transformed mean ratio of ChIP DNA:Total DNA.
\ \\ Three biological replicates and two technical replicates were\ performed. The Myers lab is currently testing the specificity and\ sensitivity using real-time PCR.
\ \\ These data were generated in the Richard M. \ Myers lab at Stanford University (now at\ HudsonAlpha Institute for Biotechnology).
\ \\ Trinklein, N.D., Chen, W.C., Kingston, R.E. and Myers, R.M. \ The role of heat shock transcription factor 1 in the \ genome-wide regulation of the mammalian heat shock response.\ Mol. Biol. Cell 15(3), 1254-61 (2004).
\ encodeChip 0 altColor 150,0,25\ autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 120,0,20\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Stanford ChIP-chip (HCT116, Jurkat, K562 cells; Sp1, Sp3 ChIP)\ maxHeightPixels 128:16:16\ maxLimit 114\ minLimit 0\ origAssembly hg16\ priority 55.0\ shortLabel Stanf ChIP\ superTrack encodeStanfordChipSuper dense\ track encodeStanfordChip\ type bedGraph 4\ viewLimits 0:10\ visibility hide\ tightMrna Tight mRNAS psl . Tightly Filtered Human mRNAs from GenBank 0 55 0 0 0 127 127 127 1 0 0 rna 1 baseColorDefault diffCodons\ baseColorUseCds genbank\ baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelPolyA on\ indelQueryInsert on\ longLabel Tightly Filtered $Organism mRNAs from GenBank\ priority 55\ shortLabel Tight mRNAS\ showDiffBasesAllScales .\ spectrum on\ track tightMrna\ type psl .\ visibility hide\ encodeStanfordChipSmoothed Stanf ChIP Score bedGraph 4 Stanford ChIP-chip Smoothed Score 0 55.1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays smoothed (sliding-window mean) scores for \ regions bound by Sp1 and Sp3 in the following \ three cell lines, assayed by ChIP and microarray hybridization:
\\
\\ \ \\
\\ Cell Line Classification Isolated From \ HCT 116 colorectal carcinoma colon \ Jurkat, Clone E6-1 acute T cell leukemia T lymphocyte \ K-562 chronic myelogenous leukemia (CML) bone marrow
\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ Chromatin IP was performed as described in Trinklein et\ al. (2004). Amplified and labeled ChIP DNA was hybridized to\ oligo tiling arrays produced by NimbleGen along with a total genomic\ reference sample. The data for each array were median subtracted (log\ 2 ratios) and normalized (divided by the standard deviation).
\\ The transformed mean ratios of ChIP DNA:Total DNA for \ all probes were then smoothed by calculating a sliding-window mean. \ Windows of six neighboring probes (sliding two probes at a time) were \ used; within each window, the highest and lowest value were dropped, \ and the remaining 4 values were averaged. To increase the \ contrast between high and low values for visual display, the average \ was converted to a score by the formula:
\ score = 8^(average) * 10.\ These scores are for visualization purposes; for all analyses, \ the raw ratios, which are available in the Stanf ChIP track, should be used. \ \ \
\ Three biological replicates and two technical replicates were\ performed. The Myers lab is currently testing the specificity and\ sensitivity using real-time PCR.
\ \\ These data were generated in the Richard M. Myers lab \ at Stanford University (now at \ HudsonAlpha Institute for Biotechnology).
\ \\ This track shows alignments between human expressed sequence tags \ (ESTs) in GenBank and the genome that show signs of splicing when\ aligned against the genome. ESTs are single-read sequences, typically about \ 500 bases in length, that usually represent fragments of transcribed genes.\
\\ To be considered spliced, an EST must show \ evidence of at least one canonical intron, i.e. the genomic \ sequence between EST alignment blocks must be at least 32 bases in \ length and have GT/AG ends. By requiring splicing, the level \ of contamination in the EST databases is drastically reduced\ at the expense of eliminating many genuine 3' ESTs.\ For a display of all ESTs (including unspliced), see the \ human EST track.
\ \\ This track follows the display conventions for \ PSL alignment tracks. In dense display mode, darker shading\ indicates a larger number of aligned ESTs.
\\ The strand information (+/-) indicates the\ direction of the match between the EST and the matching\ genomic sequence. It bears no relationship to the direction\ of transcription of the RNA with which it might be associated.
\\ The description page for this track has a filter that can be used to change \ the display mode, alter the color, and include/exclude a subset of items \ within the track. This may be helpful when many items are shown in the track \ display, especially when only some are relevant to the current task.
\\ To use the filter:\
\ This track may also be configured to display base labeling, a feature that\ allows the user to display all bases in the aligning sequence or only those \ that differ from the genomic sequence. For more information about this option,\ go to the \ \ Base Coloring for alignment Tracks page.\ Several types of alignment gap may also be colored; \ for more information, go to the \ \ Alignment Insertion/Deletion Display Options page.\
\ \\ To make an EST, RNA is isolated from cells and reverse\ transcribed into cDNA. Typically, the cDNA is cloned\ into a plasmid vector and a read is taken from the 5'\ and/or 3' primer. For most — but not all — ESTs, the\ reverse transcription is primed by an oligo-dT, which\ hybridizes with the poly-A tail of mature mRNA. The\ reverse transcriptase may or may not make it to the 5'\ end of the mRNA, which may or may not be degraded.
\\ In general, the 3' ESTs mark the end of transcription\ reasonably well, but the 5' ESTs may end at any point\ within the transcript. Some of the newer cap-selected\ libraries cover transcription start reasonably well. Before the \ cap-selection techniques\ emerged, some projects used random rather than poly-A\ priming in an attempt to retrieve sequence distant from the\ 3' end. These projects were successful at this, but as\ a side effect also deposited sequences from unprocessed\ mRNA and perhaps even genomic sequences into the EST databases.\ Even outside of the random-primed projects, there is a\ degree of non-mRNA contamination. Because of this, a\ single unspliced EST should be viewed with considerable\ skepticism.
\\ To generate this track, human ESTs from GenBank were aligned \ against the genome using blat. Note that the maximum intron length\ allowed by blat is 750,000 bases, which may eliminate some ESTs with very \ long introns that might otherwise align. When a single \ EST aligned in multiple places, the alignment having the \ highest base identity was identified. Only alignments having\ a base identity level within 0.5% of the best and at least 96% base identity \ with the genomic sequence are displayed in this track.
\ \\ This track was produced at UCSC from EST sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, \ Wheeler DL. \ GenBank: update. Nucleic Acids Res.\ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ rna 1 baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelQueryInsert on\ intronGap 30\ longLabel $Organism ESTs That Have Been Spliced\ maxItems 300\ priority 56\ shortLabel Spliced ESTs\ showDiffBasesAllScales .\ spectrum on\ track intronEst\ type psl est\ visibility dense\ encodeStanfordChipJohnson Stanf ChIP2 bedGraph 4 Stanford ChIP-chip Johnson (GMO6990, HeLa, HepG2, Jurkat, K562 cells; GABP, SRF, TAF, NRST/REST ChIP) 0 56 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 altColor 150,0,25\ autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 120,0,20\ compositeTrack on\ dataVersion Mar 2007\ group encodeChip\ longLabel Stanford ChIP-chip Johnson (GMO6990, HeLa, HepG2, Jurkat, K562 cells; GABP, SRF, TAF, NRST/REST ChIP)\ maxHeightPixels 128:16:16\ maxLimit 1000\ minLimit 500\ origAssembly hg17\ priority 56.0\ shortLabel Stanf ChIP2\ track encodeStanfordChipJohnson\ type bedGraph 4\ viewLimits 0:10\ visibility hide\ encodeUCDavisE2F1Median UCD Ng E2F1 bedGraph 4 UC Davis ChIP/Chip NimbleGen - E2F1 ab, HeLa Cells 0 56 32 128 180 180 128 32 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ ChIP analysis was performed using an antibody to E2F1 and HeLa cell chromatin.\ E2F1 is a transcription factor important in controlling cell division.\ Three independently crosslinked preparations of HeLa cells\ were used to provide three independent biological replicates. ChIP\ assays were performed (with minor modifications which can be provided\ upon request) using the protocol found at\ The Farnham \ Laboratory. Array hybridizations were performed using standard\ NimbleGen Systems Inc. conditions.
\ \\ This track may be configured in a variety of ways to highlight different \ aspects of the displayed data. For more information about the graphical \ configuration options, click the \ Graph\ configuration help link.
\ \\ Ratio intensity values (E2F1 vs. total) for each of three biological \ replicates were calculated and converted to log2. Each set \ of ratio values was then independently scaled by its Tukey bi-weight mean. \ The three replicates were then combined by taking the median scaled \ log2 ratio for each oligo.
\ \\ Primers were chosen to correspond to 13 individual peaks. \ PCR reactions were performed for each of the 13 primer sets using \ amplicons derived from each of three biological samples (39 reactions). \ The PCR reactions confirmed that all of the 13 chosen peaks were bound \ by E2F1 in all three biological samples.
\ \\ These data were contributed by Mike Singer, Kyle Munn, Nan Jiang, \ Todd Richmond and Roland Green of NimbleGenSystems, Inc., and Matt Oberley, \ David Inman, Mark Bieda, Shally Xu and Peggy Farnham of Farnham Lab.
\ encodeChip 0 altColor 180,128,32\ autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 32,128,180\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel UC Davis ChIP/Chip NimbleGen - E2F1 ab, HeLa Cells\ maxHeightPixels 128:16:16\ maxLimit 4.59\ minLimit -2.41\ origAssembly hg16\ priority 56.0\ shortLabel UCD Ng E2F1\ track encodeUCDavisE2F1Median\ type bedGraph 4\ viewLimits 0:2.2\ visibility hide\ est Human ESTs psl est Human ESTs Including Unspliced 0 57 0 0 0 127 127 127 1 0 0\ This track shows alignments between human expressed sequence tags \ (ESTs) in GenBank and the genome. ESTs are single-read sequences, \ typically about 500 bases in length, that usually represent fragments of \ transcribed genes.
\\ NOTE: As of April, 2007, we no longer include GenBank sequences \ that contain the following URL as part of the record:\
\ http://fulllength.invitrogen.com\\ Some of these entries are the result of alignment to pseudogenes,\ followed by "correction" of the EST to match the genomic sequence. \ It is therefore not the sequence of the actual EST and makes it appear that \ the EST is transcribed. Invitrogen no longer sells the clones.\ \ \
\ This track follows the display conventions for \ PSL alignment tracks. In dense display mode, the items that\ are more darkly shaded indicate matches of better quality.
\\ The strand information (+/-) indicates the\ direction of the match between the EST and the matching\ genomic sequence. It bears no relationship to the direction\ of transcription of the RNA with which it might be associated.
\\ The description page for this track has a filter that can be used to change \ the display mode, alter the color, and include/exclude a subset of items \ within the track. This may be helpful when many items are shown in the track \ display, especially when only some are relevant to the current task.
\\ To use the filter:\
\ This track may also be configured to display base labeling, a feature that\ allows the user to display all bases in the aligning sequence or only those \ that differ from the genomic sequence. For more information about this option,\ click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\
\ \\ To make an EST, RNA is isolated from cells and reverse\ transcribed into cDNA. Typically, the cDNA is cloned\ into a plasmid vector and a read is taken from the 5'\ and/or 3' primer. For most — but not all — ESTs, the\ reverse transcription is primed by an oligo-dT, which\ hybridizes with the poly-A tail of mature mRNA. The\ reverse transcriptase may or may not make it to the 5'\ end of the mRNA, which may or may not be degraded.
\\ In general, the 3' ESTs mark the end of transcription\ reasonably well, but the 5' ESTs may end at any point\ within the transcript. Some of the newer cap-selected\ libraries cover transcription start reasonably well. Before the \ cap-selection techniques\ emerged, some projects used random rather than poly-A\ priming in an attempt to retrieve sequence distant from the\ 3' end. These projects were successful at this, but as\ a side effect also deposited sequences from unprocessed\ mRNA and perhaps even genomic sequences into the EST databases.\ Even outside of the random-primed projects, there is a\ degree of non-mRNA contamination. Because of this, a\ single unspliced EST should be viewed with considerable\ skepticism.
\\ To generate this track, human ESTs from GenBank were aligned \ against the genome using blat. Note that the maximum intron length\ allowed by blat is 750,000 bases, which may eliminate some ESTs with very \ long introns that might otherwise align. When a single \ EST aligned in multiple places, the alignment having the \ highest base identity was identified. Only alignments having\ a base identity level within 0.5% of the best and at least 96% base identity \ with the genomic sequence were kept.
\ \\ This track was produced at UCSC from EST sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J,\ Wheeler DL.\ GenBank: update. Nucleic Acids Res.\ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ rna 1 baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelQueryInsert on\ intronGap 30\ longLabel $Organism ESTs Including Unspliced\ maxItems 300\ priority 57\ shortLabel $Organism ESTs\ spectrum on\ table all_est\ track est\ type psl est\ visibility hide\ encodeAffyChIpHl60PvalRnapHr00 Affy Pol2 RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 0hrs) P-Value 0 57 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 57\ shortLabel Affy Pol2 RA 0h\ subGroups factor=Pol2 time=0h\ track encodeAffyChIpHl60PvalRnapHr00\ rgdEst RGD EST psl est RGD EST 0 57.5 12 12 120 133 133 187 1 0 0 http://rgd.mcw.edu/generalSearch/RgdSearch.jsp?quickSearch=1&searchKeyword=\ This track shows expressed sequence tags (ESTs) downloaded from the\ Rat Genome Database (RGD). An EST is a partial sequence of a randomly-chosen \ cDNA, obtained from the results of a single DNA sequencing reaction. ESTs \ are used to identify transcribed regions in genomic sequence and to \ characterize patterns of gene expression in the tissue from which the \ cDNA was derived.
\ \\ The data used to create this annotation were obtained from the file \ RGD_EST.gff downloaded from the RGD website.
\ \\ Thanks to the RGD for \ providing this annotation. RGD is funded by grant HL64541 entitled \ "Rat Genome Database", awarded to Dr. Howard J Jacob, Medical College of \ Wisconsin, from the National Heart Lung and Blood Institute \ (NHLBI) of the \ National Institutes of Health \ (NIH).
\ \ rna 1 color 12,12,120\ group rna\ longLabel RGD EST\ priority 57.5\ shortLabel RGD EST\ spectrum on\ track rgdEst\ type psl est\ url http://rgd.mcw.edu/generalSearch/RgdSearch.jsp?quickSearch=1&searchKeyword=\ visibility hide\ encodeAffyChIpHl60SitesRnapHr00 Affy Pol2 RA 0h bed 3 . Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 0hrs) Sites 0 58 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 58\ shortLabel Affy Pol2 RA 0h\ subGroups factor=Pol2 time=0h\ track encodeAffyChIpHl60SitesRnapHr00\ tightEst Tight ESTs psl est Tightly Filtered Human ESTs Including Unspliced 0 58 0 0 0 127 127 127 1 0 0 rna 1 baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Tightly Filtered $Organism ESTs Including Unspliced\ priority 58\ shortLabel Tight ESTs\ spectrum on\ track tightEst\ type psl est\ visibility hide\ encodeAffyChIpHl60PvalRnapHr02 Affy Pol2 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 2hrs) P-Value 0 59 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 59\ shortLabel Affy Pol2 RA 2h\ subGroups factor=Pol2 time=2h\ track encodeAffyChIpHl60PvalRnapHr02\ encodeAffyChIpHl60SitesRnapHr02 Affy Pol2 RA 2h bed 3 . Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 2hrs) Sites 0 60 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 60\ shortLabel Affy Pol2 RA 2h\ subGroups factor=Pol2 time=2h\ track encodeAffyChIpHl60SitesRnapHr02\ encodeBu_ORChID1 BU ORChID wig -0.56 1.58 Boston University ORChID (OH Radical Cleavage Intensity Database) 0 60 44 44 200 200 44 44 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays the predicted hydroxyl radical cleavage\ intensity on naked DNA for each nucleotide in the ENCODE regions.\ Because the hydroxyl radical cleavage intensity is proportional to the solvent\ accessible surface area of the deoxyribose hydrogen atoms \ (Balasubramanian et al., 1998), this track represents a structural \ profile of the DNA in the ENCODE regions.
\\ Please visit the ORChID website maintained by the Tullius group for \ access to experimental hydroxyl radical cleavage data, and to a server \ which can be used to predict the cleavage pattern for any input sequence.
\ \\ This track may be configured in a variety of ways to highlight different \ aspects of the displayed data. The graphical configuration options \ are shown at the top of the track description page. For more information, \ click the \ Graph\ configuration help link.
\ \\ Hydroxyl radical cleavage intensity predictions were performed using\ an in-house sliding trimer window (STW) algorithm. This algorithm\ draws data from the ·OH Radical Cleavage Intensity Database\ (ORChID), which contains more than 150 experimentally determined cleavage\ patterns. These predictions are fairly accurate, with a Pearson\ coefficient of ~0.85 between the predicted and experimentally\ determined cleavage intensities. For more details on the hydroxyl\ radical cleavage method, see the References section below.
\ \\ The STW algorithm has been cross-validated by removing each test\ sequence from the training set and performing a prediction. The\ mean correlation coefficient (between predicted and experimental\ cleavage patterns) from this study was 0.85.
\ \\ These data were generated through the combined effort of\ Bo Pang at MIT and \ \ Jason Greenbaum,\ \ Steve Parker, and \ \ Tom Tullius of Boston University.
\ \ \\ Balasubramanian, B., Pogozelski, W.K., and Tullius, T.D. \ DNA strand breaking by the hydroxyl radical is governed by the \ accessible surface areas of the hydrogen atoms of the DNA backbone. \ Proc. Natl. Acad. Sci. USA 95(17), 9738-9743 (1998).
\\ Price, M. A., and Tullius, T. D. \ Using the Hydroxyl Radical to Probe DNA Structure.\ Meth. Enzymol. 212, 194-219 (1992).
\\ Tullius, T. D. Probing DNA Structure with Hydroxyl Radicals. \ In Current Protocols in Nucleic Acid Chemistry, (eds.\ Beaucage, S.L., Bergstrom, D.E., Glick, G.D. and Jones, R.A.) (Wiley, 2001), \ pp. 6.7.1-6.7.8.
\ encodeChrom 0 altColor 200,44,44\ autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 44,44,200\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel Boston University ORChID (OH Radical Cleavage Intensity Database)\ maxHeightPixels 128:36:16\ origAssembly hg16\ priority 60.0\ shortLabel BU ORChID\ spanList 1\ track encodeBu_ORChID1\ type wig -0.56 1.58\ viewLimits 0.22:0.5\ visibility hide\ windowingFunction Mean\ encodeUtexChipSuper UT-Austin ChIP University of Texas, Austin ChIP-chip and STAGE 0 60 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks of ChIP data generated by\ the Iyer laboratory at \ The University of Texas at Austin.\ Two technologies are presented in this super-track: ChIP-chip and ChIP-STAGE.\ ChIP-chip, also known as genome-wide location analysis, is a technique for\ isolation and identification of DNA sequences bound by specific proteins in\ cells. Instead of detecting bound fragments by microarray, ChIP-STAGE uses\ Sequence Tag Analysis of Genomic Enrichment, or STAGE, technology by cloning \ STAGE tags, sequencing and mapping to the human genome.\
\\ These tracks contain ChIP data for several transcription\ factors, including c-Myc, E2F4 and STAT1, in cell lines\ including 2091 (foreskin fibroblast) and HeLa (cervical carcinoma).\
\ \\ ChIP-chip data were contributed by Jonghwan Kim, Akshay Bhinge, and Vishy Iyer \ from the Iyer lab\ at The University of Texas at Austin, in collaboration with Mike Singer, \ Nan Jiang, and Roland Green of NimbleGen Systems, Inc.
\\ ChIP-STAGE data were contributed by Jonghwan Kim, Akshay Bhinge, and Vishy Iyer \ from the Iyer lab, and by Ghia Euskirchen and Michael Snyder of the \ Snyder lab at\ Yale University.
\ \\ Bhinge AA, Kim J, Euskirchen G, Snyder M, Iyer VR.\ \ Mapping the chromosomal targets of STAT1 by Sequence Tag Analysis of Genomic \ Enrichment (STAGE). \ Genome Res. 2007 Jun;17(6):910-6.
\\ Kim J, Bhinge A, Morgan XC, Iyer VR. \ Mapping DNA-protein interactions in large genomes by sequence tag \ analysis of genomic enrichment. Nat Methods. 2005 Jan;2(1):47-53.\
\ \ encodeChip 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeChip\ longLabel University of Texas, Austin ChIP-chip and STAGE\ origAssembly hg17\ priority 60\ shortLabel UT-Austin ChIP\ superTrack on\ track encodeUtexChipSuper\ visibility hide\ encodeUtexChip UT-Austin ChIP bedGraph 4 University of Texas, Austin ChIP-chip 0 60 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ ChIP-chip analysis of c-Myc and E2F4 was performed using 2091 foreskin \ fibroblasts and HeLa cells. ChIP was carried out from normally-growing HeLa \ cells and from 2091 quiescent (0.1% serum FBS), as well as serum-stimulated \ (10% FBS, 4hrs), fibroblasts. \ Microarray hybridizations were performed using NimbleGen ENCODE arrays and \ protocols.
\ \\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ Chromatin from each cell line under a given condition was cross-linked with \ 1% formaldehyde, sheared, precipitated with antibody, and reverse cross-linked \ to obtain enriched DNA fragments. ChIP material was amplified and hybridized \ to a NimbleGen ENCODE region array. \ The raw and processed files reflect fold enrichment over the mock ChIP sample, \ which was used as a reference in the hybridization.
\ \\ Each of the four experiments has three independent biological replicates. \ Data from all three replicates were averaged to generate a single data file. \ The NimbleGen method for hit identification was used to generate the peaks at a false positive rate of <= 0.05.
\ \\ These data were contributed by Jonghwan Kim, Akshay Bhinge, and Vishy Iyer \ from the Iyer lab\ at the University of Texas at Austin, in collaboration with Mike Singer, \ Nan Jiang, and Roland Green of NimbleGen Systems, Inc.
\ \\ Kim, J., Bhinge, A., Morgan, X.C. and Iyer, V.R. \ Mapping DNA-protein interactions in large genomes by sequence tag \ analysis of genomic enrichment. Nature Methods 2, 47-53 \ (2005).
\ encodeChip 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChip\ longLabel University of Texas, Austin ChIP-chip\ maxHeightPixels 128:16:16\ maxLimit 4.35\ minLimit -3.23\ origAssembly hg17\ priority 60\ shortLabel UT-Austin ChIP\ subGroup1 dataType Data_Type raw=Raw peaks=Peaks\ superTrack encodeUtexChipSuper dense\ track encodeUtexChip\ type bedGraph 4\ viewLimits 0:2\ visibility hide\ windowingFunction mean\ encodeAffyChIpHl60PvalRnapHr08 Affy Pol2 RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 8hrs) P-Value 0 61 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 61\ shortLabel Affy Pol2 RA 8h\ subGroups factor=Pol2 time=8h\ track encodeAffyChIpHl60PvalRnapHr08\ encodeNhgriDnaseHs NHGRI DNaseI-HS bed 5 . NHGRI DNaseI-Hypersensitive Sites 0 61 0 0 0 127 127 127 1 0 19 chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays DNaseI-hypersensitive sites in CD4+ T-cells\ before and after activation by anti-CD3 and anti-CD28 antibodies. \ DNaseI-hypersensitive sites are\ associated with gene regulatory regions, particularly for upregulated\ genes. CD4+ T-cells, also known as helper or inducer T cells, are\ involved in generating an immune response. CD4+ T-cells are also\ one of the primary targets of the HIV virus.
\ \\ The top subtrack of this annotation corresponds to unactivated T cells; the \ bottom subtrack shows activated T cells. Within the subtracks, the gray and \ black blocks (which appear as vertical lines when the display is zoomed-out) \ represent probable hypersensitive sites. The darker the blocks, the \ more likely the site is to be hypersensitive.
\\ To display only selected subtracks, uncheck the boxes next to the tracks you \ wish to hide. The display may also be filtered to show only those items\ with unnormalized scores that meet or exceed a certain threshhold. To set a\ threshhold, type the minimum score into the text box at the top of the \ description page.
\ \\ Primary human CD4+ T cells were activated by incubation with anti-CD3 and \ anti-CD28 antibodies for 24 hours.\ DNaseI-hypersensitive sites were cloned from the cells \ before and after activation, and\ sequenced using massively parallel signature sequencing \ (Brenner et al., 2000; Crawford et al., 2006). \ Only those clusters of multiple DNaseI library sequences that map within 500 \ bases of each other are displayed.\ Each cluster has a unique identifier, visible when the track is displayed\ in full or packed mode. The last digit of each identifier represents the \ number of sequences that map within that particular cluster. The sequence number\ is also reflected in the score, e.g. a cluster of two sequences scores \ 500, three sequences scores 750 and four or more sequences scores 1000.
\ \\ Real-time PCR assay was used to verify valid\ DNaseI-hypersensitive sites. Approximately 50% of\ clusters of two sequences are valid. These clusters are shown\ in light gray. 80% of clusters of three sequences are valid, indicated by\ dark gray. 100% of clusters of four or more\ sequences are valid, shown in black.
\\ This data set includes confirmed elements for 35 of the 44\ ENCODE regions.\ It is estimated that these data identify\ 10-20% of all hypersensitive sites within CD4+ T cells. Further\ sequencing will be required to identify additional sites.
\ \\ These data were produced at the \ Collins Lab \ at NHGRI. Thanks to Gregory E. Crawford and Francis S. Collins for supplying \ the information for this track.
\ \\ Brenner S, Johnson M, Bridgham J, Golda G, Lloyd DH, Johnson D, Luo S, \ McCurdy S, Foy M, Ewan M et al.\ Gene expression analysis by massively parallel signature \ sequencing (MPSS) on microbead arrays.\ Nat. Biotechnol. 2000 Jun;18(6):597-8.\
\ Crawford GE, Holt IE, Mullikin JC, Tai D, Blakesley R, Bouffard G, Young A, \ Masiello C, Green ED, Wolfsberg TG et al.\ Identifying gene regulatory elements by genome-wide recovery of \ DNase hypersensitive sites.\ Proc. Natl. Acad. Sci. USA. 2004 Jan 27;101(4):992-7.
\\ Crawford GE, Holt IE, Whittle J, Webb BD, Tai D, Davis S, Margulies EH, Chen Y, \ Bernat JA, Ginsburg D et al.\ Genome-wide mapping of DNase hypersensitive \ sites using massively parallel signature sequencing (MPSS).\ Genome Res. 2006 Jan;16(1):123-31.\ (See also NHGRI's\ data site for the project.)
\\ McArthur M, Gerum S, Stamatoyannopoulos G.\ Quantification of DNaseI-sensitivity by real-time PCR: \ quantitative analysis of DNaseI-hypersensitivity of the mouse beta-globin \ LCR.\ J. Mol. Biol. 2001 Oct 12;313(1):27-34.
\ encodeChrom 1 chromosomes chr1,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel NHGRI DNaseI-Hypersensitive Sites\ origAssembly hg16\ priority 61.0\ shortLabel NHGRI DNaseI-HS\ track encodeNhgriDnaseHs\ type bed 5 .\ useScore 1\ visibility hide\ encodeAffyChIpHl60SitesRnapHr08 Affy Pol2 RA 8h bed 3 . Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 8hrs) Sites 0 62 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 62\ shortLabel Affy Pol2 RA 8h\ subGroups factor=Pol2 time=8h\ track encodeAffyChIpHl60SitesRnapHr08\ encodeStanfordMeth Stanf Meth bedGraph 4 Stanford Methylation Digest: Be2C, CRL1690, HCT116, HT1080, HepG2, JEG3, Snu182, U87 0 62 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays experimentally determined regions of unmethylated\ CpGs in the ENCODE regions. These experiments were performed in eight\ cell lines, each of which is displayed as a separate subtrack:\
\\
\\ \ \\
\\ Cell Line Classification Isolated From \ BE(2)-C neuroblastoma brain (metastatic, from bone marrow) \ CRL-1690™ hybridoma B lymphocyte \ HCT 116 colorectal carcinoma colon \ HT-1080 fibrosarcoma connective tissue \ HepG2 hepatocellular carcinoma liver \ JEG-3 choriocarcinoma placenta \ SNU-182 hepatocellular carcinoma liver \ U-87 MG glioblastoma-astrocytoma brain
\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ High molecular weight genomic DNA was prepared from each cell line.\ The genomic DNA was digested with a cocktail of six methyl-sensitive\ restriction enzymes (AciI, HhaI, BstUI, HpaII, HgaI, and HpyCH4IV) and\ size selected to deplete the genome of unmethylated regions. Digested\ and undigested DNA (control) were amplified, labeled, and hybridized\ to oligo tiling arrays produced by NimbleGen. The data for each array\ were median subtracted (log 2 ratios) and normalized (divided by the \ standard deviation). The value given\ for each array probe is the transformed mean ratio of\ undigested:digested genomic DNA.
\\ Higher scores in this track indicate regions that are more strongly\ methylated, due to the greater difference between the undigested and\ digested hybridization signals.\
\ \\ Three biological replicates and two technical replicates were done for\ each of the eight cell lines. \ The Myers lab is currently testing the specificity and sensitivity using \ real-time PCR. \
\ \\ These data were generated in the Richard M. Myers lab at Stanford University (now at \ HudsonAlpha Institute for Biotechnology). Please contact \ David Johnson\ \ for further information regarding the methods and the data for this track.
\ encodeChrom 0 altColor 150,0,25\ autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 120,0,20\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel Stanford Methylation Digest: Be2C, CRL1690, HCT116, HT1080, HepG2, JEG3, Snu182, U87\ maxHeightPixels 128:16:16\ maxLimit 114\ minLimit 0\ origAssembly hg16\ priority 62.0\ shortLabel Stanf Meth\ track encodeStanfordMeth\ type bedGraph 4\ viewLimits 0:10\ visibility hide\ encodeStanfordMethSmoothed Stanf Meth Score bedGraph 4 Stanford Methylation Digest Smoothed Score 0 62.1 120 0 20 150 0 25 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays smoothed (sliding-window mean) scores for \ experimentally determined regions of unmethylated\ CpGs in the ENCODE regions. These experiments were performed in eight\ cell lines, each of which is displayed as a separate subtrack:\
\\
\\ \ \\
\\ Cell Line Classification Isolated From \ BE(2)-C neuroblastoma brain (metastatic, from bone marrow) \ CRL-1690™ hybridoma B lymphocyte \ HCT 116 colorectal carcinoma colon \ HT-1080 fibrosarcoma connective tissue \ HepG2 hepatocellular carcinoma liver \ JEG-3 choriocarcinoma placenta \ SNU-182 hepatocellular carcinoma liver \ U-87 MG glioblastoma-astrocytoma brain
\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ High molecular weight genomic DNA was prepared from each cell line.\ The genomic DNA was digested with a cocktail of six methyl-sensitive\ restriction enzymes (AciI, HhaI, BstUI, HpaII, HgaI, and HpyCH4IV) and\ size selected to deplete the genome of unmethylated regions. Digested\ and undigested DNA (control) were amplified, labeled, and hybridized\ to oligo tiling arrays produced by NimbleGen. The data for each array\ were median subtracted (log 2 ratios) and normalized (divided by the \ standard deviation).
\\ The transformed mean ratios of undigested:digested genomic DNA for \ all probes were then smoothed by calculating a sliding-window mean. \ Windows of six neighboring probes (sliding two probes at a time) were \ used; within each window, the highest and lowest value were dropped, \ and the remaining four values were averaged. In order to increase the \ contrast between high and low values for visual display, the average \ was converted to a score by the formula:
\ score = 8^(average) * 10\ These scores are for visualization purposes; for all analyses, \ the raw ratios, which are available in the Stanf Meth track, should be used. \ \
\ Higher scores in this track indicate regions that are more strongly\ methylated, due to the greater difference between the undigested and\ digested hybridization signals.\
\ \\ Three biological replicates and two technical replicates were done for\ each of the eight cell lines. \ The Myers lab is currently testing the specificity and sensitivity using \ real-time PCR.
\ \\ These data were generated in the Richard M. Myers lab at Stanford University (now at \ \ HudsonAlpha Institute for Biotechnology). \ Please contact David Johnson\ \ for further information regarding the methods and the data for this track.
\ \ encodeChrom 0 altColor 150,0,25\ autoScale off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 120,0,20\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel Stanford Methylation Digest Smoothed Score\ maxHeightPixels 128:16:16\ maxLimit 364088\ minLimit 0\ origAssembly hg16\ priority 62.1\ shortLabel Stanf Meth Score\ track encodeStanfordMethSmoothed\ type bedGraph 4\ viewLimits 0:1000\ visibility hide\ encodeAffyChIpHl60PvalRnapHr32 Affy Pol2 RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 32hrs) P-Value 0 63 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 63\ shortLabel Affy Pol2 RA 32h\ subGroups factor=Pol2 time=32h\ track encodeAffyChIpHl60PvalRnapHr32\ evofold EvoFold bed 6 + EvoFold Predictions of RNA Secondary Structure 0 63 20 90 0 137 172 127 0 0 0\ This track shows RNA secondary structure predictions made with the\ EvoFold program, a comparative method that exploits the evolutionary signal\ of genomic multiple-sequence alignments for identifying conserved\ functional RNA structures.
\ \\ Track elements are labeled using the convention ID_strand_score.\ When zoomed out beyond the base level, secondary structure prediction regions\ are indicated by blocks, with the stem-pairing regions shown in a darker shade \ than unpaired regions. Arrows indicate the predicted strand.\ When zoomed in to the base level, the specific secondary structure predictions \ are shown in parenthesis format. The confidence score for each position is\ indicated in grayscale, with darker shades corresponding to higher scores.\
\ The details page for each track element shows the predicted secondary structure \ (labeled SS anno), together with details of the multiple species \ alignments at that location. Substitutions relative to the human sequence are \ color-coded according to their compatibility with the predicted secondary \ structure (see the color legend on the details page). Each prediction is \ assigned an overall score and a sequence of position-specific scores. The \ overall score measures evidence for any functional RNA structures in the given \ region, while the position-specific scores (0 - 9) measure the confidence of \ the base-specific annotations. Base-pairing positions are annotated \ with the same pair symbol. The offsets are provided to ease\ visual navigation of the alignment in terms of the human sequence. The offset\ is calculated (in units of ten) from the start position of the element on \ the positive strand or from the end position when on the negative strand.
\\ The graphical display may be filtered to show only those track elements \ with scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page.
\ \\ Evofold makes use of phylogenetic\ stochastic context-free grammars (phylo-SCFGs), which are combined\ probabilistic models of RNA secondary structure and primary sequence\ evolution. The predictions consist of both a specific RNA secondary\ structure and an overall score. The overall score is essentially a\ log-odd score between a phylo-SCFG modeling the constrained evolution of\ stem-pairing regions and one which only models unpaired regions.
\\ The predictions for this track were based on the conserved elements of\ an 8-way vertebrate alignment of the human, chimpanzee, mouse, rat,\ dog, chicken, zebrafish, and Fugu assemblies. NOTE: These predictions\ were originally computed on the hg17 (May 2004) human assembly, from\ which the hg16 (July 2003), hg18 (May 2006), and hg19 (Feb 2009) predictions\ were lifted. As a result, the multiple alignments shown on the track\ details pages may differ from the 8-way alignments used for their\ prediction. Additionally, some weak predictions have been eliminated\ from the set displayed on hg18 and hg19. The hg17 prediction set corresponds\ exactly to the set analyzed in the EvoFold paper referenced below.\
\ \\ The EvoFold program and browser track were developed by \ Jakob \ Skou Pedersen of the UCSC Genome Bioinformatics Group, now at \ Aarhus University, Denmark.
\The RNA secondary structure is rendered using the VARNA Java applet.\ \
\ Pedersen JS, Bejerano G, Siepel A, Rosenbloom K,\ Lindblad-Toh K, Lander ES, Kent J, Miller W,\ Haussler D. Identification and classification of conserved RNA\ secondary structures in the human genome. PLoS Comput\ Biol. 2006 Apr;2(4):e33.
\ \\ Knudsen B, Hein J. \ RNA secondary structure prediction using stochastic context-free \ grammars and evolutionary history.\ Bioinformatics. 1999 Jun;15(6):446-54.
\\ Pedersen JS, Meyer IM, Forsberg R, Simmonds P, Hein J. \ A comparative method for finding and folding RNA\ secondary structures within protein-coding regions. \ Nucleic Acids Res. 2004 Sep 24;32(16):4925-36.
\ \\ Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom\ K, Clawson H, Spieth J, Hillier LW, Richards S, Weinstock GM,\ Wilson RK, Gibbs RA, Kent WJ, Miller W, Haussler D. \ Evolutionarily conserved elements in vertebrate, insect, worm, \ and yeast genomes.\ Genome Res. 2005 Aug;15(8):1034-50.
\ genes 1 color 20,90,0\ group genes\ longLabel EvoFold Predictions of RNA Secondary Structure\ mafTrack mzPt1Mm3Rn3Gg2_pHMM\ origAssembly hg17\ priority 63\ shortLabel EvoFold\ track evofold\ type bed 6 +\ visibility hide\ miRNA miRNA bed 8 . MicroRNAs from miRBase 0 63 255 64 64 255 159 159 1 0 0 http://microrna.sanger.ac.uk/cgi-bin/sequences/mirna_entry.pl?id=$$\ The miRNA track shows microRNAs from the\ \ miRBase at The \ Wellcome Trust Sanger Institute.
\ \\ Mature miRNAs (miRs) are represented by \ thick blocks. The predicted stem-loop portions of the primary transcripts\ are indicated by thinner blocks. miRNAs in the sense orientation are shown in\ black; those in the reverse orientation are colored grey. When a single \ precursor produces two mature miRs from its 5' and 3' parts, it is displayed \ twice with the two different positions of the mature miR.
\\ To display only those items that exceed a specific unnormalized score, enter\ a minimum score between 0 and 1000 in the text box at the top of the track \ description page.\
\ \\ Mature and precursor miRNAs from the miRNA Registry were\ aligned against the genome using blat.\ The extents of the precursor sequences were not generally known, and were\ predicted based on base-paired hairpin structure. \ miRBase is described in Griffiths-Jones, S. et al. (2006).\ The miRNA Registry is\ described in Griffiths-Jones, S. (2004) and Weber, M.J. (2005) in the \ References section below.
\ \\ \ This track was created by Michel Weber of \ Laboratoire de Biologie Moléculaire Eucaryote,\ CNRS Université Paul Sabatier\ (Toulouse, France), Yves Quentin of Laboratoire de Microbiologie et Génétique\ Moléculaires (Toulouse, France) and Sam Griffiths-Jones of\ \ The Wellcome Trust Sanger Institute\ (Cambridge, UK).\
\\ When making use of these data, please cite:
\\ Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ.\ miRBase: microRNA sequences, targets and gene nomenclature.\ Nucleic Acids Res. 2006 Jan 1;34(Database issue):D140-4.
\\ Griffiths-Jones S. \ The microRNA Registry.\ Nucleic Acids Res. 2004 Jan 1;32(Database issue):D109-11.
\\ Weber MJ. \ New human and mouse microRNA genes found by homology search.\ Febs J. 2005 Jan;272(1):59-73.
\\ You may also want to cite The Wellcome Trust Sanger Institute \ miRNA Registry.
\\
The following publication provides guidelines on miRNA annotation:\
Ambros V, Bartel B, Bartel DP, Burge CB, Carrington JC, Chen X,\
Dreyfuss G, Eddy SR, Griffiths-Jones S, Marshall M et al.\
A uniform system for microRNA annotation. \
RNA. 2003 Mar;9(3):277-9.
\
For more information on blat, see \
Kent WJ.\
BLAT - the BLAST-like alignment tool.\
Genome Res. 2002 Apr;12(4):656-64.
\ This track displays translated blat alignments of vertebrate and\ invertebrate mRNA in \ GenBank from organisms other than human.\ \
\ This track follows the display conventions for \ PSL alignment tracks. In dense display mode, the items that\ are more darkly shaded indicate matches of better quality.
\\ The strand information (+/-) for this track is in two parts. The\ first + indicates the orientation of the query sequence whose\ translated protein produced the match (here always 5' to 3', hence +).\ The second + or - indicates the orientation of the matching \ translated genomic sequence. Because the two orientations of a DNA \ sequence give different predicted protein sequences, there are four \ combinations. ++ is not the same as --, nor is +- the same as -+.
\\ The description page for this track has a filter that can be used to change \ the display mode, alter the color, and include/exclude a subset of items \ within the track. This may be helpful when many items are shown in the track \ display, especially when only some are relevant to the current task.
\\ To use the filter:\
\ This track may also be configured to display codon coloring, a feature that\ allows the user to quickly compare mRNAs against the genomic sequence. For more \ information about this option, go to the \ \ Codon and Base Coloring for Alignment Tracks page.\ Several types of alignment gap may also be colored; \ for more information, go to the \ \ Alignment Insertion/Deletion Display Options page.\
\ \\ The mRNAs were aligned against the human genome using translated blat. \ When a single mRNA aligned in multiple places, the alignment having the \ highest base identity was found. Only those alignments having a base \ identity level within 1% of the best and at least 25% base identity with the \ genomic sequence were kept.
\ \\ The mRNA track was produced at UCSC from mRNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, \ Wheeler DL. \ GenBank: update. Nucleic Acids Res.\ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ rna 1 baseColorUseCds genbank\ baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Non-$Organism mRNAs from GenBank\ priority 63\ shortLabel Other mRNAs\ showDiffBasesAllScales .\ spectrum on\ track xenoMrna\ type psl xeno\ visibility hide\ encodeUncFaire UNC FAIRE bedGraph 4 UNC FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements) 0 63 20 150 20 50 100 50 0 0 21 chr1,chr4,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chr10,chrX,\ Formaldehyde-Assisted Isolation of Regulatory Elements (FAIRE) is a procedure \ used to isolate chromatin that is resistant to the formation of protein-DNA \ cross-links. These tracks display FAIRE data from 2091 fibroblast cells \ hybridized to high-resolution NimbleGen arrays that tile the ENCODE regions. The four \ datasets, in practical terms, can be thought of as independent replicates. \ However, because they were part of a series of experiments aimed at optimizing \ cross-linking conditions in human cells, the data represent different \ cross-linking times (1, 2, 4, and 7 minutes). Although the individual \ replicates are not displayed, the replicate data and also the signal averages \ and the peaks for the averages can be \ downloaded.
\ \\ The FAIRE data are represented by three subtracks. One subtrack shows the \ average normalized log2 ratios for the tiled probes; the other two \ subtracks \ display peaks. The peaks in one set were determined using PeakFinder \ software supplied by NimbleGen. A false positive rate (FPR) was estimated for \ the peak \ set using a permutation-based method. All peaks had an FPR of < 0.01. The\ peaks in the other set (Apr. 2006 update) were identified by ChIPOTle, a \ peak-finding algorithm that uses a sliding window to identify statistically\ significant signals that comprise a peak. A null distribution was determined \ by reflecting the negative data, which is presumed to be noise, about zero and \ a Gaussian distribution was fitted to it. Windows were considered \ significant with a p-value < 1e-25, after using the Benjamini-Hochberg \ correction for multiple tests.
\\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ one subtrack, uncheck the box next to the track you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link. Note that the graphical configuration options are \ available only for the Signal subtrack; the Peaks subtracks are fixed.
\ \\ To perform FAIRE, proteins were cross-linked to DNA using \ 1% formaldehyde solution, the complex was sheared using sonication, and a \ phenol/chloroform extraction was performed to remove DNA fragments \ crosslinked to protein. The DNA recovered in the aqueous phase was \ fluorescently-labeled and hybridized to a microarray along with \ fluorescently-labeled genomic DNA as a control. Ratios were scaled by \ subtracting the Tukey Bi-weight mean for the log-ratio values from each \ log-ratio value, as recomended by NimbleGen. Results in yeast were \ consistent with enrichment for nucleosome-depleted regions of the genome. \ Therefore, the method may have utility as a positive selection for genomic \ regions with properties normally detected by assays like DNAse \ hypersensitivity.
\ \\ The data were verified using PCR with primers designed to promoters enriched \ with FAIRE and downstream coding regions.
\ \\ Cell culture, fixing, and DNA amplification were performed by Jonghwan Kim in \ the Vishy Iyer \ lab at the University of Texas, Austin. FAIRE was done by Paul Giresi in \ the Jason Lieb lab at the University of North Carolina at \ Chapel Hill. Paul Giresi of NimbleGen did the sample labeling and hybridization \ with the help of Mike Singer and Roland Green. Nan Jiang at NimbleGen supplied \ the Software used for the permutation analysis.
\ \\ Buck, M.J., Nobel, A.B., and Lieb, J.D. \ ChIPOTle: a \ user-friendly tool for the analysis of ChIP-chip data. Genome Biol.\ 6(11), R97 (2005).
\\ Nagy, P.L., Cleary, M.L., Brown, P.O., and Lieb, J.L. \ Genomewide demarcation of RNA polymerase II transcription units \ revealed by physical fractionation of chromatin. \ PNAS 100(11), 6364-9 (2003).
\ encodeChrom 0 altColor 50,100,50\ autoScale off\ chromosomes chr1,chr4,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chr10,chrX\ color 20,150,20\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChrom\ longLabel UNC FAIRE (Formaldehyde Assisted Isolation of Regulatory Elements)\ maxHeightPixels 128:24:16\ maxLimit 3.63\ minLimit -2.61\ origAssembly hg17\ priority 63.0\ shortLabel UNC FAIRE\ spanList 38\ track encodeUncFaire\ type bedGraph 4\ viewLimits -0.6:0.7\ windowingFunction mean\ encode_tba23EvoFold TBA23 EvoFold bed 6 + EvoFold Predictions of RNA Secondary Structure Using TBA23 0 63.1 20 90 0 137 172 127 0 0 0\ This track shows RNA secondary structure predictions made with the\ EvoFold program, a comparative method that exploits the evolutionary signal\ of genomic multiple-sequence alignments for identifying conserved\ functional RNA structures.
\ \\ Track elements are labeled using the convention ID_strand_score.\ At the zoomed-out level, secondary structure prediction regions are\ indicated by blocks, with the stem-pairing regions shown in a darker shade \ than unpaired regions. Arrows indicate the predicted strand.\ When zoomed in to the base level, the specific secondary structure predictions \ are shown in parenthesis format. The confidence score for each position is\ indicated in grayscale, with darker shades corresponding to higher scores.\
\ The details page for each track element shows the predicted secondary structure \ (labeled SS anno), together with details of the multiple species \ alignments at that location. Substitutions relative to the human sequence are \ color-coded according to their compatibility with the predicted secondary \ structure (see the color legend on the details page). Each prediction is \ assigned an overall score and a sequence of position-specific scores. The \ overall score measures evidence for any functional RNA structures in the given \ region, while the position-specific scores (0 - 9) measure the confidence of \ the base-specific annotations. Base-pairing positions are annotated \ with the same pair symbol. The offsets are provided to ease\ visual navigation of the alignment in terms of the human sequence. The offset\ is calculated (in units of ten) from the start position of the element on \ the positive strand or from the end position when on the negative strand.
\\ The graphical display may be filtered to show only those track elements \ with unnormalized scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page.
\ \\ Evofold makes use of phylogenetic\ stochastic context-free grammars (phylo-SCFGs), which are combined\ probabilistic models of RNA secondary structure and primary sequence\ evolution. The predictions consist of both a specific RNA secondary\ structure and an overall score. The overall score is essentially a\ log-odd score phylo-SCFG modeling the constrained evolution of\ stem-pairing regions and one which only models unpaired regions.
\\ The predictions for this track were based on the conserved elements of\ the 23-way threaded blockset aligner (TBA) alignments present in the ENCODE\ regions (see the TBA Alignment track for more information).
\ \\ The EvoFold program and browser track were developed by \ Jakob Skou Pedersen of the UCSC\ Genome Bioinformatics Group.
\\ The 23-way TBA multiple alignments were created by Elliott Margulies\ of the Green\ Lab at NHGRI.
\ \\ Knudsen B, Hein J. \ RNA secondary structure prediction using stochastic context-free \ grammars and evolutionary history.\ Bioinformatics. 1999 Jun;15(6):446-54.
\\ Pedersen JS, Bejerano G, Siepel A, Rosenbloom K, Lindblad-Toh K, Lander ES, \ Kent J, Miller W, Haussler D. \ Identification and classification of conserved RNA secondary \ structures in the human genome. \ PLoS Comput Biol. 2006 Apr;2(4):e33.
\\ Pedersen JS, Meyer IM, Forsberg R, Simmonds P, Hein J. \ A comparative method for finding and folding RNA\ secondary structures within protein-coding regions. \ Nucl Acids Res. 2004 Sep 24;32(16):4925-36.
\\ Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K,\ Clawson H, Spieth J, Hillier LW, Richards S, et al.\ Evolutionarily conserved elements in vertebrate, insect, worm,\ and yeast genomes.\ Genome Res. 2005 Aug;15(8):1034-50.\ \ encodeGenes 1 color 20,90,0\ dataVersion ENCODE June 2005 Freeze\ group encodeGenes\ longLabel EvoFold Predictions of RNA Secondary Structure Using TBA23\ mafTrack encodeTbaAlign\ priority 63.1\ shortLabel TBA23 EvoFold\ track encode_tba23EvoFold\ type bed 6 +\ visibility hide\ wgRnaOld sno/miRNA Old bed 8 + C/D and H/ACA Box snoRNAs, scaRNAs, and microRNAs from snoRNABase and miRBase (Old Track) 0 63.5 200 80 0 227 167 127 0 0 0 http://www-snorna.biotoul.fr/plus.php?id=$$
\ This track displays positions of four different types of RNA in the human \ genome: \
\ C/D box and H/ACA box snoRNAs are guides for the 2'O-ribose methylation and \ the pseudouridilation, respectively, of rRNAs and snRNAs, although many of \ them have no documented target RNA. The scaRNAs guide modifications of the\ spliceosomal snRNAs transcribed by RNA polymerase II, and often contain both \ C/D and H/ACA domains.
\ \\ This track follows the general display conventions for \ gene prediction \ tracks.
\\ The miRNA precursor forms (pre-miRNA) are represented by red blocks.
\\ C/D box snoRNAs, H/ACA box snoRNAs and scaRNAs are represented by blue, green and \ magenta blocks, respectively. At a zoomed-in resolution, arrows superimposed \ on the blocks indicate the sense orientation of the snoRNAs.
\ \\ Mature and precursor miRNAs from the miRNA Registry were aligned against the \ genome using blat.\ The extents of the precursor sequences were not generally known and were\ predicted based on base-paired hairpin structure. The miRNA Registry is\ described in Griffiths-Jones, S. (2004) and Weber, M.J. (2005) in the \ References section below.
\\ The snoRNAs and scaRNAs from the snoRNABase were aligned against the \ human genome using blat. \
\ \\ The miRNA annotation was contributed by Michel Weber of \ Laboratoire de Biologie \ Moléculaire Eucaryote, CNRS Université Paul Sabatier (UMR5099, Toulouse, \ France) and Sam Griffiths-Jones of The Wellcome Trust Sanger Institute (Cambridge, UK).
\\ The snoRNA annotations were contributed by Michel Weber and \ Laurent Lestrade of the \ Institut d'Exploration \ Fonctionnelle des Génomes (IFR109, Toulouse, France).
\\ Fan Hsu from the UCSC Genome Bioinformatics \ Group created the combined annotation track.
\ \\ When making use of these data, please cite: \
\ Griffiths-Jones S. \ The microRNA Registry.\ Nucl. Acids Res. 2004 Jan 1;32(D):D109-11.
\\ Weber MJ. \ New human and mouse microRNA genes found by homology search.\ Febs J. 2005 Jan;272(1):59-73.
\\ You may also want to cite The Wellcome Trust Sanger Institute \ miRNA Registry and The Laboratoire de Biologie Moleculaire \ Eucaryote snoRNA \ database.
\\ The following publication provides guidelines on miRNA annotation:\ Ambros V. et al., \ A uniform system for microRNA annotation. \ RNA. 2003;9(3):277-9.
\\ For more information on blat, see \ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002;12(4):656-664.
\ genes 1 color 200,80,0\ group genes\ longLabel C/D and H/ACA Box snoRNAs, scaRNAs, and microRNAs from snoRNABase and miRBase (Old Track)\ priority 63.5\ shortLabel sno/miRNA Old\ track wgRnaOld\ type bed 8 +\ url http://www-snorna.biotoul.fr/plus.php?id=$$\ url2 http://www.mirbase.org/cgi-bin/query.pl?terms=$$\ url2Label miRBase:\ urlLabel Laboratoire de Biologie Moléculaire Eucaryote:\ visibility hide\ wgRna sno/miRNA bed 8 + C/D and H/ACA Box snoRNAs, scaRNAs, and microRNAs from snoRNABase and miRBase 0 63.6 200 80 0 227 167 127 0 0 0 http://www-snorna.biotoul.fr/plus.php?id=$$\ This track displays positions of four different types of RNA in the human \ genome: \
\ C/D box and H/ACA box snoRNAs are guides for the 2'O-ribose methylation and \ the pseudouridilation, respectively, of rRNAs and snRNAs, although many of \ them have no documented target RNA. The scaRNAs guide modifications of the\ spliceosomal snRNAs transcribed by RNA polymerase II, and often contain both \ C/D and H/ACA domains.
\ \\ This track follows the general display conventions for \ gene prediction \ tracks.
\\ The miRNA precursor forms (pre-miRNA) are represented by red blocks.
\\ C/D box snoRNAs, H/ACA box snoRNAs and scaRNAs are represented by blue, \ green and magenta blocks, respectively. At a zoomed-in resolution, arrows \ superimposed on the blocks indicate the sense orientation of the snoRNAs.
\ \\ Precursor miRNA genomic locations from\ \ miRBase\ were calculated using wublastn for sequence alignment with the requirement of\ 100% identity. \ The extents of the precursor sequences were not generally known and were\ predicted based on base-paired hairpin structure. miRBase is\ described in Griffiths-Jones, S. (2004) and Weber, M.J. (2005) in the \ References section below.
\\ The snoRNAs and scaRNAs from the snoRNABase were aligned against the \ human genome using blat. \
\ \\ \
\ When making use of these data, please cite the folowing articles in addition to\ the primary sources of the miRNA sequences:
\\ Griffiths-Jones S, Saini HK, van Dongen S, Enright AJ.\ miRBase: tools for microRNA genomics.\ Nucleic Acids Res. 2008 Jan 1;36(Database issue):D154-8.
\\ Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ.\ miRBase: microRNA sequences, targets and gene nomenclature.\ Nucleic Acids Res. 2006 Jan 1;34(Database issue):D140-4.
\\ Griffiths-Jones S.\ The microRNA Registry.\ Nucleic Acids Res. 2004 Jan 1;32(Database issue):D109-11.
\\ Weber MJ.\ New human and mouse microRNA genes found by homology search.\
\ You may also want to cite The Wellcome Trust Sanger Institute \ miRBase and The Laboratoire de Biologie Moleculaire \ Eucaryote snoRNABase.
\\ The following publication provides guidelines on miRNA annotation:\ Ambros V. et al., \ A uniform system for microRNA annotation. \ RNA. 2003;9(3):277-9.
\\ genes 1 color 200,80,0\ dataVersion miRBase Release 13.0 (March 2009) and snoRNABase Version 3\ group genes\ longLabel C/D and H/ACA Box snoRNAs, scaRNAs, and microRNAs from snoRNABase and miRBase\ noScoreFilter .\ priority 63.6\ shortLabel sno/miRNA\ track wgRna\ type bed 8 +\ url http://www-snorna.biotoul.fr/plus.php?id=$$\ url2 http://www.mirbase.org/cgi-bin/query.pl?terms=$$\ url2Label miRBase:\ urlLabel Laboratoire de Biologie Moleculaire Eucaryote:\ visibility hide\ encodeAffyChIpHl60SitesRnapHr32 Affy Pol2 RA 32h bed 3 . Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 32hrs) Sites 0 64 50 175 0 152 215 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 50,175,0\ longLabel Affymetrix ChIP/Chip (Pol2 8WG16 antibody, retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 64\ shortLabel Affy Pol2 RA 32h\ subGroups factor=Pol2 time=32h\ track encodeAffyChIpHl60SitesRnapHr32\ xenoBestMrna Other Best mRNAs psl xeno Non-Human mRNAs from GenBank Best in Genome Alignments 0 64 0 0 0 127 127 127 1 0 0
\ This track displays translated blat alignments of vertebrate and\ invertebrate mRNA in \ GenBank from organisms other than human. \ Better alignments are indicated by darker coloration in the display.
\ \\ The mRNAs were aligned against the human genome using translated blat. \ When a single mRNA aligned in multiple places, the alignment having the \ highest base identity was found. Only those alignments having a base \ identity level within 1% of the best and at least 25% base identity with the\ genomic sequence were kept.
\ \\ This track has a filter that can be used to change the display mode, \ change the color, and include/exclude a subset of items within the track.\ This may be helpful when many items are shown in the track display, \ especially when only some are relevant to the current task. \ The filter is located at the top of the track description page, which is \ accessed via the small button to the left of the track's graphical \ display or through the link on the track's control menu. \ To use the filter:\
\ When you have finished configuring the filter, click the Submit \ button.
\ \\ The mRNA track was produced at UCSC from mRNA sequence data\ submitted to the international public sequence databases by \ scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, \ Wheeler DL. \ GenBank: update. Nucleic Acids Res. \ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ rna 1 baseColorUseCds genbank\ baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Non-$Organism mRNAs from GenBank Best in Genome Alignments\ priority 64\ shortLabel Other Best mRNAs\ showDiffBasesAllScales .\ spectrum on\ track xenoBestMrna\ type psl xeno\ visibility hide\ encodeUvaDnaRepSuper UVa DNA Rep University of Virginia DNA Replication Timing and Origins 0 64 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This super-track combines related tracks of DNA\ replication data from the University of Virginia.\ DNA replication is carefully coordinated, both across the genome and with\ respect to development. Earlier replication in S-phase is broadly correlated\ with gene density and transcriptional activity.
\\ These tracks contain temporal profiling of DNA replication and\ origin of DNA replication in multiple cell lines, such as\ HeLa cells (cervix carcinoma). Replication timing was measured by\ analyzing Brd-U-labeled fractions from synchronized cells on tiling arrays.
\ \\ Data generation and analysis for this track were performed by the\ DNA replication group in the\ Dutta Lab\ at the University of Virginia: Neerja Karnani, Christopher Taylor,\ Hakkyun Kim, Louis Lim, Ankit Malhotra, Gabe Robins and Anindya Dutta.
\\ Neerja Karnani and Christopher Taylor prepared the data for presentation in\ the UCSC Genome Browser.
\ \\ Giacca M, Pelizon C, Falaschi A.\ Mapping replication origins by quantifying relative abundance\ of nascent DNA strands using competitive polymerase chain reaction.\ Methods. 1997 Nov;13(3):301-12.
\\ Mesner LD, Crawford EL, Hamlin JL.\ Isolating apparently pure libraries of replication origins\ from complex genomes. Mol Cell. 2006 Mar 3;21(5):719-26.
\\ Jeon Y, Bekiranov S, Karnani N, Kapranov P, Ghosh S, MacAlpine D, Lee C,\ Hwang DS, Gingeras TR, Dutta A.\ Temporal profile of replication of human chromosomes.\ Proc Natl Acad Sci U S A. 2005 May 3;102(18):6419-24.
\ encodeChrom 0 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ group encodeChrom\ longLabel University of Virginia DNA Replication Timing and Origins\ priority 64.0\ shortLabel UVa DNA Rep\ superTrack on\ track encodeUvaDnaRepSuper\ encodeUvaDnaRep UVa DNA Rep bed 3 . University of Virginia Temporal Profiling of DNA Replication 0 64 60 75 60 10 130 10 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ The five subtracks in this annotation correspond to five different time points\ relative to the start of the DNA synthesis phase (S-phase) of the cell cycle. \
\ \\ Regions that are replicated during the given time interval are shown in green. \ Varying shades of green are used to distinguish one subtrack from another.\ To display only selected subtracks, uncheck the boxes next to the tracks you \ wish to hide.
\ \\ The experimental strategy adopted to map this profile involved isolation of \ replication products from HeLa cells synchronized at the G1-S boundary by \ thymidine-aphidicolin double block. Cells released from the block were\ labeled with BrdU at every two-hour interval of the 10 hours of S-phase and\ DNA was isolated from them. The heavy-light(H/L) DNA representing the pool of \ DNA replicated during each two-hour labeling period was separated from the\ unlabeled DNA by double cesium chloride density gradient centrifugation.\ The purified heavy-light DNA was then hybridized to a high-density \ genome-tiling Affymetrix array comprised of all unique probes within the \ ENCODE regions.
\\ The raw data generated by the microarray experiments was \ processed by computing the enrichment of signal in a particular \ part of the S-phase relative to the entirety of the S-phase (10 hours).\ High confidence regions (P-value = 1E-04) of \ replication were mapped by applying the Wilcoxon Rank Sum test in a sliding\ window of size 10 kb using the standard Affymetrix data analysis tools\ and the April 2003 (hg15) version of the human genome assembly. \ These coordinates were then mapped to the July 2003 (hg17) assembly by UCSC\ using the liftOver tool.
\ \\ The submitted data are from two biological experimental sets. Regions of\ significant enrichment were included from both of the biological replicates.\
\ \\ Data generation and analysis for this track were performed by the \ DNA replication group in the \ Dutta Lab\ at the University of Virginia: Neerja Karnani, Christopher Taylor, \ Hakkyun Kim, Louis Lim, Ankit Malhotra, Gabe Robins and Anindya Dutta.
\\ Neerja Karnani and Christopher Taylor prepared the data for presentation in \ the UCSC Genome Browser.
\ \\ Jeon, Y., Bekiranov, S., Karnani, N., Kapranov, P., Ghosh, S., MacAlpine, D., \ Lee, C., Hwang, D.S., Gingeras, T.R. and Dutta, A.\ Temporal profile of replication of human chromosomes.\ Proc Natl Acad Sci U S A 102(18), 6419-24 (2005).
\ encodeChrom 1 altColor 10,130,10\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 60,75,60\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel University of Virginia Temporal Profiling of DNA Replication\ origAssembly hg16\ priority 64.0\ shortLabel UVa DNA Rep\ superTrack encodeUvaDnaRepSuper dense\ track encodeUvaDnaRep\ type bed 3 .\ visibility hide\ hgIkmc IKMC Genes Mapped bed 12 International Knockout Mouse Consortium Genes Mapped to Human Genome 0 64.1 0 0 0 127 127 127 0 0 0 http://www.knockoutmouse.org/search?criteria=$$\ This track shows genes targeted by \ International Knockout Mouse Consortium (IKMC) \ mapped to the human genome. IKMC is a \ collaboration to generate a public resource of mouse embryonic stem (ES)\ cells containing a null mutation in every gene in the mouse genome.\ Gene targets are color-coded by status:\
\ The KnockOut Mouse Project Data\ Coordination Center (KOMP DCC) is the central database resource\ for coordinating mouse gene targeting within IKMC and provides\ web-based query and display tools for IKMC data. In addition, the\ KOMP DCC website provides a tool for the scientific community to\ nominate genes of interest to be knocked out by the KOMP initiative.
\ \\ IKMC members include\
\ Using complementary targeting strategies, the IKMC centers \ design and create targeting vectors, mutant ES cell lines and, to some\ extent, mutant mice, embryos or sperm. Materials are distributed to\ the research community.
\\ The KOMP Repository \ archives, maintains, and distributes IKMC products. Researchers can\ order products and get product information from the\ Repository. Researchers can also express interest in products that are\ still in the pipeline. They will then receive email notification as\ soon as KOMP generated products are available for distribution.
\\ The process for ordering EUCOMM materials can be found \ here.
\\ The process for ordering TIGM materials can be found \ here.
\\ Information on NorCOMM products and services can be found \ here.\
\ Genes were mapped to the human genome by IKMC.\
\ \\ Thanks to the International Knockout Mouse Consortium, and Carol Bult in \ particular, for providing these data.
\ \\ Collins FS, Finnell RH, Rossant J, Wurst W.\ A new partner for the international knockout mouse consortium.\ Cell. 2007 Apr 20;129(2):235.
\ \\ International Mouse Knockout Consortium, Collins FS, Rossant J, Wurst W.\ A mouse for all reasons.\ Cell. 2007 Jan 12;128(1):9-13.
\ \\ Austin CP, Battey JF, Bradley A, Bucan M, Capecchi M, Collins FS, Dove\ WF, Duyk G, Dymecki S, Eppig JT et al.\ The knockout mouse project.\ Nat Genet. 2004 Sep;36(9):921-4.
\ \ genes 1 group genes\ itemRgb on\ longLabel International Knockout Mouse Consortium Genes Mapped to Human Genome\ mgiUrl http://www.informatics.jax.org/searches/accession_report.cgi?id=$$\ mgiUrlLabel MGI Report:\ noScoreFilter .\ priority 64.1\ shortLabel IKMC Genes Mapped\ track hgIkmc\ type bed 12\ url http://www.knockoutmouse.org/search?criteria=$$\ urlLabel KOMP Data Coordination Center:\ visibility hide\ encodeUvaDnaRepSeg UVa DNA Rep Seg bed 3 . University of Virginia DNA Replication Temporal Segmentation 0 64.1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ The four subtracks in this annotation correspond to replication\ timing categories for DNA synthesis. Replication is segregated\ into early specific (Early), mid specific (Mid), late specific (Late),\ and non-specific (PanS). The first three categories correspond\ to regions that replicated in a time point-specific manner; \ the latter category encompasses regions that replicated in a \ temporally non-specific manner.
\ \\ This annotation follows the display conventions for composite \ tracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide.
\ \\ The experimental strategy adopted to map this profile involved isolation of\ replication products from HeLa cells synchronized at the G1-S boundary by\ thymidine-aphidicolin double block. Cells released from the block were labeled\ with BrDu at every two-hour interval of S-phase and DNA was isolated from \ them. The heavy-light (H/L) DNA representing the pool of DNA replicated during \ each two-hour labeling period was separated from unlabeled DNA by double cesium \ chloride density gradient centrifugation. The purified H/L DNA was \ then hybridized to a high-density genome-tiling Affymetrix array comprised \ of all unique probes within the ENCODE regions.
\\ The time of replication of 50% (TR50) of each microarray probe was\ calculated by accumulating the sum over the five time points and\ linearly interpolating the time when 50% was reached. Each probe\ was also classified as temporally specific or non-specific based on\ whether or not at least 50% of the accumulated signal appeared in a single\ time point.
\\ The TR50 data was then analyzed within a 20 kb sliding window \ to classify regions as specific versus non-specific based\ on the ratio of specific to non-specific probes within the window.\ Specific regions were further classified as early, mid, or late\ replicating based on the average TR50 of specific probes within the\ window. The resulting regions form a non-overlapping segregation\ of the replication data into the four given categories of\ replication timing.
\ \\ The replication experiments were completed for two biological sets\ in the HeLa-adherent cell line.
\ \\ Data generation and analysis for this track were performed by the\ DNA replication group in the Dutta Lab at the University of \ Virginia: Neerja Karnani, Christopher Taylor, Hakkyun Kim,\ Louis Lim, Ankit Malhotra, Gabe Robins and Anindya Dutta.
\\ Neerja Karnani and Christopher Taylor prepared the data for \ presentation in the UCSC Genome Browser.
\ \\ Jeon, Y., Bekiranov, S., Karnani, N., Kapranov, P., Ghosh, S.,\ MacAlpine, D., Lee, C., Hwang, D.S., Gingeras, T.R. and Dutta, A.\ Temporal profile of replication of human chromosomes.\ Proc Natl Acad Sci U S A 102(18), 6419-24 (2005).
\ \ encodeChrom 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChrom\ longLabel University of Virginia DNA Replication Temporal Segmentation\ origAssembly hg16\ priority 64.1\ shortLabel UVa DNA Rep Seg\ superTrack encodeUvaDnaRepSuper dense\ track encodeUvaDnaRepSeg\ type bed 3 .\ visibility hide\ encodeUvaDnaRepOrigins UVa DNA Rep Ori bed 3 . University of Virginia DNA Replication Origins 0 64.2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ The subtracks within this annotation show replication origins identified using \ the nascent strand method (Ori-NS), the bubble trapping method (Ori-Bubble) \ and the TR50 local minima method (Ori-TR50). \ Tracks are available for HeLa cells (cervix carcinoma) for all methods and \ GM06990 cells (lymphoblastoid) for Ori-NS.\ \
\ This annotation follows the display conventions for composite \ tracks. To show only selected subtracks within this annotation, \ uncheck the boxes next to the tracks you wish to hide.\
\ \\ ENCODE region-wide mapping of replication origins was performed. \ Origin-centered nascent-strands purified from HeLa and \ GM06990 cell lines were hybridized to Affymetrix \ ENCODE tiling arrays.\ \
\ Cells in their exponential stage of growth were labeled, in culture, with \ bromodeoxyuridine (BrdU) for 30 mins. DNA was then isolated from the cells. \ Nascent strands of 0.5-2.5 kb synthesized with incorporation of BrdU, \ representing the replication origins, were purified using a sucrose \ gradient followed by immunoprecipitation with BrdU antibody (Giacca \ et al., 1997). The purified nascent strands were amplified and \ then hybridized to Affymetrix ENCODE tiling arrays, which have 25-mer \ probes tiled every 22 bp, on average, in the non-repetitive sequence of the \ ENCODE regions. As an experimental control, genomic DNA was hybridized to \ arrays independently.
\\ Replication origins were identified by estimating the \ significance of the enrichment of nascent strands DNA (treatment) signal over \ genomic DNA (control) signal in a sliding window of 1000 bp. An estimate of \ significance in the window was calculated by computing the p-value using the \ Wilcoxon Rank-Sum test over all three biological replicates and control signal \ estimates in that window. The origins (Ori-NS) represented in the subtrack are \ the genomic regions that showed a signal enrichment pValue <= 0.001.\ \
\ The origin mapping experiments were completed for three biological sets.\ \
\ Data generation and analysis for the subtracks using the Ori-NS method were \ performed by the DNA replication group in the Dutta Lab at the University of \ Virginia: Neerja Karnani, Christopher Taylor, Ankit Malhotra, Gabe Robins \ and Anindya Dutta. \
\ Christopher Taylor and Neerja Karnani prepared the data for presentation in \ the UCSC Genome Browser. \ \\ Giacca M, Pelizon C, Falaschi A. Mapping replication origins by quantifying relative abundance \ of nascent DNA strands using competitive polymerase chain reaction. \ Methods. 1997;13(3):301-12.\
\ \\ ENCODE region-wide mapping of replication origins in HeLa \ cells was performed by the bubble trapping method. Replication origins were\ identified by hybridization to Affymetrix ENCODE tiling arrays. \ \
\ The bubble trapping method works on the principle that circular plasmids can be\ trapped in gelling agarose followed by the application of electrical current\ for a prolonged period of time (see Mesner et al. 2006 for more \ details). Entrapment occurs by an apparent physical linkage of the circular \ DNA with the agarose matrix. The circular bubble component of the DNA \ replication intermediates was therefore enriched by agarose trapping. After \ recovery from the agarose gel, a library of the entrapped DNA was formed by DNA cloning. Subsequently, DNA from the library was labeled and hybridized to \ Affymetrix ENCODE tiling arrays, which have 25-mer probes tiled every 22 bp \ on average in the non-repetitive ENCODE regions. As an experimental control, \ genomic DNA was hybridized to arrays independently.
\\ Replication origins were identified by estimating the significance of the \ enrichment of the bubble-trapped DNA (treatment) signal over genomic DNA \ (control) signal in a sliding window of 10,000 bp. An estimate of significance \ in the window was calculated by computing the p-value using the Wilcoxon \ Rank-Sum test over all three biological replicates and the control signal \ estimates in that window. The origins (Ori-Bubble) hence represented in the \ UCSC browser track are the genomic regions that showed a signal enrichment \ pValue <= 0.001.\
\ \\ The origin mapping experiments were completed for two biological sets. \
\ \\ Data generation and analysis for the subtrack using the Ori-bubble method \ were performed by the DNA replication group in the Dutta Lab and Hamlin Lab \ at the University of Virginia: Neerja Karnani, Larry Mesner, Christopher \ Taylor, Ankit Malhotra, Gabe Robins, Anindya Dutta and Joyce Hamlin.
\\ Neerja Karnani and Christopher Taylor prepared the data for presentation in the UCSC Genome Browser.
\ \\ Mesner LD, Crawford EL, Hamlin JL. Isolating apparently pure libraries of replication origins \ from complex genomes. Mol Cell. 2006 Mar 3;21(5):719-26.\
\ \\ ENCODE region-wide mapping of replication origins in HeLa \ cells was performed by the TR50 local minima method. Replication \ origins were identified by hybridization to Affymetrix ENCODE tiling arrays.\
\ \\ The experimental strategy adopted to map this profile involved isolation of \ replication products from HeLa cells synchronized at the G1-S boundary by \ thymidine-aphidicolin double block. Cells released from the block were labeled \ with BrdU at every two-hour interval of the 10 hours of S-phase. Subsequently,\ DNA was isolated from the cells. The heavy-light (H/L) DNA representing the \ pool of DNA replicated during each two-hour labeling period was separated from \ the unlabeled DNA by double cesium chloride density gradient centrifugation. \ The purified H/L DNA was then hybridized to a high-density genome-tiling\ Affymetrix array comprised of all unique probes within the ENCODE regions.
\\ The time of replication of 50% (TR50) of each microarray probe was \ calculated by accumulating the sum over the five time points and linearly \ interpolating the time when 50% was reached. Each probe was also classified \ as showing temporally specific replication (all alleles replicating \ together within a two-hour window) or temporally non-specific replication \ (at least one allele replicating apart from the others by at least a two \ hour difference).
\\ The TR50 data for the temporally specific probes was then \ smoothed within a 60 kb window using lowess smoothing. Local minima (within \ a 30 kb window) on the smoothed TR50 curve were identified which \ had at least 30 probes in the window on both sides of the minimum to locate \ possible origins of replication. A confidence value was calculated for each \ site as the average difference from the value of the local minimum of all \ TR50 values falling into the 30 kb window.
\ \\ The replication experiments were completed for two biological sets and a \ technical replicate in the HeLa adherent cell line.
\ \\ Data generation and analysis for the subtrack using the Ori-TR50 method \ were performed by the DNA replication group in the Dutta Lab at the \ University of Virginia: Neerja Karnani, Christopher Taylor, Hakkyun Kim, \ Louis Lim, Ankit Malhotra, Gabe Robins and Anindya Dutta.
\\ Neerja Karnani and Christopher Taylor prepared the data for presentation in the UCSC Genome Browser.
\ \\ Jeon Y, Bekiranov S, Karnani N, Kapranov P, Ghosh S, MacAlpine D, Lee C, \ Hwang DS, Gingeras TR, Dutta A.\ Temporal profile of replication of human chromosomes.\ Proc Natl Acad Sci U S A. 2005 May 3;102(18):6419-24.\
\ encodeChrom 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE Oct 2005 Freeze, May 2007 data\ group encodeChrom\ longLabel University of Virginia DNA Replication Origins\ origAssembly hg17\ priority 64.2\ shortLabel UVa DNA Rep Ori\ superTrack encodeUvaDnaRepSuper dense\ track encodeUvaDnaRepOrigins\ type bed 3 .\ visibility hide\ encodeUvaDnaRepTr50 UVa DNA Rep TR50 wig 2.05 6.36 University of Virginia DNA Smoothed Timing at 50% Replication 0 64.3 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This annotation shows smoothed replication\ timing for DNA synthesis as the time of 50% replication (TR50).
\ \\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ The experimental strategy adopted to map this profile involved \ isolation of replication products from HeLa cells synchronized \ at the G1-S boundary by thymidine-aphidicolin double block. Cells \ released from the block were labeled with BrdU at every two-hour \ interval of the 10 hours of S-phase and DNA was isolated from them. \ The heavy-light (H/L) DNA representing the pool of DNA replicated \ during each two-hour labeling period was separated from the unlabeled \ DNA by double cesium chloride density gradient centrifugation. The \ purified H/L DNA was then hybridized to a high-density\ genome-tiling Affymetrix array comprised of all unique probes within the\ ENCODE regions.
\\ The time of replication of 50% (TR50) of each microarray probe was\ calculated by accumulating the sum over the five time points and\ linearly interpolating the time when 50% was reached. Each probe\ was also classified as temporally specific or non-specific based on\ whether at least 50% of the accumulated signal appeared in a single\ time point or not.
\\ The TR50 data for all specific probes were then lowess-smoothed within\ a 60 kb window to provide the profile displayed in the annotation.
\ \\ The replication experiments were completed for two biological sets\ in the HeLa adherent cell line.
\ \\ Data generation and analysis for this track were performed by the\ DNA replication group in the Dutta Lab at the University of \ Virginia: Neerja Karnani, Christopher Taylor, Hakkyun Kim,\ Louis Lim, Ankit Malhotra, Gabe Robins and Anindya Dutta.
\\ Neerja Karnani and Christopher Taylor prepared the data for \ presentation in the UCSC Genome Browser.
\ \\ Jeon, Y., Bekiranov, S., Karnani, N., Kapranov, P., Ghosh, S.,\ MacAlpine, D., Lee, C., Hwang, D.S., Gingeras, T.R. and Dutta, A.\ Temporal profile of replication of human chromosomes.\ Proc Natl Acad Sci U S A 102(18), 6419-24 (2005).\ encodeChrom 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ dataVersion ENCODE Oct 2005 Freeze\ group encodeChrom\ longLabel University of Virginia DNA Smoothed Timing at 50% Replication\ maxHeightPixels 128:16:16\ origAssembly hg17\ priority 64.3\ shortLabel UVa DNA Rep TR50\ spanList 1\ superTrack encodeUvaDnaRepSuper dense\ track encodeUvaDnaRepTr50\ type wig 2.05 6.36\ viewLimits 2.2:5.2\ visibility hide\ windowingFunction mean\ encodeAffyChIpHl60PvalRaraHr00 Affy RARA RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 0hrs) P-Value 0 65 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 65\ shortLabel Affy RARA RA 0h\ subGroups factor=RARA time=0h\ track encodeAffyChIpHl60PvalRaraHr00\ xenoEst Other ESTs psl xeno Non-Human ESTs from GenBank 0 65 0 0 0 127 127 127 1 0 0 http://www.ncbi.nlm.nih.gov/htbin-post/Entrez/query?form=4&db=n&term=$$
\ This track displays translated blat alignments of expressed sequence tags \ (ESTs) in GenBank from organisms other than human.\ ESTs are single-read sequences, typically about 500 bases in length, that \ usually represent fragments of transcribed genes.
\ \\ This track follows the display conventions for \ PSL alignment tracks. In dense display mode, the items that\ are more darkly shaded indicate matches of better quality.
\\ The strand information (+/-) for this track is in two parts. The\ first + or - indicates the orientation of the query sequence whose\ translated protein produced the match. The second + or - indicates the\ orientation of the matching translated genomic sequence. Because the two\ orientations of a DNA sequence give different predicted protein sequences,\ there are four combinations. ++ is not the same as --, nor is +- the same\ as -+.
\\ The description page for this track has a filter that can be used to change \ the display mode, alter the color, and include/exclude a subset of items \ within the track. This may be helpful when many items are shown in the track \ display, especially when only some are relevant to the current task.
\\ To use the filter:\
\ This track may also be configured to display base labeling, a feature that\ allows the user to display all bases in the aligning sequence or only those \ that differ from the genomic sequence. For more information about this option,\ click \ here.\ Several types of alignment gap may also be colored; \ for more information, click \ here.\
\ \\ To generate this track, the ESTs were aligned against the genome using \ blat. When a single EST aligned in multiple places, the \ alignment having the highest base identity was found. Only alignments \ having a base identity level within 0.5% of the best and at least 96% base \ identity with the genomic sequence were kept.
\ \\ This track was produced at UCSC from EST sequence data submitted to the \ international public sequence databases by scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, \ Wheeler DL. \ GenBank: update. Nucleic Acids Res. \ 2004 Jan 1;32(Database issue):D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ \ rna 1 baseColorUseSequence genbank\ group rna\ indelDoubleInsert on\ indelQueryInsert on\ longLabel Non-$Organism ESTs from GenBank\ priority 65\ shortLabel Other ESTs\ spectrum on\ track xenoEst\ type psl xeno\ url http://www.ncbi.nlm.nih.gov/htbin-post/Entrez/query?form=4&db=n&term=$$\ visibility hide\ encodeYaleChIPSTAT1Pval Yale ChIP pVal bedGraph 4 Yale ChIP/Chip (STAT1 ab, Hela cells, P-Values) 0 65 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track shows probable sites of STAT1 binding in HeLa cells\ as determined by chromatin immunoprecipitation followed by\ microarray analysis. STAT1 (Signal Transducer and\ Activator of Transcription) is a transcription factor that\ moves to the nucleus and binds DNA only in response to a cytokine\ signal such as interferon-gamma. HeLa cells are a common cell line\ derived from a cervical cancer. Each of the four subtracks represents \ a different microarray platform. The track as a whole can be used to \ compare results across microarray platforms.
\\ The first three platforms are custom maskless \ photolithographic arrays with oligonucleotides tiling most of the \ non-repetitive DNA sequence of the ENCODE regions:\
\ The subtracks show the ratio of immunoprecipitated DNA from \ cytokine-stimulated cells vs. unstimulated cells in each of the four \ platforms. The ratio is calculated as -log10(p-value) \ in a 501-base window. The data shown is the combined result of multiple\ biological replicates: five for the first maskless array \ (50-mer every 38 bp), two for the second maskless array (36-mer \ every 36 bp), three for the third maskless array (50-mer every 50 \ bp) and six for the PCR Amplicon array. \
\\ These data are available at NCBI GEO as \ GSE2714, which also provides additional information about \ the experimental protocols.
\ \\ This annotation follows the display conventions for composite \ "wiggle" tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ For all arrays, the STAT1 ChIP DNA was labeled with Cy5 and the control DNA \ was labeled with Cy3.
\ \\ The data from replicates were median-scaled and quantile-normalized \ to each other. After normalization, replicates were condensed to a\ single value. Using a \ 501 bp sliding window centered on each oligonucleotide probe, a \ signal map (estimating the fold enrichment [log2 \ scale] of ChIP DNA) is generated by computing the pseudomedian \ signal of all log2(Cy5/Cy3) ratios (median of \ pairwise averages) within the window (including replicates). \ Using the same procedure, a -log10(p-value) map \ (measuring significance of enrichment of oligonucleotide probes \ in the window) for all sliding windows can be made by computing \ P-values using the Wilcoxon paired signed rank test comparing \ fluorensent intensity between Cy5 and Cy3 for each \ oligonucleotide probe (Cy5 and Cy3 signals from the same array). \ A binding site is determined by thresholding both on fold \ enrichment and -log10(p-value) and requiring a \ maximum gap and a minimum run between oligonucleotide positions.
\\
For the first maskless array (50-mer every 38 bp):\
\
log2(Cy5/Cy3) >= 1.25, -log10(p-value) >=8.0, MaxGap <= 100 bp, MinRun >= 180 bp
\
For the second maskless array (36-mer every 36 bp): \
\
log2(Cy5/Cy3) >= 0.25, -log10(p-value) >=4.0, MaxGap <= 250 bp, MinRun >= 0 bp
\
For the third maskless array (50-mer every 50 bp): \
\
log2(Cy5/Cy3) >= 0.25, -log10(p-value) >=4.0, MaxGap <= 250 bp, MinRun >= 0 bp
\ The Cy5 and Cy3 array data were loess-normalized between channels \ on the same slide and then between slides. A z-score was then \ determined for each PCR amplicon from the distribution of \ log(Cy5/Cy3) in a local log(Cy5*Cy3) intensity window (see \ Quackenbush, 2002 and the \ Express \ Yourself website for more details). From the z-score, a P-value was then \ associated with each PCR amplicon. Hits were determined using a 3 sigma \ threshold and requiring a spot to be present on three out of six arrays.
\ \\ ChIP-chip binding sites were verified by comparing "hit lists" \ generated from combinations of different biological replicates. \ Only experiments that yielded a significant overlap (greater than \ 50 percent) were accepted. As an independent check (for maskless \ arrays), data on the microarray were randomized with respect to \ position and re-scored; significantly fewer hits (consistent \ with random noise) were generated this way.
\ \\ This data was generated and analyzed by the labs of Michael \ Snyder, Mark Gerstein and Sherman Weissman at Yale University. The PCR \ Amplicon arrays were manufactured by Bing Ren's lab at UCSD.
\ \\ Cawley, S., Bekiranov, S., Ng, H.H., Kapranov, P., Sekinger, E.A., Kampa, D., \ Piccolboni, A., Sementchenko, V., Cheng, J. et al.\ Unbiased mapping of transcription factor binding sites along \ human chromosomes 21 and 22 points to widespread regulation of noncoding \ RNAs. Cell 116(4), 499-509 (2004).
\\ Euskirchen, G., Royce, T.E., Bertone, P., Martone, R., Rinn, J.L., Nelson, \ F.K., Sayward, F., Luscombe, N.M., Miller, P. et al.\ CREB binds to multiple loci on human chromosome 22, \ Mol Cell Biol. 24(9), 3804-14 (2004).
\\ Luscombe, N.M., Royce, T.E., Bertone, P., Echols, N., Horak, C.E., Chang, \ J.T., Snyder, M. and Gerstein, M.\ ExpressYourself: A modular platform for processing and \ visualizing microarray data.\ Nucleic Acids Res. 31(13), 3477-82 (2003).
\\ Martone, R., Euskirchen, G., Bertone, P., Hartman, S., Royce, T.E., \ Luscombe, N.M., Rinn, J.L., Nelson, F.K., Miller, P. et al.\ Distribution of NF-kappaB-binding sites across human chromosome \ 22.\ Proc Natl Acad Sci U S A. 100(21), 12247-52 (2003).
\\ Quackenbush, J.. \ Microarray data normalization and transformation, \ Nat Genet. 32(Suppl), 496-501 (2002).
\ \ encodeChip 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells, P-Values)\ maxHeightPixels 128:16:16\ maxLimit 18.2\ minLimit 0\ origAssembly hg16\ priority 65.0\ shortLabel Yale ChIP pVal\ track encodeYaleChIPSTAT1Pval\ type bedGraph 4\ viewLimits 0:10\ visibility hide\ windowingFunction mean\ encodeYaleChIPSTAT1Sig Yale ChIP Sig bedGraph 4 Yale ChIP/Chip: STAT1 ab, Hela cells, Signal 0 65.1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ Each of these four tracks shows the map of signal intensity \ (estimating the fold enrichment [log2 scale] of ChIP DNA vs \ unstimulated DNA) for STAT1 ChIP-chip using Human Hela S3 cells \ hybridized to four different array designs/platforms. The first \ three platforms are custom maskless photolithographic arrays \ with oligonucleotides tiling most of the non-repetitive DNA \ sequence of the ENCODE regions: \
\ Each track shows the combined results of multiple biological replicates: five \ for the first maskless array (50-mer every 38 bp), two for the \ second maskless array (36-mer every 36 bp), three for the third \ maskless array (50-mer every 50 bp) and six for the PCR Amplicon \ array. For all arrays, the STAT1 ChIP DNA was labeled with Cy5 and \ the control DNA was labeled with Cy3.
\\ These data are available at NCBI GEO as \ GSE2714, which also provides additional information about\ the experimental protocols.
\ \\ This annotation follows the display conventions for composite \ "wiggle" tracks. The subtracks within this annotation \ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\ \\ The data from replicates were median-scaled and quantile-normalized to each \ other (both Cy3 and Cy5 channels). Using a \ 501 bp sliding window centered on each oligonucleotide probe, a \ signal map (estimating the fold enrichment [log2 \ scale] of ChIP DNA) was generated by computing the pseudomedian \ signal of all log2(Cy5/Cy3) ratios (median of \ pairwise averages) within the window, including replicates. \ Using the same procedure, a -log10(P-value) map \ (measuring significance of enrichment of oligonucleotide probes \ in the window) for all sliding windows was made by computing \ P-values using the Wilcoxon paired signed rank test comparing \ fluorensent intensity between Cy5 and Cy3 for each \ oligonucleotide probe (Cy5 and Cy3 signals from the same array). \ A binding site was determined by thresholding both on fold \ enrichment and -log10(P-value) and requiring a \ maximum gap and a minimum run between oligonucleotide positions.
\\
For the first maskless array (50-mer every 38 bp):\
\
log2(Cy5/Cy3) >= 1.25, -log10(P-value) >= \
8.0, MaxGap <= 100 bp, MinRun >= 180 bp
\
For the second maskless array (36-mer every 36 bp): \
\
log2(Cy5/Cy3) >= 0.25, -log10(P-value) >= \
4.0, MaxGap <= 250 bp, MinRun >= 0 bp
\
For the third maskless array (50-mer every 50 bp): \
\
log2(Cy5/Cy3) >= 0.25, -log10(P-value) >= \
4.0, MaxGap <= 250 bp, MinRun >= 0 bp
\ The Cy5 and Cy3 array data were loess-normalized between channels \ on the same slide and then between slides. A z-score was then \ determined for each PCR amplicon from the distribution of \ log(Cy5/Cy3) in a local log(Cy5*Cy3) intensity window (see \ Quackenbush, 2002 and the \ Express \ Yourself website for more details). From the z-score, a P-value was then \ associated with each PCR amplicon. Hits were determined using a 3 sigma \ threshold and requiring a spot to be present on three out of six arrays.
\ \\ ChIP-chip binding sites were verified by comparing "hit lists" \ generated from combinations of different biological replicates. \ Only experiments that yielded a significant overlap (greater than \ 50 percent) were accepted. As an independent check (for maskless \ arrays), data on the microarray were randomized with respect to \ position and re-scored; significantly fewer hits (consistent \ with random noise) were generated this way.
\ \\ These data were generated and analyzed by the labs of Michael \ Snyder, Mark Gerstein and Sherman Weissman at Yale University. The PCR \ Amplicon arrays were manufactured by Bing Ren's lab at UCSD.
\ \\ Cawley, S., Bekiranov, S., Ng, H.H., Kapranov, P., Sekinger, E.A., Kampa, D., \ Piccolboni, A., Sementchenko, V., Cheng, J. et al.\ Unbiased mapping of transcription factor binding sites along \ human chromosomes 21 and 22 points to widespread regulation of noncoding \ RNAs. Cell 116(4), 499-509 (2004).
\\ Euskirchen, G., Royce, T.E., Bertone, P., Martone, R., Rinn, J.L., Nelson, \ F.K., Sayward, F., Luscombe, N.M., Miller, P. et al.\ CREB binds to multiple loci on human chromosome 22, \ Mol Cell Biol. 24(9), 3804-14 (2004).
\\ Luscombe, N.M., Royce, T.E., Bertone, P., Echols, N., Horak, C.E., Chang, \ J.T., Snyder, M. and Gerstein, M.\ ExpressYourself: A modular platform for processing and \ visualizing microarray data.\ Nucleic Acids Res. 31(13), 3477-82 (2003).
\\ Martone, R., Euskirchen, G., Bertone, P., Hartman, S., Royce, T.E., \ Luscombe, N.M., Rinn, J.L., Nelson, F.K., Miller, P. et al.\ Distribution of NF-kappaB-binding sites across human chromosome \ 22.\ Proc Natl Acad Sci U S A. 100(21), 12247-52 (2003).
\\ Quackenbush, J.. \ Microarray data normalization and transformation, \ Nat Genet. 32(Suppl), 496-501 (2002).
\ \ encodeChip 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Yale ChIP/Chip: STAT1 ab, Hela cells, Signal\ maxHeightPixels 128:16:16\ maxLimit 3.69\ minLimit -4.19\ origAssembly hg16\ priority 65.1\ shortLabel Yale ChIP Sig\ track encodeYaleChIPSTAT1Sig\ type bedGraph 4\ viewLimits 0:2\ visibility hide\ windowingFunction mean\ encodeYaleChIPSTAT1Sites Yale ChIP Sites bed . Yale ChIP/Chip (STAT1 ab, Hela cells, Binding Sites) 0 65.2 0 0 0 127 127 127 0 0 18 chr1,chr10,chr11,chr13,chr14,chr15,chr16,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX,\ Each of these four tracks shows the binding sites for STAT1 ChIP-chip \ using Human Hela S3 cells hybridized to four different array \ designs/platforms. The first \ three platforms are custom maskless photolithographic arrays \ with oligonucleotides tiling most of the non-repetitive DNA \ sequence of the ENCODE regions: \
\ Each track shows the combined results of multiple biological replicates: five \ for the first maskless array (50-mer every 38 bp), two for the \ second maskless array (36-mer every 36 bp), three for the third \ maskless array (50-mer every 50 bp) and six for the PCR Amplicon \ array. For all arrays, the STAT1 ChIP DNA was labeled with Cy5 and \ the control DNA was labeled with Cy3. See NCBI GEO \ GSE2714 for details of the experimental protocols.
\ \\ The data from replicates were median-scaled and quantile-normalized to each \ other (both Cy3 and Cy5 channels). Using a \ 501 bp sliding window centered on each oligonucleotide probe, a \ signal map (estimating the fold enrichment [log2 \ scale] of ChIP DNA) was generated by computing the pseudomedian \ signal of all log2(Cy5/Cy3) ratios (median of \ pairwise averages) within the window, including replicates. \ Using the same procedure, a -log10(P-value) map \ (measuring significance of enrichment of oligonucleotide probes \ in the window) for all sliding windows was made by computing \ P-values using the Wilcoxon paired signed rank test comparing \ fluorensent intensity between Cy5 and Cy3 for each \ oligonucleotide probe (Cy5 and Cy3 signals from the same array). \ A binding site was determined by thresholding both on fold \ enrichment and -log10(P-value) and requiring a \ maximum gap and a minimum run between oligonucleotide positions.
\\
For the first maskless array (50-mer every 38 bp):\
\
log2(Cy5/Cy3) >= 1.25, -log10(P-value) >= \
8.0, MaxGap <= 100 bp, MinRun >= 180 bp
\
For the second maskless array (36-mer every 36 bp): \
\
log2(Cy5/Cy3) >= 0.25, -log10(P-value) >= \
4.0, MaxGap <= 250 bp, MinRun >= 0 bp
\
For the third maskless array (50-mer every 50 bp): \
\
log2(Cy5/Cy3) >= 0.25, -log10(P-value) >= \
4.0, MaxGap <= 250 bp, MinRun >= 0 bp
\ The Cy5 and Cy3 array data were loess-normalized between channels \ on the same slide and then between slides. A z-score was then \ determined for each PCR amplicon from the distribution of \ log(Cy5/Cy3) in a local log(Cy5*Cy3) intensity window (see \ Quackenbush, 2002 and the \ Express \ Yourself website for more details). From the z-score, a P-value was then \ associated with each PCR amplicon. Hits were determined using a 3 sigma \ threshold and requiring a spot to be present on three out of six arrays.
\ \\ ChIP-chip binding sites were verified by comparing "hit lists" \ generated from combinations of different biological replicates. \ Only experiments that yielded a significant overlap (greater than \ 50 percent) were accepted. As an independent check (for maskless \ arrays), data on the microarray were randomized with respect to \ position and re-scored; significantly fewer hits (consistent \ with random noise) were generated this way.
\ \\ This data was generated and analyzed by the labs of Michael Snyder, \ Mark Gerstein and Sherman Weissman at Yale University. The PCR Amplicon arrays\ were manufactured by Bing Ren's lab at UCSD.\ \
\ Cawley, S., Bekiranov, S., Ng, H.H., Kapranov, P., Sekinger, E.A., Kampa, D., \ Piccolboni, A., Sementchenko, V., Cheng, J. et al.\ Unbiased mapping of transcription factor binding sites along \ human chromosomes 21 and 22 points to widespread regulation of noncoding \ RNAs. Cell 116(4), 499-509 (2004).
\\ Euskirchen, G., Royce, T.E., Bertone, P., Martone, R., Rinn, J.L., Nelson, \ F.K., Sayward, F., Luscombe, N.M., Miller, P. et al.\ CREB binds to multiple loci on human chromosome 22, \ Mol Cell Biol. 24(9), 3804-14 (2004).
\\ Luscombe, N.M., Royce, T.E., Bertone, P., Echols, N., Horak, C.E., Chang, \ J.T., Snyder, M. and Gerstein, M.\ ExpressYourself: A modular platform for processing and \ visualizing microarray data.\ Nucleic Acids Res. 31(13), 3477-82 (2003).
\\ Martone, R., Euskirchen, G., Bertone, P., Hartman, S., Royce, T.E., \ Luscombe, N.M., Rinn, J.L., Nelson, F.K., Miller, P. et al.\ Distribution of NF-kappaB-binding sites across human chromosome \ 22.\ Proc Natl Acad Sci U S A. 100(21), 12247-52 (2003).
\\ Quackenbush, J.. \ Microarray data normalization and transformation, \ Nat Genet. 32(Suppl), 496-501 (2002).
\ \ encodeChip 1 chromosomes chr1,chr10,chr11,chr13,chr14,chr15,chr16,chr19,chr2,chr20,chr21,chr22,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChip\ longLabel Yale ChIP/Chip (STAT1 ab, Hela cells, Binding Sites)\ origAssembly hg16\ priority 65.2\ shortLabel Yale ChIP Sites\ track encodeYaleChIPSTAT1Sites\ type bed .\ visibility hide\ encodeAffyChIpHl60SitesRaraHr00 Affy RARA RA 0h bed 3 . Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 0hrs) Sites 0 66 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 66\ shortLabel Affy RARA RA 0h\ subGroups factor=RARA time=0h\ track encodeAffyChIpHl60SitesRaraHr00\ anyCovBed mRNA/EST/Pseud bed 3 . Blastz Alignments of GenBank mRNA Including Pseudogenes and All ESTs 0 66 170 128 128 212 191 191 0 0 0 rna 1 color 170,128,128\ group rna\ longLabel Blastz Alignments of GenBank mRNA Including Pseudogenes and All ESTs\ priority 66\ shortLabel mRNA/EST/Pseud\ track anyCovBed\ type bed 3 .\ visibility hide\ encodeRegulomeBase UW/Reg DNaseI Sens wig 0.0 3.0 ENCODE UW/Regulome Mean DNaseI Sensitivity 0 66.1 0 0 0 127 127 127 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX,\ This track shows the moving baseline of mean DNaseI sensitivity, \ computed over each PCR amplicon using a locally-weighted least squares \ (LOWESS)-based algorithm described in Dorschner et al (2004).\ The track is one of a set of tracks that annotate continuous DNaseI \ sensitivity measurements and DNaseI hypersensitive sites (HSs) over ENCODE \ regions. \ DNaseI has long been used to map general chromatin accessibility and the \ DNaseI "hyperaccessibility" or "hypersensitivity" that is a \ universal feature of active cis-regulatory sequences.\ The data were produced using quantitative chromatin profiling (QCP) \ (Dorschner et al.).
\\ See the UW/Reg Amplicon track for a list of the cell lines/phenotypes studied\ in these experiments.
\ \\ The displayed values are calculated as (copies in DNaseI-untreated / copies \ in DNaseI-treated). Thus, increasing values represent increasing \ sensitivity.
\\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary; they provide a\ visual cue for distinguishing the different cell lines/phenotypes.
\ \\ QCP was performed as described in Dorschner et al. See the UW/Reg\ Amplicon track description for more information.
\ \\ Data generation, analysis, and validation were performed jointly by groups at\ Regulome Corporation and the University of Washington (UW) in Seattle.
\\ Regulome Corp.: Michael O.\ Dorschner, Richard Humbert, Peter J. Sabo, Anthony Shafer, Jeff Goldy,\ Molly Weaver, Kristin Lee, Fidencio Neri, Brendan Henry, Mike Hawrylycz, Paul\ Tittel, Jim Wallace, Josh Mack, Janelle Kawamoto, John A. Stamatoyannopoulos.\
\\ UW Medical\ Genetics: Patrick Navas, Man Yu, Hua Cao, Brent Johnson, Ericka\ Johnson, George Stamatoyannopoulos.
\\ UW Genome Sciences: \ Scott Kuehn, Robert Thurman, William S. Noble.
\ \\ Dorschner, M.O., Hawrylycz, M., Humbert, R., Wallace, J.C., Shafer, A., \ Kawamoto, J., Mack, J., Hall, R., Goldy, J., Sabo, P.J. et al.\ High-throughput localization of functional elements by \ quantitative chromatin profiling.\ Nat Methods 1(3), 219-25 (2004).
\ \ encodeChrom 0 autoScale Off\ chromosomes chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel ENCODE UW/Regulome Mean DNaseI Sensitivity\ maxHeightPixels 128:16:16\ origAssembly hg16\ priority 66.1\ shortLabel UW/Reg DNaseI Sens\ smoothingWindow 2\ spanList 250\ track encodeRegulomeBase\ type wig 0.0 3.0\ visibility hide\ encodeRegulomeProb UW/Reg DNaseI HSs bedGraph 4 ENCODE UW/Regulome DNaseI hypersensitive sites/scores 0 66.2 0 0 0 127 127 127 0 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX,\ This track identifies amplicons overlying DNaseI hypersensitive sites (HSs)\ and provides an empirical P-value for each. \ The track is one of a set of tracks that annotate continuous DNaseI \ sensitivity measurements and DNaseI hypersensitive sites (HSs) over ENCODE \ regions. \ DNaseI has long been used to map \ general chromatin accessibility and the DNaseI "hyperaccessibility" \ or "hypersensitivity" that is a universal feature of active \ cis-regulatory sequences.\ The data were produced using quantitative chromatin profiling (QCP) \ (Dorschner et al., 2004).\
\ See the UW/Reg Amplicon track for a list of the cell lines/phenotypes studied\ in these experiments.
\ \\ Data values are represented \ on a vertical axis as a score between 0 and 3, corresponding to \ -log10(P-value) (i.e., a score of 3 indicates a P-value of \ less than 0.001).\ Note that these are empirically determined P-values, not binomial/Gaussian \ P-values. Also, the HSs are called only in the context of plates with an \ acceptable (though conservative) plate quality score (see the Regulome Quality \ track).
\\ The subtracks within this composite annotation track\ may be configured in a variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options are shown at the top of \ the track description page, followed by a list of subtracks. To display only \ selected subtracks, uncheck the boxes next to the tracks you wish to hide. \ For more information about the graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary; they provide a\ visual cue for distinguishing the different cell lines/phenotypes.
\ \\ QCP was performed as described in Dorschner et al. See the UW/Reg\ Amplicon track description for more information.
\ \\ See the UW/Reg Amplicon track description for verification information.
\ \\ Data generation, analysis, and validation were performed jointly by groups at\ Regulome Corporation and the University of Washington (UW) in Seattle.
\\ Regulome Corp.: Michael O.\ Dorschner, Richard Humbert, Peter J. Sabo, Anthony Shafer, Jeff Goldy,\ Molly Weaver, Kristin Lee, Fidencio Neri, Brendan Henry, Mike Hawrylycz, Paul\ Tittel, Jim Wallace, Josh Mack, Janelle Kawamoto, John A. Stamatoyannopoulos.\
\\ UW Medical\ Genetics: Patrick Navas, Man Yu, Hua Cao, Brent Johnson, Ericka\ Johnson, George Stamatoyannopoulos.
\\ UW Genome Sciences: \ Scott Kuehn, Robert Thurman, William S. Noble.
\ \\ Dorschner, M.O., Hawrylycz, M., Humbert, R., Wallace, J.C., Shafer, A., \ Kawamoto, J., Mack, J., Hall, R., Goldy, J., Sabo, P.J. et al.\ High-throughput localization of functional elements by \ quantitative chromatin profiling.\ Nat Methods 1(3), 219-25 (2004).
\ \ encodeChrom 0 autoScale Off\ chromosomes chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel ENCODE UW/Regulome DNaseI hypersensitive sites/scores\ maxHeightPixels 128:16:16\ maxLimit 3\ minLimit 0\ priority 66.2\ shortLabel UW/Reg DNaseI HSs\ track encodeRegulomeProb\ type bedGraph 4\ visibility hide\ encodeRegulomeQuality UW/Reg Plate Q/A bed 5 . ENCODE UW/Regulome Plate Quality Score 0 66.3 0 0 0 127 127 127 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX,\ This track provides a visual representation of data quality scores, which range\ from 0 to 1, for each plate in the UW/Regulome experiments. \ It is one of a set of tracks that annotate continuous DNaseI \ sensitivity measurements and DNaseI hypersensitive sites (HSs) over ENCODE \ regions. \ DNaseI has long been used to map general chromatin accessibility and the \ DNaseI "hyperaccessibility" or "hypersensitivity" that is a \ universal feature of active cis-regulatory sequences.\ The data were produced using quantitative chromatin profiling (QCP) \ (Dorschner et al., 2004).
\\ Quality scores are available on the following cell lines/phenotypes and \ chromosomes:\
\\\
\\ Cell Line/Phenotype Chromosomes \ CACO2 5, 7, 9, 11, 12, 16, X \ GM06990 2, 5, 7, 8, 9, 11, 12, 16, 18, X \ SKNSH 5, 7, 9, 11, 12, 16, X \ Huh7 2, 8, 11, 18 \ HepG2 11 \ K562 11 \ Adult Erythroblast 11
\ See the UW/Reg Amplicon track for more information on the cell lines/phenotypes \ studied in these experiments.
\ \\ Plates with scores \ greater than or equal to 0.5 were conservatively considered acceptable for \ reliable scoring of HSs. Scores are shown in greyscale, with darker colors \ indicating higher scores.
\\ This composite annotation track consists of several subtracks that \ show the quality scores for each cell line/phenotype. \ To show only selected subtracks, uncheck the boxes next to the tracks \ you wish to hide. The display may also be filtered to show only those items \ with unnormalized scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page.
\\ Color differences among the subtracks are arbitrary; they provide a\ visual cue for distinguishing the different cell lines/phenotypes.
\ \\ QCP was performed as described in Dorschner et al. See the UW/Reg\ Amplicon track description for more information.\ QCP assays were formatted into 384-well plates for high-throughput real-time \ PCR. Each plate was treated as a separate experiment. \
\ Plate quality scores were computed using a Support Vector Machine (SVM). \ Trained operators manually scored 500 plates, classifying each on a scale of \ 1 to 5 to rank the degree of experimental noise. The unified set was then used \ to train an SVM to classify and score "good" and "bad" \ plates. Good plates were conservatively assigned noise scores of 1 - 3;\ bad plates received scores of 4 - 5. By performing cross validation on a 90% \ subsample of the training set, the SVM achieved an ROC (receiver\ operating characteristic) score of 0.93.
\ \\ See the UW/Reg Amplicon track description for verification information.
\ \\ Data generation, analysis, and validation were performed jointly by groups at\ Regulome Corporation and the University of Washington (UW) in Seattle.
\\ Regulome Corp.: Michael O.\ Dorschner, Richard Humbert, Peter J. Sabo, Anthony Shafer, Jeff Goldy,\ Molly Weaver, Kristin Lee, Fidencio Neri, Brendan Henry, Mike Hawrylycz, Paul\ Tittel, Jim Wallace, Josh Mack, Janelle Kawamoto, John A. Stamatoyannopoulos.\
\\ UW Medical\ Genetics: Patrick Navas, Man Yu, Hua Cao, Brent Johnson, Ericka\ Johnson, George Stamatoyannopoulos.
\\ UW Genome Sciences: \ Scott Kuehn, Robert Thurman, William S. Noble.
\ \\ Dorschner, M.O., Hawrylycz, M., Humbert, R., Wallace, J.C., Shafer, A., \ Kawamoto, J., Mack, J., Hall, R., Goldy, J., Sabo, P.J. et al.\ High-throughput localization of functional elements by \ quantitative chromatin profiling.\ Nat Methods 1(3), 219-25 (2004).
\ \ encodeChrom 1 chromosomes chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel ENCODE UW/Regulome Plate Quality Score\ maxHeightPixels 128:16:16\ priority 66.3\ shortLabel UW/Reg Plate Q/A\ track encodeRegulomeQuality\ type bed 5 .\ useScore 1\ encodeRegulomeAmplicon UW/Reg Amplicon bed 5 . ENCODE UW/Regulome Amplicon 0 66.4 0 0 0 127 127 127 1 0 10 chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX,\ This track shows a tiling path of PCR amplicons, along with their raw DNaseI\ sensitivity scores, across all ENCODE \ regions. It is one of a set of tracks that annotate continuous DNaseI \ sensitivity measurements and DNaseI hypersensitive sites (HSs) over the ENCODE \ regions. \ DNaseI has long been used to map general chromatin accessibility and the \ DNaseI "hyperaccessibility" or "hypersensitivity" that is a \ universal feature of active cis-regulatory sequences.\ The data were produced using quantitative chromatin profiling (QCP) \ (Dorschner et al., 2004).\
\\ DNaseI-treated and untreated chromatin samples from the following cell \ lines/phenotypes were studied: \
\ The display is separated into "odd" and "even" amplicons, \ to provide a visually distinct appearance among amplicons, so that\ adjacent amplicons are always in different subtracks. The details page \ for each amplicon reveals its \ start/stop coordinates and its raw DNaseI sensitivity score.\ The score is calculated by the formula (copies in DNaseI-treated / copies \ in DNaseI-untreated) * 1000.
\\ The graphical display may be filtered to show only those items \ with unnormalized scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page.
\ \\ QCP was performed as described in Dorschner et al. \ PCR amplicons of ~250 bp in size were tiled end-to-end across the study regions.\ An amplicon tiling path has been computed over all regions and is available \ through UniSTS.
\\ Chromatin preparation and DNaseI treatment were\ performed on the cell types list above as described in \ Dorschner et al. High-throughput real-time PCR was used to quantify \ DNaseI at each amplicon by measuring copies remaining in DNaseI-treated \ vs. untreated samples. The results were then analyzed with a \ statistical algorithm to compute the moving baseline of mean DNaseI sensitivity \ and to identify outliers that correspond with DNaseI hypersensitive sites.
\ \\ QCP measurements were performed in replicate (6X) on pooled biological \ replicate samples. Validation of the results was carried out by conventional \ DNaseI hypersensitivity assays using end-labeling/Southern blotting. A total of \ 1.17 Mb have been evaluated by conventional assay.
\\ The specificity was defined as the number of true negative evaluable QCP \ amplicons divided by the sum of the true negatives plus false positives. Using \ 246.2 Kb from ENm002, the specificity was calculated to be 0.997. The \ sensitivity of the QCP assay was calculated as the true positives divided by \ the sum of the true positives plus false negatives. The sensitivity measured \ for ENm002 was 0.9487.
\ \\ Data generation, analysis, and validation were performed jointly by groups at \ Regulome Corporation and the University of Washington (UW) in Seattle.
\\ Regulome Corp.: Michael O.\ Dorschner, Richard Humbert, Peter J. Sabo, Anthony Shafer, Jeff Goldy, \ Molly Weaver, Kristin Lee, Fidencio Neri, Brendan Henry, Mike Hawrylycz, Paul \ Tittel, Jim Wallace, Josh Mack, Janelle Kawamoto, John A. Stamatoyannopoulos.\
\\ UW Medical \ Genetics: Patrick Navas, Man Yu, Hua Cao, Brent Johnson, Ericka \ Johnson, George Stamatoyannopoulos.
\\ UW Genome Sciences: \ Scott Kuehn, Robert Thurman, William S. Noble.
\ \\ Dorschner, M.O., Hawrylycz, M., Humbert, R., Wallace, J.C., Shafer, A., \ Kawamoto, J., Mack, J., Hall, R., Goldy, J., Sabo, P.J. et al.\ High-throughput localization of functional elements by \ quantitative chromatin profiling.\ Nat Methods 1(3), 219-25 (2004).
\ \ encodeChrom 1 chromosomes chr2,chr5,chr7,chr8,chr9,chr11,chr12,chr16,chr18,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeChrom\ longLabel ENCODE UW/Regulome Amplicon\ origAssembly hg16\ priority 66.4\ shortLabel UW/Reg Amplicon\ track encodeRegulomeAmplicon\ type bed 5 .\ useScore 1\ visibility hide\ encodeAffyChIpHl60PvalRaraHr02 Affy RARA RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 2hrs) P-Value 0 67 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 67\ shortLabel Affy RARA RA 2h\ subGroups factor=RARA time=2h\ track encodeAffyChIpHl60PvalRaraHr02\ anyMrnaCov mRNA/Pseud bed 3 . Blastz Alignments of GenBank mRNA Including Pseudogenes 0 67 170 128 128 212 191 191 0 0 0 rna 1 color 170,128,128\ group rna\ longLabel Blastz Alignments of GenBank mRNA Including Pseudogenes\ priority 67\ shortLabel mRNA/Pseud\ track anyMrnaCov\ type bed 3 .\ visibility hide\ HInvGeneMrna H-Inv psl . H-Invitational Genes mRNA Alignments 0 67.5 0 100 100 127 177 177 0 0 0\ This track shows alignments of full-length cDNAs that were used as the basis \ of the H-Invitational Gene Database (HInv-DB). \ The HInv-DB is a human gene database containing human-curated annotation of \ 41,118 full-length cDNA clones representing 21,037 cDNA clusters.\ The project was initiated in 2002 and the database became publicly\ available in April 2004.
\\ HInv-DB entries describe the following entities:\
\ To cluster redundant cDNAs and alternative splicing variants within the H-Inv \ cDNAs, a total of 41,118 H-Inv cDNAs were mapped to the human genome using \ the mapping pipeline developed by the Japan Biological Information Research\ Center (JBIRC). The mapping yielded 40,140 cDNAs that \ were aligned against the genome using the stringent criteria of at least 95% \ identity and 90% length coverage. These 40,140 cDNAs were clustered to 20,190 \ loci, resulting in an average of 2.0 cDNAs per locus. For the remaining 978 \ unmapped cDNAs, cDNA-based clustering was applied, yielding 847 clusters. \ In total, 21,037 clusters (20,190 mapped and 847 unmapped) were identified \ and integrated into H-InvDB. H-Inv cluster IDs (e.g. HIX0000001) were\ assigned to these clusters. A representative sequence was selected from each \ cluster and used for further analyses and annotation.
\\ A full description of the construction of the HInv-DB is contained in the \ report by the H-Inv Consortium (see References section).
\ \\ The H-InvDB is hosted at the JBIRC.\ The human-curated annotations were produced during invitational annotation\ meetings held in Japan during the summer of 2002, with a follow-up\ meeting in November 2004. Participants included 158 scientists \ representing 67 institutions from 12 countries.
\\ The full-length cDNA clones and sequences were produced by the\ Chinese National Human Genome Center (CHGC), \ the Deutsches Krebsforschungszentrum (DKFZ/MIPS), \ Helix Research Institute, Inc. (HRI), \ the Institute of Medical Science in the University of Tokyo (IMSUT), \ the Kazusa DNA Research Institute (KDRI), \ the Mammalian Gene Collection (MGC/NIH) and the\ Full-Length Long Japan (FLJ) project.
\ \\ Imanishi, T. et al. \ Integrative annotation of 21,037 human genes validated by full-length cDNA clones.\ PLoS Biol. 2:(6), e162 (2004).
\ rna 1 color 0,100,100\ group rna\ longLabel H-Invitational Genes mRNA Alignments\ priority 67.5\ shortLabel H-Inv\ track HInvGeneMrna\ type psl .\ visibility hide\ encodeAffyChIpHl60SitesRaraHr02 Affy RARA RA 2h bed 3 . Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 2hrs) Sites 0 68 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 68\ shortLabel Affy RARA RA 2h\ subGroups factor=RARA time=2h\ track encodeAffyChIpHl60SitesRaraHr02\ tigrGeneIndex TIGR Gene Index genePred Alignment of TIGR Gene Index TCs Against the Human Genome 0 68 100 0 0 177 127 127 0 0 0 http://www.tigr.org/tigr-scripts/tgi/tc_report.pl?$$This track displays alignments of the TIGR Gene Index (TGI)\ against the human genome. The TIGR Gene Index is based\ largely on assemblies of EST sequences in the public databases.\ See \ www.tigr.org for more information about TIGR and the Gene Index.
\ \Thanks to Foo Cheung and Razvan Sultana of the The Institute for Genomic Research, for converting these data into a track for the browser.
\ rna 1 autoTranslate 0\ color 100,0,0\ group rna\ longLabel Alignment of TIGR Gene Index TCs Against the $Organism Genome\ priority 68\ shortLabel TIGR Gene Index\ track tigrGeneIndex\ type genePred\ url http://www.tigr.org/tigr-scripts/tgi/tc_report.pl?$$\ visibility hide\ encodeAffyChIpHl60PvalRaraHr08 Affy RARA RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 8hrs) P-Value 0 69 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 69\ shortLabel Affy RARA RA 8h\ subGroups factor=RARA time=8h\ track encodeAffyChIpHl60PvalRaraHr08\ uniGene_2 UniGene bed 12 . UniGene Hs 162 Alignments and SAGEmap Info 0 69 0 0 0 127 127 127 1 0 0\ Serial analysis of gene expression (SAGE)\ is a quantitative measurement of gene expression. Data are presented for every\ cluster contained in the browser window and the selected cluster name is \ highlighted in the table. All data are from the repository at the \ SageMap project\ built on UniGene version Hs 162. Click on a UniGene cluster name on the track\ details page to display SageMap's page for that cluster. Please note that data \ are not available for every cluster. There is no data available for clusters\ that lie entirely within the bounds of larger clusters.
\ \\ SAGE counts are produced by sequencing small "tags" of DNA believed \ to be associated with a gene. These tags were generated by attaching \ poly-A RNA to oligo-dT beads. After synthesis of double-stranded cDNA, \ transcripts were cleaved by an anchoring enzyme (usually NlaIII). Then, small \ tags were produced by ligation with a linker containing a type IIS restriction\ enzyme site and cleavage with the tagging enzyme (usually BsmFI). The \ tags were concatenated together and sequenced. The frequency of each \ tag was counted and used to infer expression level of transcripts that could\ be matched to that tag.
\ \\ All SAGE data presented here were mapped to UniGene transcripts by the \ SageMap project at NCBI.
\ \\ This track shows the UniGene genes from NCBI.\ Each UniGene entry is a set of transcript sequences that appear to come from the same transcription locus (gene or expressed pseudogene), together with information on protein similarities, gene expression, cDNA clone reagents, and genomic location. \
\\ Coding exons are represented by \ blocks connected by horizontal lines representing introns. \ In full display mode, arrowheads \ on the connecting intron lines indicate the direction of transcription.
\ \\ The UniGene sequence file, Hs.seq.uniq.gz, is downloaded from NCBI.\ Sequences are aligned to base genome using BLAT to create this track.
\\ When a single UniGene gene aligned in multiple places, \ the alignment having the highest base identity was found. \ Only alignments having a base identity level within 0.2% of the best and \ at least 96.5% base identity with the genomic sequence were kept. \
\\ Thanks to UniGene for \ providing this annotation. \
\ rna 1 group rna\ longLabel UniGene Alignments\ priority 69\ shortLabel UniGene\ spectrum on\ track uniGene_3\ type psl\ url http://www.ncbi.nlm.nih.gov/UniGene/clust.cgi?ORG=Hs&CID=\ visibility hide\ encodeAffyChIpHl60SitesRaraHr08 Affy RARA RA 8h bed 3 . Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 8hrs) Sites 0 70 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 8hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 70\ shortLabel Affy RARA RA 8h\ subGroups factor=RARA time=8h\ track encodeAffyChIpHl60SitesRaraHr08\ encodeAllElements Consens Elements bed 5 . NHGRI/PSU/UCSC/Stanford TBA and MLAGAN Consensus Conserved Elements 0 70 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ These tracks represent conserved elements detected by any (union) or all\ (intersection) combinations of elements produced by binCons, phastCons, and \ GERP conservation scoring methods applied to TBA and MLAGAN sequence alignments \ of 23 vertebrates in the ENCODE regions. \
\ For more information on the individual subtracks, see the\ description pages for the TBA Elements and MLAGAN Elements tracks.
\ \\ The locations of conserved elements are indicated by blocks in the graphical \ display. The display may be filtered to show only those items \ with unnormalized scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page. To show only selected subtracks within this annotation, \ uncheck the boxes next to the tracks you wish to hide.
\ \\ See the description pages for the TBA Elements and MLAGAN Elements \ for additional information about methods used to generate these data.
\ \\ See the description pages for the TBA Elements and MLAGAN Elements \ for information about verification techniques used to generate these data.
\ \\ BinCons and phastCons MCS data were contributed by Elliott Margulies in the \ Eric Green lab at \ NHGRI, with assistance from Adam Siepel of UCSC.
\\ GERP was developed primarily by Greg Cooper in the lab of \ Arend Sidow \ at Stanford University (Depts of Pathology and Genetics), in close collaboration\ with Eric Stone (Biostatistics, NC State), and George Asimenos and \ Eugene Davydov in the lab of \ Serafim Batzoglou \ (Dept. of Computer Science, Stanford).
\\ The intersection and union data shown in these subtracks were contributed by\ Elliott Margulies.
\ \\ See the TBA/MLAGAN Alignment and TBA/MLAGAN Cons tracks for \ references.
\ encodeCompGeno 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ exonArrows off\ group encodeCompGeno\ longLabel NHGRI/PSU/UCSC/Stanford TBA and MLAGAN Consensus Conserved Elements\ priority 70.0\ shortLabel Consens Elements\ track encodeAllElements\ type bed 5 .\ visibility hide\ uniGene UniGene psl . UniGene Alignments and SAGE Info 0 70 0 0 0 127 127 127 0 0 0 rna 1 group rna\ longLabel UniGene Alignments and SAGE Info\ priority 70\ shortLabel UniGene\ track uniGene\ type psl .\ visibility hide\ encodeTbaAlign TBA Alignment wigMaf 0.0 1.0 NHGRI/PSU TBA Alignments 0 70.1 0 10 100 1 128 0 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays human-centric multiple sequence alignments in the ENCODE \ regions for the 23 vertebrates in the May 2005 ENCODE MSA freeze, based on \ comparative sequence data generated for the ENCODE project.\ The alignments in this track were generated using the\ Threaded Blockset Aligner (TBA).\ A complete list of the vertebrates included in the May 2005 freeze may be found\ at the top of the description page for this track.
\\ The Genome Browser companion tracks, TBA Cons and TBA Elements, display \ conservation scoring and conserved elements for these alignments based on \ various conservation methods.
\ \\ In full display mode, this track shows pairwise alignments\ of each species aligned to the human genome. \ The alignments are shown in dense display mode using a gray-scale\ density gradient. The checkboxes in the track configuration section allow\ the exclusion of species from the pairwise display.\
\ When zoomed-in to the base-display level, the track shows the base\ composition of each alignment. The numbers and symbols on the "human\ gap" line indicate the lengths of gaps in the human sequence at those\ alignment positions relative to the longest non-human sequence. \ If there is sufficient space in the display, the size of the gap is shown; \ if not, and if the gap size is a multiple of 3, a "*" is displayed, \ otherwise "+" is shown. \ To view detailed information about the\ alignments at a specific position, zoom in the display to 30,000 or fewer \ bases, then click on the alignment.
\ \\ The TBA was used to align sequences in the May 2005 ENCODE sequence data \ freeze. Multiple alignments were seeded from a series of combinatorial pairwise \ blastz alignments (not referenced to any one species). The specific \ combinations were determined by the\ species guide tree. Additionally, a \ blastz.specs file\ was used to fine-tune the blastz parameters, based on the evolutionary\ distance of the species being compared.\ The resulting multiple alignments were projected onto the human reference\ sequence.
\ \\ The TBA multiple alignments were created by Elliott Margulies of the \ Green Lab at NHGRI. \
\\ The programs Blastz and TBA, which were used to generate the alignments, were\ provided by Minmei Hou, Scott Schwartz and Webb Miller of the \ Penn State Bioinformatics \ Group.
\The phylogenetic tree is based on Murphy et al. (2001) and general\ consensus in the vertebrate phylogeny community.\
\ \\ Blanchette, M., Kent, W.J., Reimer, C., Elnitski, L., Smit, A.,\ Roskin, K., Baertsch, R., Rosenbloom, K.R., Clawson, H. et al.\ Aligning Multiple Genomic Sequences With the Threaded Blockset \ Aligner.\ Genome Res 14, 708-15 (2004).
\\ Chiaromonte, F., Yap, V.B., and Miller, W. \ Scoring pairwise genomic sequence alignments.\ Pac Symp Biocomput 2002, 115-26 (2002).
\\ Schwartz, S., Kent, W.J., Smit, A., Zhang, Z., Baertsch, R., Hardison, R., \ Haussler, D. and Miller, W.\ Human-Mouse Alignments with BLASTZ.\ Genome Res 13(1):103-7 (2003).
\Murphy, W.J., et al.\ Resolution of the early placental mammal radiation using Bayesian phylogenetics. Science 294(5550), 2348-51 (2001).
\ encodeCompGeno 1 altColor 1,128,0\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 0, 10, 100\ dataVersion ENCODE June 2005 Freeze\ group encodeCompGeno\ longLabel NHGRI/PSU TBA Alignments\ priority 70.1\ sGroup_mammal monDom1 platypus\ sGroup_placental rn3 mm6 rabbit cow canFam1 rfbat hedgehog armadillo elephant tenrec\ sGroup_primate panTro1 baboon rheMac1 marmoset galago\ sGroup_vertebrate galGal2 xenTro1 danRer2 tetNig1 fr1\ shortLabel TBA Alignment\ speciesGroups primate placental mammal vertebrate\ summary encodeTbaSummary\ track encodeTbaAlign\ treeImage phylo/hg16_23way.gif\ type wigMaf 0.0 1.0\ visibility hide\ encodeAffyChIpHl60PvalRaraHr32 Affy RARA RA 32h wig 0.0 534.54 Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 32hrs) P-Value 0 71 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 32hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 71\ shortLabel Affy RARA RA 32h\ subGroups factor=RARA time=32h\ track encodeAffyChIpHl60PvalRaraHr32\ rnaCluster Gene Bounds bed 12 . Gene Boundaries as Defined by RNA and Spliced EST Clusters 0 71 200 0 50 227 127 152 0 0 0\ This track shows the boundaries of genes and the direction of\ transcription as deduced from clustering spliced ESTs and mRNAs\ against the genome. When many spliced variants of the same gene exist, \ this track shows the variant that spans the greatest distance in the \ genome.
\ \\ ESTs and mRNAs from \ GenBank were aligned against the genome using BLAT.\ Alignments with less than 97.5% base identity within the aligning blocks \ were filtered out. When multiple alignments occurred, only those\ alignments with a percentage identity within 0.2% of the\ best alignment were kept. The following alignments were also discarded: \ ESTs that aligned without any introns, blocks smaller than 10 bases, and \ blocks smaller than 130 bases that were not located next to an intron. \ The orientations of the ESTs and mRNAs were deduced from the GT/AG splice \ sites at the introns; ESTs and mRNAs with overlapping blocks\ on the same strand were merged into clusters. Only the\ extent and orientation of the clusters are shown in this track.
\\ Scores for individual gene boundaries were assigned based on the number of \ cDNA alignments used:\
\ This track, which was originally developed by Jim Kent,\ was generated at UCSC and uses data submitted to GenBank by \ scientists worldwide.
\ \\ Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL. \ GenBank: update. Nucleic Acids Res. \ 2004 Jan 1;32:D23-6.
\\ Kent WJ.\ BLAT - the BLAST-like alignment tool.\ Genome Res. 2002 Apr;12(4):656-64.
\ rna 1 color 200,0,50\ group rna\ longLabel Gene Boundaries as Defined by RNA and Spliced EST Clusters\ priority 71\ shortLabel Gene Bounds\ track rnaCluster\ type bed 12 .\ visibility hide\ encodeTbaCons TBA Cons wig 0.0 1.0 NHGRI/PSU/UCSC TBA Conservation 0 71 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays different measurements of conservation based on \ the Threaded Blockset Aligner (TBA) multiple sequence alignments of \ ENCODE regions shown in the TBA Alignment track. Three programs — binCons\ (binomial-based conservation method), phastCons (phylogenetic hidden-Markov\ model method), and \ GERP (Genomic Evolutionary Rate Profiling)\ — generated the conservation scoring used to create this track. A related \ track, TBA Elements, shows multi-species conserved sequences (MCSs) based on\ the conservation measurements displayed in this track.
\\ For details on the conservation scores generated by each program, refer to the \ individual Methods subsections.
\ \\ The subtracks within this composite annotation track, which\ show data from the binCons, phastCons and GERP programs, may be configured in a \ variety of ways to highlight different aspects of the \ displayed data. The graphical configuration options \ are shown at the top of the track description page, followed by a list of \ the subtracks. A subtrack may be hidden from view by checking the box to the\ left of the track name in the list. For more information about the \ graphical configuration options, click the \ Graph\ configuration help link.
\\ Color differences among the subtracks are arbitrary; they provide a\ visual cue for distinguishing the different gene prediction methods. See the\ Methods section for display information specific to each subtrack.
\ \\ The methods used to create the TBA alignments in the ENCODE\ regions are described in the TBA Alignment track description.
\ \\ The binCons score is based on the cumulative binomial probability of \ detecting the observed number of identical bases (or greater) in \ sliding 25 bp windows (moving one bp at a time) between the \ reference sequence and each other species, given the neutral rate\ at four-fold degenerate sites. Neutral rates are calculated\ separately at each targeted region. For targets with no gene annotations,\ the average percent identity across all alignable sequence was instead used\ to weight the individual species binomial scores (this latter\ weighting scheme was found to closely match 4D weights).
\\ The negative log of these P-values was then averaged across all \ human-referenced pairwise combinations, and the highest scoring overlapping \ 25 bp window for each base was the resulting score. This track shows the \ plotting of a ranked percentile score normalized between 0 and 1 across all \ ENCODE regions, such that the top 5% most conserved sequence across all ENCODE\ regions have a score of 0.95 or greater (top 10% have a score of 0.9 or \ greater, and so on).
\\ BinCons scores were normalized to represent a percentile to the power of\ 10. For example, scores representing the top 1 percent most conserved\ sequence, 99th percentile, have a score greater than or equal to 0.99^10\ = 0.904. Transforming scores to the power of 10 was done for visual\ purposes only, in order to accentuate and distinguish the peaks of more\ highly conserved regions.
\\ More details on binCons can be found in Margulies et. al. (2003)\ cited below.
\ \\ The phastCons program predicts conserved elements and produces base-by-base\ conservation scores using a two-state phylogenetic hidden Markov model.\ The model consists of a state for conserved regions and a \ state for nonconserved regions, each of which is associated with a \ phylogenetic model. These two models are identical\ except that the branch lengths of the conserved phylogeny are \ multiplied by a scaling parameter rho (0 < rho < 1).
\\ For determining the conservation for the ENCODE TBA\ alignments, the nonconserved model was estimated \ from four-fold degenerate coding sites within the ENCODE regions using \ the program phyloFit. The parameter rho was then estimated by \ maximum likelihood, conditional on the nonconserved model, using the EM \ algorithm implemented in phastCons. Parameter estimation was based on \ a single large alignment, constructed by concatenating the \ alignments for all conserved regions.
\\ PhastCons was run with the options --expected-lengths 15 and\ --target-coverage 0.05 to obtain the desired level of \ "smoothing" and a final coverage by conserved elements of 5%.
\\ The conservation score at each base is the posterior probability that the\ base was generated by the conserved state of the phylo-HMM. It can\ be interpreted as the probability that the base is in a conserved\ element, given the assumptions of the model and the estimated parameters.\ Scores range from 0 to 1, with higher scores corresponding to\ higher levels of conservation.
\\ More details on phastCons can be found in Siepel et. al. (2005)\ cited below.
\ \\ The GERP score is the expected substitution rate divided by\ the observed substitution rate at a particular human base.\ Scores are estimated on a column-by-column basis using multiple sequence\ alignments of mammalian genomic DNA generated by MLAGAN.\ The scores range from 0 to 3; those greater than 3 are clipped to 3. \ The expected and observed rates are\ both calculated on a phylogenic tree using the same fixed topology.\ The branch lengths of the expected tree are based on the average\ substitutions at neutral sites. The branch lengths of the observed\ tree, which is calculated separately for each human base, are based on the\ substitutions seen at the column of the\ multiple alignment at that base. Species that have gaps at\ a particular column are not considered in the scoring for that column.
\\ Higher scores correspond to human\ bases in alignment columns with higher degrees of similarity, i.e.\ bases that have evolved slowly, some of which have been under purifying \ selection. The opposite holds true for swiftly evolving (low similarity) \ columns.
\\ Scores are deterministic, given a maximum-likelihood model of\ nucleotide substitution, species topology, neutral tree, and alignment.
\ \\ BinCons was developed by Elliott Margulies of the \ Eric Green lab at \ NHGRI.
\\ PhastCons was developed by Adam Siepel in the \ Haussler lab at UCSC.
\\ GERP was developed primarily by Greg Cooper in the lab of\ Arend Sidow\ at Stanford University\ (Depts of Pathology and Genetics), in close collaboration with\ Eric Stone (Biostatistics, NC State), and George Asimenos and\ Eugene Davydov in the lab of\ Serafim Batzoglou\ (Dept. of Computer Science, Stanford).
\\
\ TBA was provided by Minmei Hou, Scott Schwartz and Webb Miller of the \ Penn State Bioinformatics \ Group.
\\ The data for this track were generated by Elliott Margulies, \ with assistance from Adam Siepel.
\ \\ Blanchette, M., Kent, W.J., Reimer, C., Elnitski, L., Smit, A.,\ Roskin, K., Baertsch, R., Rosenbloom, K.R., Clawson, H. et al.\ Aligning Multiple Genomic Sequences With the Threaded Blockset \ Aligner.\ Genome Res 14, 708-15 (2004).
\\ Cooper, G.M., Stone, E.A., Asimenos, G., NISC Comparative Sequencing Program,\ Green, E.D., Batzoglou, S. and Sidow, A.\ Distribution and intensity of constraint in mammalian genomic \ sequence.\ Genome Res. 15, 901-13 (2005).
\\ Margulies, E.H., Blanchette, M., NISC Comparative Sequencing Program, \ Haussler, D. and Green, E.D. \ Identification and characterization of multi-species conserved \ sequences. \ Genome Res. 13, 2507-18 (2003).
\\ Siepel, A., Bejerano, G., Pedersen, J.S., Hinrichs, A., Hou, M.,\ Rosenbloom, K., Clawson, H., Spieth, J., Hillier, L.W. et al.\ Evolutionarily conserved elements in vertebrate,\ insect, worm, and yeast genomes. \ Genome Res. 15, 1034-1050 (2005).
\ encodeCompGeno 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeCompGeno\ longLabel NHGRI/PSU/UCSC TBA Conservation\ maxHeightPixels 100:25:11\ priority 71.0\ shortLabel TBA Cons\ track encodeTbaCons\ type wig 0.0 1.0\ visibility hide\ windowingFunction mean\ encodeAffyChIpHl60SitesRaraHr32 Affy RARA RA 32h bed 3 . Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 32hrs) Sites 0 72 25 200 0 140 227 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 25,200,0\ longLabel Affymetrix ChIP/Chip (RARA retinoic acid-treated HL-60, 32hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 72\ shortLabel Affy RARA RA 32h\ subGroups factor=RARA time=32h\ track encodeAffyChIpHl60SitesRaraHr32\ genieBounds Clone Bounds bed 9 . Clone Boundaries from EST Mate Pairs 0 72 178 34 34 216 144 144 0 0 0These clone bounds are based on EST mate pairs from \ Affymetrix's \ Genie gene finding software. \
\ rna 1 color 178,34,34\ group rna\ longLabel Clone Boundaries from EST Mate Pairs\ priority 72\ shortLabel Clone Bounds\ track genieBounds\ type bed 9 .\ visibility hide\ exonWalk ExonWalk genePred ExonWalk Alt-Splicing Transcripts 0 72 23 58 58 139 156 156 0 0 0The ExonWalk program merges cDNA evidence together to predict full\ length isoforms, including alternative transcripts. To predict\ transcripts that are biologically functional, rather than the result\ of technical or biological noise, ExonWalk requires that every intron\ and exon be either: 1) Present in cDNA libraries of another organism\ (i.e. also present in mouse), 2) Have three separate cDNA GenBank\ entries supporting it, or 3) Be evolving like a coding exon as\ determined by Exoniphy.\ Once the transcripts are predicted an ORF finder (BESTORF from\ Softberry) is used to find the\ best open reading frame. By default transcripts that are targets for\ nonsense mediated decay (NMD) are filtered out as they are less likely\ to be translated into proteins.\ \
The input to the ExonWalk program is the AltSplice track which has\ filtered out exons and introns that are not: 1) Present in cDNA\ libraries of another organism (i.e. also present in mouse), 2) Have\ three separate cDNA GenBank entries supporting it, or 3) Be evolving\ like a coding exon as determined by Exoniphy.\ \
The ExonWalk algorithm takes these filtered sequences and\ constructs a graph where the exons are the nodes and the introns are\ the edges. The goal of the program is to produce all full length\ transcripts implied by the transcripts. Full length transcripts are\ defined as transcripts that are not a subsequences of another\ transcript. The stages of the algorithm can be divided into three\ steps as illustrated in Figure 1 below:\ \
\
![]() |
\ Different stages of the ExonWalk Program. A. Different\ transcripts for a particular gene have been aligned to the genome to\ give an order and orientation. B. Exons in the overlapping\ section of compatible transcripts are joined to form new\ edges. C. Vertices which are redundant are pruned from the\ graph, being replaced by edges from other, equivalent, vertices. This\ simplifies the initial graph and yet retains splicing specific\ information. D. The maximal paths through the graph are\ explored to produce a set of maximal (full length) transcripts.\ |
Initially each each transcript is an independent sub-graph in the\ exon graph. Individual transcripts are then compared pairwise to\ determine if they are compatible. If they are compatible, an edge is\ created between exons of the overlap, called a compatibility edge.\ This results in a directed graph where overlapping exons are connected\ together, and thus compatible transcripts have been connected as well\ (Figure 1B). The algorithm then makes use of the\ implicit order provided by the genome sequence and the fact that\ splicing occurs in order to explore all of the paths present in the\ graph.\ \
Comments/Questions? Email sugnet@soe.ucsc.edu\ genes 1 color 23,58,58\ group genes\ longLabel ExonWalk Alt-Splicing Transcripts\ priority 72\ shortLabel ExonWalk\ track exonWalk\ type genePred\ visibility hide\ exonWalk2 ExonWalk2 genePred ExonWalk Alt-Splicing Transcripts - take 2 0 72.01 23 58 58 139 156 156 0 0 0 genes 1 color 23,58,58\ group genes\ longLabel ExonWalk Alt-Splicing Transcripts - take 2\ priority 72.01\ shortLabel ExonWalk2\ track exonWalk2\ type genePred\ visibility hide\ exonWalkRna ExonWalkRna genePred ExonWalk Alt-Splicing Transcripts mRNA only, no orthology 0 72.02 23 58 58 139 156 156 0 0 0 genes 1 color 23,58,58\ group genes\ longLabel ExonWalk Alt-Splicing Transcripts mRNA only, no orthology\ priority 72.02\ shortLabel ExonWalkRna\ track exonWalkRna\ type genePred\ visibility hide\ exonWalkRnaNoCds ExonWalkRnaNoCds bed 12 Exonwalk on Rna only, no orthology, no CDS mapping 0 72.03 23 58 58 139 156 156 0 0 0 genes 1 color 23,58,58\ group genes\ longLabel Exonwalk on Rna only, no orthology, no CDS mapping\ priority 72.03\ shortLabel ExonWalkRnaNoCds\ track exonWalkRnaNoCds\ type bed 12\ visibility hide\ agxMapped agxMapped bed 12 . Condensed version of AltGraphX Mapped from Mouse 0 72.1 153 26 42 204 140 148 0 0 0 rna 1 color 153,26,42,\ group rna\ longLabel Condensed version of AltGraphX Mapped from Mouse\ priority 72.1\ shortLabel agxMapped\ track agxMapped\ type bed 12 .\ visibility hide\ orthoIntrons orthoIntrons bed 12 . Bed version of AltGraphX Mapped Inrons from Mouse 0 72.2 107 74 34 181 164 144 0 0 0 rna 1 color 107,74,34,\ group rna\ longLabel Bed version of AltGraphX Mapped Inrons from Mouse\ priority 72.2\ shortLabel orthoIntrons\ track orthoIntrons\ type bed 12 .\ visibility hide\ encodeAffyChIpHl60PvalSirt1Hr00 Affy SIRT1 RA 0h wig 0.0 534.54 Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 0hrs) P-Value 0 73 0 225 0 127 240 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,225,0\ longLabel Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 0hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 73\ shortLabel Affy SIRT1 RA 0h\ subGroups factor=SIRT1 time=0h\ track encodeAffyChIpHl60PvalSirt1Hr00\ altGraph AltGraph psl . AltGraph 0 73 0 0 0 127 127 127 0 0 0 rna 1 group rna\ longLabel AltGraph\ priority 73\ shortLabel AltGraph\ track altGraph\ type psl .\ visibility hide\ encodeTbaElements TBA Elements bed 5 . NHGRI/PSU/UCSC TBA Conserved Elements 0 73 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,
\ This track displays multi-species conserved sequences (MCSs)\ derived from binCons, phastCons, and genomic evolutionary rate profiling \ (GERP) conservation scoring\ of Threaded Blockset Aligner (TBA) multiple sequence alignments in the \ ENCODE regions. The combined-methods subtracks show the union/intersection\ of conserved elements produced by the three conservation methods.\
\ The multiple sequence alignments may be viewed in the TBA Alignment\ track. Another related track, TBA Cons, shows the conservation scoring. \ The descriptions accompanying these tracks detail\ the methods used to create the alignments and conservation.
\ \\ The locations of conserved elements are indicated by blocks in the graphical\ display. This composite annotation track consists of several subtracks that\ show conserved elements derived by the three methods listed above, as well as \ both unions and intersections of the sets of conserved and non-coding conserved \ elements. To show only selected subtracks, uncheck the boxes next to the tracks\ you wish to hide.
\The display may also be filtered to show only those items\ with unnormalized scores that meet or exceed a certain threshhold. To set a\ threshhold, type the minimum score into the text box at the top of the \ description page.
\\ Display characteristics specific to certain subtracks are described in the\ respective Methods sections below.
\ \\
\ For each ENCODE target, a conservation score threshold was picked to match\ the number of conserved bases predicted by phastCons, an alternative method\ for measuring conservation. This latter method has been found slightly more\ reliable for predicting the expected fraction of conserved sequence \ in each target. Clusters of bases\ that exceeded the given conservation score threshold were designated \ as MCSs. The minimum length of an MCS is 25\ bases. Strict cutoffs were used: if even one base fell below the\ conservation score threshold, it separated an MCS into two distinct\ regions.
\ \\ The predicted MCSs are segments of the alignment that are likely to\ have been "generated" by the conserved state of the phylo-HMM,\ i.e., maximal segments in which the maximum-likelihood (Viterbi)\ path remains in the conserved state.
\ \\ GERP elements are scored according to the inferred intensity\ of purifying selection\ and are measured as "rejected substitutions" (RSs). RSs capture the\ magnitude of difference between the number of "observed" substitutions\ (estimated using maximum likelihood) and the number that would be\ "expected" under a neutral model of evolution. \ The RS is displayed as part of the item name.\ Items with higher RSs are displayed in a darker shade of blue. The score shown \ on the details page, which has been scaled by 300 for display purposes, is \ generally not as accurate as the RS count that is part of the item name.
\\ "Constrained elements" are identified as those groups\ of consecutive human bases that have an observed rate of evolution that is\ smaller than the expected rate. These groups of columns are merged if they\ are less than a few nucleotides apart and are scored according to the sum of\ the site-by-site difference between observed and expected rates (RS).
\\ Permutations of the actual alignments were analyzed, and the "constrained\ elements" identified in these permuted alignments were treated as\ "false positives". Subsequently, an RS threshold was picked such\ that the total length of "false positive" constrained elements\ (identified in the permuted alignments) was less than 5% of the length of\ constrained elements identified in the actual alignment.\ Thus, all annotated constrained elements are significant at better\ than 95% confidence, and the total fraction of the ENCODE regions\ annotated as constrained is 5-7%.
\ \\ These subtracks were produced by creating unions and intersections of the\ constrained element data detected by binCons, phastCons, and GERP on TBA \ alignments. In these annotations, "non-coding" is defined as those \ regions not overlapping with CDS regions in any of the following UCSC gene \ tables: refFlat, knownGene, mgcGenes, vegaGene, or ensGene.
\ \\ BinCons and phastCons MCS data were contributed by Elliott Margulies in the \ Eric Green lab at \ NHGRI, with assistance from Adam Siepel of UCSC.
\\ GERP was developed primarily by Greg Cooper in the lab of\ Arend Sidow\ at Stanford University (Depts of Pathology and Genetics), in close collaboration\ with Eric Stone (Biostatistics, NC State), and George Asimenos and\ Eugene Davydov in the lab of\ Serafim Batzoglou\ (Dept. of Computer Science, Stanford).
\\ TBA was provided by Minmei Hou, Scott Schwartz and Webb Miller of the \ Penn State Bioinformatics \ Group.
\ \\ See the TBA Alignment and TBA Cons tracks for references.
\ encodeCompGeno 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ exonArrows off\ group encodeCompGeno\ longLabel NHGRI/PSU/UCSC TBA Conserved Elements\ priority 73.0\ shortLabel TBA Elements\ track encodeTbaElements\ type bed 5 .\ visibility hide\ encodeAffyChIpHl60SitesSirt1Hr00 Affy SIRT1 RA 0h bed 3 . Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 0hrs) Sites 0 74 0 225 0 127 240 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 0,225,0\ longLabel Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 0hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 74\ shortLabel Affy SIRT1 RA 0h\ subGroups factor=SIRT1 time=0h\ track encodeAffyChIpHl60SitesSirt1Hr00\ altGraphX Alt-Splicing altGraphX Alternative Splicing from ESTs/mRNAs 0 74 0 0 0 127 127 127 0 0 0\ This track summarizes alternative splicing shown in the mRNA and\ EST tracks. The blocks represent exons; lines indicate possible\ splice junctions. The graphical display is drawn such that no exons\ overlap, making alternative events easier to view when the track\ is in full display mode and the resolution is set to approximately gene-level.\
\\ To help reduce the noise present in the EST libraries, \ exons and splice junctions are filtered based on orthologous mouse\ transcripts and the frequency with which an exon or intron appears in human\ transcript libraries. Only those exons and splice junctions that have\ an orthologous exon or splice junction in the mouse\ transcriptome or are present three or more times in the human transcriptome are \ kept. Transcripts labeled as mRNA in GenBank are weighted more heavily, \ reflecting their typically higher quality. This process is similar\ to that presented in Sugnet, C.W. et al.,\ Transcriptome and genome conservation of alternative splicing \ events in humans and mice. \ Pacific Symposium on Biocomputing (PSB) 2004 Online Proceedings.
\ \\ The splicing graphs for each genome were generated separately\ from their native EST and mRNA transcripts using the following\ process: \
\ After the splicing graphs were constructed independently for both\ human and mouse, they were mapped to each other using the entire set of\ genome mouse net alignments (viewable on the browser as\ the Mouse Net track). Only those exons and splice junctions that were common \ to both or occurred three or more times in the human transcript were kept \ in the splicing graph. When counting the number of times an\ exon or splice junction was included in the human transcripts, those\ designated as mRNA were weighted more heavily than those designated as \ EST.
\ \\ For more information on the mouse net alignments, see \ Kent, W.J., Baertsch, R., Hinrichs, A., Miller, W., and Haussler, D.\ Evolution's cauldron: \ Duplication, deletion, and rearrangement in the mouse and human genomes. \ Proc Natl Acad Sci USA 100(20), 11484-11489 (2003).
\ \\ This annotation was generated by Chuck \ Sugnet of the UCSC Genome Bioinformatics Group.
\ rna 1 group rna\ longLabel Alternative Splicing from ESTs/mRNAs\ priority 74\ shortLabel Alt-Splicing\ track altGraphX\ type altGraphX\ visibility hide\ encodeMlaganAlign MLAGAN Alignment wigMaf 0.0 1.0 Stanford MLAGAN Alignments 0 74 0 10 100 1 128 0 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays human-centric multiple sequence alignments in the \ ENCODE regions for the 23 vertebrates included in the May 2005 ENCODE MSA \ freeze, based on comparative sequence data generated for the ENCODE project. \ The alignments in this track were generated using the \ LAGAN Alignment Toolkit.\ A complete list of the vertebrates included in the May 2005 freeze may be found\ at the top of the description page for this track.
\\ The Genome Browser companion tracks, MLAGAN Cons and MLAGAN Elements, \ display conservation scoring and conserved elements for these alignments based \ on various conservation methods.
\ \\ In full display mode, this track shows pairwise alignments\ of each species aligned to the human genome.\ The alignments are shown in dense display mode using a gray-scale\ density gradient. The checkboxes in the track configuration section allow\ the exclusion of species from the pairwise display.\
\ When zoomed-in to the base-display level, the track shows the base\ composition of each alignment. The numbers and symbols on the "human\ gap" line indicate the lengths of gaps in the human sequence at those\ alignment positions relative to the longest non-human sequence. \ If there is sufficient space in the display, the size of the gap is shown; \ if not, and if the gap size is a multiple of 3, a "*" is displayed, \ otherwise "+" is shown. \ To view detailed information about the\ alignments at a specific position, zoom in the display to 30,000 or fewer\ bases, then click on the alignment.
\ \\ To create the alignments, the sequence of each non-human species was first \ "rearranged" to be orthologously collinear with respect to the human \ sequence. The rearrangements were generated using a suite of tools and \ algorithms based on Shuffle-LAGAN and SuperMap.\ For each pairing of human sequence with that of another species, Shuffle-LAGAN \ was used to find the best-scoring chain of local similarities according to \ a scoring scheme that penalized evolutionary rearrangements. SuperMap was then\ used to aggregate parts of the chain into a human-monotonic map of syntenic \ blocks. This mapping was used to undo the genomic rearrangements of the other \ sequence and convert it to a form that was directly alignable to the human \ sequence.\
\ A multiple global alignment was created for every region using \ MLAGAN. The alignments \ were then refined using \ MUSCLE, which \ processes small non-overlapping windows of an alignment and attempts to realign \ them in an iterative fashion, keeping the refined alignment\ if it has a better sum-of-pairs score than the original.
\ \\ The MLAGAN alignments were generated by George Asimenos from Stanford's \ ENCODE group.
\\ Shuffle-LAGAN, SuperMap and MLAGAN were written by Mike Brudno.
\\ MUSCLE was authored by Bob Edgar.
\The phylogenetic tree is based on Murphy et al. (2001) and general\ consensus in the vertebrate phylogeny community.\
\ \\ Brudno, M., Do, C., Cooper, G., Kim, M.F., Davydov, E., Green, E.D., Sidow, A.\ and Batzoglou, S. \ LAGAN and Multi-LAGAN: efficient tools for large-scale multiple \ alignment of genomic DNA.\ Genome Res. 13(4), 721-31 (2003).
\\ Brudno, M., Malde, S., Poliakov, A., Do, C., Courone, O., Dubchak, I. and \ Batzoglou, S.\ Glocal alignment: finding rearrangements during alignment.\ Bioinformatics 19(Suppl. 1), i54-i62 (2003).
\\ Edgar, R.C. \ MUSCLE: multiple sequence alignment with high\ accuracy and high throughput.\ Nucl. Acids Res. 32(5), 1792-97 (2004).
\\ Murphy, W.J., et al.\ Resolution of the early placental mammal radiation using Bayesian phylogenetics. Science 294(5550), 2348-51 (2001).
\ encodeCompGeno 1 altColor 1,128,0\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 0, 10, 100\ dataVersion ENCODE June 2005 Freeze\ group encodeCompGeno\ longLabel Stanford MLAGAN Alignments\ maxHeightPixels 100:40:11\ priority 74.0\ sGroup_mammal opossum platypus\ sGroup_placental rat mouse rabbit cow dog rfbat hedgehog armadillo elephant tenrec\ sGroup_primate chimp baboon rhesus marmoset galago\ sGroup_vertebrate chicken x_tropicalis zebrafish tetraodon fugu\ shortLabel MLAGAN Alignment\ speciesGroups primate placental mammal vertebrate\ summary encodeMlaganSummary\ track encodeMlaganAlign\ treeImage phylo/hg16_23way.gif\ type wigMaf 0.0 1.0\ visibility hide\ altGraphX2 Alt-Splicing2 altGraphX Alternative Splicing from ESTs/mRNAs - test take 2 0 74.1 0 0 0 127 127 127 0 0 0 rna 1 group rna\ longLabel Alternative Splicing from ESTs/mRNAs - test take 2\ priority 74.1\ shortLabel Alt-Splicing2\ track altGraphX2\ type altGraphX\ visibility hide\ encodeMlaganCons MLAGAN Cons wig 0.0 1.0 Stanford MLAGAN Conservation 0 74.1 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays different measurements of conservation based on\ the MLAGAN multiple sequence alignments of ENCODE regions \ shown in the MLAGAN Alignment track. Three programs — binCons\ (binomial-based conservation method), phastCons (phylogenetic hidden-Markov\ model method), and\ GERP (Genomic Evolutionary Rate Profiling)\ — generated the conservation scoring used to create this track. A related \ track, MLAGAN Elements, shows multi-species conserved sequences (MCSs) based on\ the conservation measurements displayed in this track.
\\ For details on the conservation scores generated by each program, refer to the \ individual Methods subsections.
\ \\ This annotation follows the display conventions for composite \ tracks. The subtracks within this annotation may be configured in a variety of \ ways to highlight different aspects of the displayed data. The graphical \ configuration options are shown at the top of the track description page, \ followed by a list of subtracks. To display only selected subtracks, uncheck \ the boxes next to the tracks you wish to hide. For more information about the \ graphical configuration options, click the \ Graph configuration \ help link.
\\ Color differences among the subtracks are arbitrary; they provide a\ visual cue for distinguishing the different gene prediction methods. See the\ Methods section for display information specific to each subtrack.
\ \\ The methods used to create the MLAGAN alignments in the ENCODE\ regions are described in the MLAGAN Alignment track description.
\ \\ The binCons score is based on the cumulative binomial probability of\ detecting the observed number of identical bases (or greater) in\ sliding 25 bp windows (moving one bp at a time) between the\ reference sequence and each other species, given the neutral rate\ at four-fold degenerate sites. Neutral rates are calculated\ separately at each targeted region. For targets with no gene annotations,\ the average percent identity across all alignable sequence was instead used\ to weight the individual species binomial scores (this latter\ weighting scheme was found to closely match 4D weights).
\\ The negative log of these P-values was then averaged across all\ human-referenced pairwise combinations, and the highest scoring overlapping\ 25 bp window for each base was the resulting score. This track shows the\ plotting of a ranked percentile score normalized between 0 and 1 across all\ ENCODE regions, such that the top 5% most conserved sequence across all ENCODE\ regions have a score of 0.95 or greater (top 10% have a score of 0.9 or\ greater, and so on).
\\ BinCons scores were normalized to represent a percentile to the power of\ 10. For example, scores representing the top 1 percent most conserved\ sequence, 99th percentile, have a score greater than or equal to 0.99^10\ = 0.904. Transforming scores to the power of 10 was done for visual\ purposes only, in order to accentuate and distinguish the peaks of more\ highly conserved regions.
\\ More details on binCons can be found in Margulies et. al. (2003)\ cited below.
\ \\ The phastCons program predicts conserved elements and produces base-by-base\ conservation scores using a two-state phylogenetic hidden Markov model.\ The model consists of a state for conserved regions and a\ state for nonconserved regions, each of which is associated with a\ phylogenetic model. These two models are identical\ except that the branch lengths of the conserved phylogeny are\ multiplied by a scaling parameter rho (0 < rho < 1).
\\ For determining the conservation for the ENCODE MLAGAN\ alignments, the nonconserved model was estimated\ from four-fold degenerate coding sites within the ENCODE regions using\ the program phyloFit. The parameter rho was then estimated by\ maximum likelihood, conditional on the nonconserved model, using the EM\ algorithm implemented in phastCons. Parameter estimation was based on\ a single large alignment, constructed by concatenating the\ alignments for all conserved regions.
\\ PhastCons was run with the options --expected-lengths 15 and\ --target-coverage 0.05 to obtain the desired level of\ "smoothing" and a final coverage by conserved elements of 5%.
\\ The conservation score at each base is the posterior probability that the\ base was generated by the conserved state of the phylo-HMM. It can\ be interpreted as the probability that the base is in a conserved\ element, given the assumptions of the model and the estimated parameters.\ Scores range from 0 to 1, with higher scores corresponding to\ higher levels of conservation.
\\ More details on phastCons can be found in Siepel et. al. (2005)\ cited below.
\ \\ The GERP score is the expected substitution rate divided by\ the observed substitution rate at a particular human base.\ Scores are estimated on a column-by-column basis using multiple sequence\ alignments of mammalian genomic DNA generated by MLAGAN.\ The scores range from 0 to 3; those greater than 3 are clipped to 3. \ The expected and observed rates are\ both calculated on a phylogenic tree using the same fixed topology.\ The branch lengths of the expected tree are based on the average\ substitutions at neutral sites. The branch lengths of the observed\ tree, which is calculated separately for each human base, are based on the\ substitutions seen at the column of the\ multiple alignment at that base. Species that have gaps at\ a particular column are not considered in the scoring for that column.
\\ Higher scores correspond to human\ bases in alignment columns with higher degrees of similarity, i.e.\ bases that have evolved slowly, some of which have been under purifying \ selection. The opposite holds true for swiftly evolving (low similarity) \ columns.
\\ Scores are deterministic, given a maximum-likelihood model of \ nucleotide substitution, species topology, neutral tree, and alignment.
\ \\ BinCons was developed by Elliott Margulies of the \ Eric Green lab at \ NHGRI.
\\ PhastCons was developed by Adam Siepel in the \ Haussler lab at UCSC.
\\ GERP was developed primarily by Greg Cooper in the lab of \ Arend Sidow \ at Stanford University\ (Depts of Pathology and Genetics), in close collaboration with \ Eric Stone (Biostatistics, NC State), and George Asimenos and \ Eugene Davydov in the lab of \ Serafim Batzoglou \ (Dept. of Computer Science, Stanford).
\\ The data for this track were generated by Elliott Margulies, with assistance \ from Adam Siepel.
\ \\ Cooper, G.M., Stone, E.A., Asimenos, G., NISC Comparative Sequencing Program, \ Green, E.D., Batzoglou, S. and Sidow, A. \ Distribution and intensity of constraint in mammalian genomic \ sequence. \ Genome Res.. 15(7), 901-13 (2005).
\\ Margulies, E.H., Blanchette, M., NISC Comparative Sequencing Program,\ Haussler, D. and Green, E.D.\ Identification and characterization of multi-species conserved\ sequences.\ Genome Res. 13, 2507-18 (2003).
\\ Siepel, A., Bejerano, G., Pedersen, J.S., Hinrichs, A., Hou, M.,\ Rosenbloom, K., Clawson, H., Spieth, J., Hillier, L.W. et al.\ Evolutionarily conserved elements in vertebrate,\ insect, worm, and yeast genomes.\ Genome Res. 15, 1034-50 (2005).
\ encodeCompGeno 0 autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ group encodeCompGeno\ longLabel Stanford MLAGAN Conservation\ maxHeightPixels 100:25:11\ priority 74.1\ shortLabel MLAGAN Cons\ track encodeMlaganCons\ type wig 0.0 1.0\ visibility hide\ windowingFunction mean\ encodeMlaganElements MLAGAN Elements bed 5 . Stanford MLAGAN Conserved Elements 0 74.2 0 0 0 127 127 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ This track displays multi-species conserved sequences (MCSs) derived from \ binCons, phastCons, and genomic evolutionary rate profiling \ (GERP) conservation \ scoring of human ENCODE genomic DNA alignments to 22 other vertebrates using the \ MLAGAN alignment package. \ The combined-methods subtracks show the union/intersection of conserved elements\ produced by the three conservation methods.\
\ The multiple sequence alignments may be viewed in\ the MLAGAN Alignments track. Another related track, MLAGAN Cons, shows \ the conservation scoring. The descriptions accompanying these tracks detail the \ methods used to create the alignments and conservation.
\ \\ The locations of conserved elements are indicated by blocks in the graphical \ display. This composite annotation track consists of several subtracks that \ show conserved elements derived by the three methods listed above, as well as \ both unions and intersections of the sets of coding and noncoding conserved \ elements. To show only selected subtracks, uncheck the boxes next to the tracks \ you wish to hide. The display may also be filtered to show only those items \ with unnormalized scores that meet or exceed a certain threshhold. To set a \ threshhold, type the minimum score into the text box at the top of the \ description page.
\\ Display characteristics specific to certain subtracks are described in the\ respective Methods sections below.
\ \\ For each ENCODE target, a conservation score threshold was picked to match\ the number of conserved bases predicted by phastCons, an alternative method\ for measuring conservation. This latter method has been found slightly more\ reliable for predicting the expected fraction of conserved sequence \ in each target. Clusters of bases\ that exceeded the given conservation score threshold were designated \ as MCSs. The minimum length of an MCS is 25\ bases. Strict cutoffs were used: if even one base fell below the\ conservation score threshold, it separated an MCS into two distinct\ regions.
\ \\ The predicted MCSs are segments of the alignment that are likely to\ have been "generated" by the conserved state of the phylo-HMM,\ i.e., maximal segments in which the maximum-likelihood (Viterbi)\ path remains in the conserved state.
\ \\ GERP constrained elements exhibit significant evidence of the effects of \ purifying selection. \ Elements are scored according to the inferred intensity of purifying selection \ and are measured as "rejected substitutions" (RSs). RSs capture the \ magnitude of difference between the number of "observed" substitutions\ (estimated using maximum likelihood) and the number that would be \ "expected" under a neutral model of evolution. \ The RS is displayed as part of the item name. Items with\ higher RSs are displayed in a darker shade of blue. The score shown on\ the details page, which has been scaled by 300 for display purposes, is \ generally not as accurate as the RS count that is part of the item name.
\\ "Constrained elements" are identified as those groups\ of consecutive human bases that have an observed rate of evolution that is\ smaller than the expected rate. These groups of columns are merged if they\ are less than a few nucleotides apart and are scored according to the sum of\ the site-by-site difference between observed and expected rates (RS).
\\ Permutations of the actual alignments were analyzed, and the "constrained\ elements" identified in these permuted alignments were treated as\ "false positives". Subsequently, an RS threshold was picked such\ that the total length of "false positive" constrained elements\ (identified in the permuted alignments) was less than 5% of the length of\ constrained elements identified in the actual alignment.\ Thus, all annotated constrained elements are significant at better\ than 95% confidence, and the total fraction of the ENCODE regions\ annotated as constrained is 5-7%.
\ \\ These subtracks were produced by creating unions and intersections of the\ constrained element data detected by binCons, phastCons, and GERP on MLAGAN\ alignments. In these annotations, "non-coding" is defined as those \ regions not overlapping with CDS regions in any of the following UCSC gene \ tables: refFlat, knownGene, mgcGenes, vegaGene, or ensGene.
\ \\ BinCons and phastCons MCS data were contributed by Elliott Margulies in the \ Eric Green lab at \ NHGRI, with assistance from Adam Siepel of UCSC.
\\ GERP was developed primarily by Greg Cooper in the lab of \ Arend Sidow \ at Stanford University (Depts of Pathology and Genetics), in close collaboration\ with Eric Stone (Biostatistics, NC State), and George Asimenos and \ Eugene Davydov in the lab of \ Serafim Batzoglou \ (Dept. of Computer Science, Stanford).
\ \\ See the MLAGAN Alignment and MLAGAN Cons tracks for references.
\ encodeCompGeno 1 chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ compositeTrack on\ dataVersion ENCODE June 2005 Freeze\ exonArrows off\ group encodeCompGeno\ longLabel Stanford MLAGAN Conserved Elements\ priority 74.2\ shortLabel MLAGAN Elements\ track encodeMlaganElements\ type bed 5 .\ visibility hide\ sibTxGraph SIB Alt-Splicing altGraphX Alternative Splicing Graph from Swiss Institute of Bioinformatics 0 74.5 0 0 0 127 127 127 0 0 0 http://www.isrec.isb-sib.ch/cgi-bin/tromer/tromergraph2draw.pl?species=H.+sapiens&tromer=$$\ This track shows the graphs constructed by analyzing experimental RNA\ transcripts, and serves as basis for the predicted alternative splicing\ transcripts shown in the SIB Genes track. The blocks represent exons; lines\ indicate introns. The graphical display is drawn such that no exons\ overlap, making alternative events easier to view when the track is in full\ display mode and the resolution is set to approximately gene-level.
\Further information on the graphs can be found on the\ Transcriptome \ Web interface.
\ \\ The splicing graphs were generated using a multi-step pipeline: \
\ The SIB Alternative Splicing Graphs track was produced on the Vital-IT high-performance \ computing platform\ using a computational pipeline developed by Christian Iseli with help from\ colleagues at the Ludwig \ Institute for Cancer\ Research and the Swiss \ Institute of Bioinformatics. It is based on data from NCBI RefSeq and GenBank/EMBL. Our\ thanks to the people running these databases and to the scientists worldwide\ who have made contributions to them.
\ rna 1 group rna\ idInUrlSql select name from sibTxGraph where id=%s\ longLabel Alternative Splicing Graph from Swiss Institute of Bioinformatics\ priority 74.5\ shortLabel SIB Alt-Splicing\ track sibTxGraph\ type altGraphX\ url http://www.isrec.isb-sib.ch/cgi-bin/tromer/tromergraph2draw.pl?species=H.+sapiens&tromer=$$\ urlLabel SIB link:\ visibility hide\ sibAltEvents SIB Alt Events bed 6 . Alt-Splicing, Alternative Promoters, Alternative Poly-A etc from SIB 0 74.6 0 0 0 127 127 127 0 0 0 rna 1 group rna\ longLabel Alt-Splicing, Alternative Promoters, Alternative Poly-A etc from SIB\ priority 74.6\ shortLabel SIB Alt Events\ track sibAltEvents\ type bed 6 .\ visibility hide\ encodeAffyChIpHl60PvalSirt1Hr02 Affy SIRT1 RA 2h wig 0.0 534.54 Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 2hrs) P-Value 0 75 0 225 0 127 240 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,225,0\ longLabel Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 2hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 75\ shortLabel Affy SIRT1 RA 2h\ subGroups factor=SIRT1 time=2h\ track encodeAffyChIpHl60PvalSirt1Hr02\ encodeAffyChIpHl60SitesSirt1Hr02 Affy SIRT1 RA 2h bed 3 . Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 2hrs) Sites 0 76 0 225 0 127 240 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 1 color 0,225,0\ longLabel Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 2hrs) Sites\ parent encodeAffyChIpHl60Sites\ priority 76\ shortLabel Affy SIRT1 RA 2h\ subGroups factor=SIRT1 time=2h\ track encodeAffyChIpHl60SitesSirt1Hr02\ mgcIntrons mgcIntronPicks bed 12 . Introns and Flanking Exons for RACE PCR 0 76 0 0 0 127 127 127 0 0 0 rna 1 group rna\ longLabel Introns and Flanking Exons for RACE PCR\ priority 76\ shortLabel mgcIntronPicks\ track mgcIntrons\ type bed 12 .\ visibility hide\ binMCS95_encode TBA MCSs Nov04 bed 5 . Binomial-based MCSs from ENCODE Nov. 2004 TBA alignments (top 5%) 0 76 0 60 120 127 157 187 1 0 0 encodeCompGeno 1 color 0,60,120\ dataVersion ENCODE June 2005 Freeze\ group encodeCompGeno\ longLabel Binomial-based MCSs from ENCODE Nov. 2004 TBA alignments (top 5%)\ priority 76.0\ shortLabel TBA MCSs Nov04\ track binMCS95_encode\ type bed 5 .\ useScore 1\ visibility hide\ encode_tba TBA Nov04 wigMaf 0.0 10.0 TBA Nov. 2004 Alignments of ENCODE Regions 0 76.1 0 10 100 1 128 0 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeCompGeno 1 altColor 1,128,0\ autoScale Off\ chromosomes chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX\ color 0, 10, 100\ dataVersion ENCODE June 2005 Freeze\ group encodeCompGeno\ longLabel TBA Nov. 2004 Alignments of ENCODE Regions\ maxHeightPixels 100:40:11\ pairwise encode_tba 15\ priority 76.1\ sGroup_mammal platypus\ sGroup_placental rat mouse cow dog armadillo\ sGroup_primate chimp baboon marmoset galago\ sGroup_vertebrate chicken\ shortLabel TBA Nov04\ spanList 1\ speciesGroups primate placental mammal vertebrate\ track encode_tba\ type wigMaf 0.0 10.0\ visibility hide\ wiggle binCons_encode\ yLineOnOff Off\ encodeAffyChIpHl60PvalSirt1Hr08 Affy SIRT1 RA 8h wig 0.0 534.54 Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 8hrs) P-Value 0 77 0 225 0 127 240 127 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX, encodeChip 0 color 0,225,0\ longLabel Affymetrix ChIP/Chip (SIRT1 retinoic acid-treated HL-60, 8hrs) P-Value\ parent encodeAffyChIpHl60Pval\ priority 77\ shortLabel Affy SIRT1 RA 8h\ subGroups factor=SIRT1 time=8h\ track encodeAffyChIpHl60PvalSirt1Hr08\ encode_MSA2_MLAGAN MLAGAN Nov04 wigMaf 0.0 10.0 Stanford MLAGAN Alignments of November 2004 ENCODE MSA Sequences 0 77 0 10 100 1 128 0 0 0 21 chr1,chr10,chr11,chr12,chr13,chr14,chr15,chr16,chr18,chr19,chr2,chr20,chr21,chr22,chr4,chr5,chr6,chr7,chr8,chr9,chrX,\ We have generated a first version of alignments for the ENCODE regions. These alignments are in beta stage; future versions will incorporate newer alignment techniques that are currently under research and development in our group. We think that this early set of alignments will be useful for those who are eager to perform analysis on aligned data. For each region, we have used a human-centric methodology that comprises the following steps:
\In the first step, the sequence of each species is "rearranged" so that it is orthologously collinear, with respect to the human sequence. In other words, the sequence of each species is mapped to the human sequence: First, a human-monotonic map is created based on local similarities between the two sequences (using an algorithm based on Shuffle-LAGAN), and then a new sequence is produced for the species, by glueing together different pieces of the original sequence, according to the mapping. The mapping allows for any rearrangements, such as inversions, translocations, and duplications. Therefore, a new FASTA file is created for each of the species (other than human), containing a sequence that is directly alignable to human using standard global alignment techniques.
In the second step, a multiple global alignment is created for every region using MLAGAN.
Moreover, for each region we report how the sequences have been rearranged, so that people who want to do comparative analysis on the alignment can later map the coordinates of the rearranged sequences back to the original ones. For each region, a subdirectory named "rearrangements" contains a compressed tar archive with .info files. A .info file contains the map between the original and the rearranged sequence of some species. For example, here is the info file for the galago species, region ENr133:
\galago 2 165283 244112 1 58088 0 33 + 0 73 58160\ galago 2 244113 348408 58089 174393 85 0 + 0 58044 174348\ galago 2 353199 357763 174394 180830 0 69 - 0 175801 182237\ galago 2 357764 369080 180831 195593 18 0 + 0 182613 197375\ galago 2 369569 498714 195594 301899 4 0 + 0 197246 303551\
The info file tries to follow the conventions of AVID's draft sequence info file format. The first field contains the species name; the last two fields contain the species' coordinates, and the third and fourth fields contain the human coordinates. For example, in the first line of the example, the part of galago's sequence from the 73rd to the 58,160th base is mapped to the respective part (from 165,283 to 244,112) of the human sequence. The file is always sorted according to human coordinates, since it is a human-monotonic map. Fields number five and six correspond to the coordinates of the rearranged sequence that is created from the map; in the example, the first 58,088 bases of the rearranged galago sequence are copied from positions 73 - 58,160 of the original sequence. The ninth field contains a sign that distinguishes positive strands (+) from negative ones (-). In the example, positions 175,801 to 182,237 are reverse complemented and then put into positions 174,394 to 180,830 in the rearranged sequence. The rest of the fields are nonimportant or irrelevant. Notice that not all of the original sequence is present in the rearranged one; the algorithm may discard parts of the original sequence which could not be mapped to any place of the human sequence.
\The info files have also been drawn into linear plots. Here is the linear plot of the previous example:
\The first grey horizontal line represents the galago sequence, and the second line represents the human sequence. Black arrows are drawn in rearranged regions (showing the direction within the strand) and grey lines cross the two regions to indicate that they were linked. The same info file can also be represented in a pseudo-dotplot, like the following:
\The horizontal axis represents the human sequence, and the vertical axis represents the other species' sequence. A black line is drawn to indicate a rearranged piece, and grey dotted lines indicate its boundaries in the human-monotonic axis. The figure looks like a dotplot, but one should have in mind that it's actually a rearrangement visualization; it does not directly depict any local alignment hits or aligned regions, like usual dotplots do. Also notice that in all plots, the line that represents the sequence of the other species always ends at the position of the furthest rearrangement (rather than the position of the last nucleotide in the sequence).
\Linear plots and dotplots are available for all regions, and they are located in the "rearrangements" subdirectory, in PNG format.
\The actual data location is: http://ai.stanford.edu/~asimenos/beta_encode_Nov_2004/data/
\Finally, notice that we have used "rat" instead of "ratB" in these alignments. Also, our rearrangement algorithm was not fine-tuned for each region, and so it may behave differently from region to region. It seems that a few of the regions (especially the randomly picked ones) contain sequencies from other species that show very weak homology or too many repeated elements. For that, we decided to exclude the marmoset sequence from region ENr132 and the cow sequence from region ENr213. Lastly, in some regions, the original FASTA files of some of the species contained more than one sequencies, usualy from two or more different chromosomes of the species; in this case we concatenated the sequencies into a single one, before feeding them to the rearrangement algorithm. Therefore, in such cases, the MAF and .info coordinates (and the first horizontal line in the linear plot or the vertical axis in the dotplot) refer to the concatenated input file. Thus, extra care should be taken if one needs to map these coordinates back to the original sequences.
\Feel free to download the alignments or browse through some interesting plots! Be sure to email me any comments!
\ \