ENCODE Project at NHGRI    ENCODE Data Coordination Center at UCSC
  Home   -   Terms   -   Help  
Genome Browser
Cell Types
Antibodies
Release Log
Downloads
Contributors
Publications
Data Policy
Pilot Project
Contact Us

  About the ENCODE Data Coordination Center (DCC)

The Encyclopedia of DNA Elements (ENCODE) Consortium is an international collaboration of research groups funded by the National Human Genome Research Institute (NHGRI). The goal of the consortium is to build a comprehensive parts list of the functional elements of the human genome, including elements that act at the protein level (coding genes) and RNA level (non-coding genes), and regulatory elements that control the cells and circumstances in which a gene is active. The discovery and annotation of gene elements is accomplished primarily by sequencing RNA from a diverse range of sources, comparative genomics, integrative bioinformatic methods, and human curation. Regulatory elements are typically investigated through DNA hypersensitivity assays, assays of DNA methylation, and chromatin immunoprecipitation (ChIP) of proteins that interact with DNA, including modified histones and transcription factors, followed by sequencing (ChIP-Seq). The results of ENCODE experiments, collected in the ENCODE DCC database, are displayed on the UCSC Genome Browser. The data can also be downloaded from the ENCODE DCC website in text format.

To access ENCODE data, open the Genome Browser, select the March 2006 assembly of the human genome, and go to your region of interest. ENCODE tracks will be marked with the NHGRI logo . The bulk of the ENCODE data can be found in the Expression and Regulation track groups, with a few in the Mapping, Genes, and Variation groups. Although most participating research groups have provided several tracks, generally only selected data from each research group are displayed by default. Click the hyperlinked name of a particular track to display a page containing configuration options and details about the methods used to generate the data. See the Genome Browser User's Guide for further information about displaying tracks and navigating in the Genome Browser.

Data from the earlier pilot phase of the ENCODE project are available on the May 2004 and March 2006 human assemblies. These datasets are generally available only in the initial ENCODE-targeted regions that covered approximately 1% of the genome. The ENCODE Pilot Project web pages provide convenient browser access to these regions.

Before publishing research that uses ENCODE data, please read the data release policy, which places some restrictions on publication use of data for nine months following the data release.



  News  

23 September 2009 - ENCODE data releases since July 1

During this period a total of 9 new ENCODE tracks were released to the UCSC public server. Functional elements and region characterization in these tracks include:

  • DNaseI hypersensitive sites and hotspots (University of Washington)
  • Regions of DNA methylation, by Methyl-seq and Illumina 27 array (HudsonAlpha Institute)
  • Bi-directional promoters (NHGRI Elnitski lab)
  • Transcriptome profiling by RNA-seq, including single-molecule sequencing (CSHL, Helicos)
  • Expression levels by exon array (University of Washington)
  • Regions of copy number variation (HudsonAlpha Institute)
  • Mappability/uniqueness of nmers (University of Massachusetts, Duke, Broad, Rosetta)

For links to these tracks, see the Release Log and Downloads links listed in the left menu bar.

We would like to thank the contributing ENCODE labs and the the DCC team at UCSC for their efforts completing these tracks.


1 July 2009 - ENCODE data releases for the period April - June 2009

The following ENCODE tracks were released to the ENCODE DCC/UCSC public server during this period: Read more.


13 March 2009 - ENCODE Data Release: Transcription Factor Binding Sites from Yale/UC-Davis/Harvard

We are pleased to announce the release to the ENCODE DCC/UCSC public server of the ENCODE Transcription Factor Binding Sites by ChIP-seq from Yale/UC-Davis/Harvard (Yale TFBS, in the Regulation group). Read more.


  Conditions of Use

The sequence and annotation data displayed in the Genome Browser are freely available for academic, nonprofit, and personal use with the following conditions: