Learn about the ENCODE Project (ENCyclopedia of DNA Elements), an international consortium of researchers who are moving beyond the basic information of the reference genome sequence. Researchers are using many cutting-edge technologies to learn as much as possible about variations, genes, non-coding transcripts, regulatory elements, genome structure, and more, in extensive detail across the entire genome. The ENCODE project is coordinated by the NHGRI. The UCSC Genome Browser is the designated Data Coordination Center (DCC) for the ENCODE project and the official ENCODE data repository. In this tutorial we examine aspects of the ENCODE project and its data types, and explore ways for you to access and learn about the ENCODE data available in the UCSC Genome Browser. This tutorial assumes familiarity with the UCSC Genome Browser software as described in the Introductory and Advanced Topics tutorials. At this time, the material focuses on the human ENCODE data. If you are interested in the fly and worm aspects of the project, please visit the modENCODE site for these model organisms.

Note: A second tutorial goes beyond these foundations and explores ways to interact with ENCODE data in 2012. See ENCODE Data Available through the UCSC Genome Browser.

You will learn:

  • The foundations and background of the ENCODE project
  • Key differences between the pilot phase and the current production phase
  • How to identify ENCODE data in the UCSC Genome Browser and explore the data use policies
  • What types of data are available under ENCODE, and where to find details of the data types and technologies
  • How to interact with the data in the graphical browser, table browser, and by downloading


This tutorial is a part of the tutorial group Human variations. You might find the other tutorials in the group interesting:

GAD: Genetic Association Database: An archived database associating human genes and polymorphisms with diseases

Madeline 2.0: Human pedigree diagram tools

DrugBank: A chemoinformatics and bioinformatics resource

DGV: Database of Genomic Variants: Catalogs and displays structural variation in the human genome

OMIM: Online Mendelian Inheritance in Man (OMIM): A database of human genes, genetic diseases and disorders

CGAP: Characterize the molecular genetic changes that cause a normal cell to become a cancer cell

GeneSNPs: An integrated view of gene structure and SNP variations

NIEHS SNPs: National Institute for Environmental Health Sciences Environmental Genome Project (EGP) SNPs

HapMap: HapMap, a database and analysis resource of human variation

Genetics Home Reference: A collection of data describing the effects of genetic variability on human health and disease

dbGaP: A database of genotypes and phenotypes with extensive variation data and clinical details

SeattleSNPs: Human SNPs in genes

dbSNP: NCBI's SNP database

GeneTests: GeneTests, a current, comprehensive genetic testing resource

This tutorial is a part of the tutorial group UCSC Tutorials. You might find the other tutorials in the group interesting:

UCSC Genome Browser: The Additional Tools: Additional tools at the UCSC Genome Browser

UCSC Genome Browser: Custom Tracks and Table Browser: UCSC Genome Browser advanced topics

UCSC Archaeal Genome Browser: Provides you with many research and analysis tools that can be used to examine the genomes of more than 50 microbial species from the domain archaea.


UCSC Genome Browser: An Introduction: The UCSC Genome Browser Introduction


Genome Databases (euk) : Genomic databases or repositories primarily aimed at eukaryotic organisms. Some may contain prokaryotic data as well.



