This site provides a data set based on the February 2009
Homo sapiens high coverage assembly GRCh37 from the
Genome Reference Consortium.
This assembly is used by UCSC to create their hg19 database.
The data set consists of gene models built from the genewise alignments
of the human proteome as well as from alignments of human cDNAs using
the cDNA2genome model of exonerate.
This release of the assembly has the following properties:
It also includes nine haplotypic regions, mainly in the MHC region of chromosome 6.
As the GRC maintains and improves the assembly, patches are being introduced. Patch release three (GRCh37.p3) was included in Ensembl release 62. Currently, assembly patches are of two types:
A preliminary assembly of the Neanderthal (Homo sapiens neanderthalensis) genome is available via the Neanderthal Genome Browser, an Ensembl-powered project based at the Max Planck Institute.