In modern molecular biology and genetics, the genome is the entirety of an organism's hereditary information. It is encoded either in DNA or, for many types of virus, in RNA. The genome includes both the genes and the non-coding sequences of the DNA/RNA. These databases provide sequenced genomes and related data sets.
Getting started with academic research | Overview of genetics, bioinformatics, and computational biology | Recommended databases | Early literature | Genome databases
The Genome Database provides views for a variety of genomes, complete chromosomes, sequence maps with contigs, and integrated genetic and physical maps. The database is organized in six major organism groups: Archaea, Bacteria, Eukaryotae, Viruses, Viroids, and Plasmids and includes complete chromosomes, organelles and plasmids as well as draft genome assemblies.
Genome Project is a collection of complete and incomplete large-scale sequencing, assembly, annotation, and mapping projects for cellular organisms. The database is organized into organism-specific overviews that function as portals from which all projects in the database pertaining to that organism can be browsed and retrieved.
UniGene provides an organized view of the transcriptome. Each UniGene entry is a set of transcript sequences that appear to come from the same transcription locus (gene or expressed pseudogene), together with information on protein similarities, gene expression, cDNA clone reagents, and genomic location.
The GSS database contains all records found within the Genome Survey Sequence (GSS) division of GenBank. GSS records contain first-pass single-read genomic sequences and rarely include annotated biological features.
The database of genotype and phenotype (dbGaP) stores phenotype and genotype data, as well as the associations between them. Data for dbGaP will include genome-wide association studies, medical sequencing, and molecular diagnostic assays. Summaries of phenotype and genotype data as well as study documents and association analyses (when available) will be found on the public site. Authorized access may be required for downloading coded individual-level phenotypes genotypes and pedigrees.