Tag: genome


Found 208 sources
Source Match ReputationScore*

UCSC Genome Browser database


Genome assemblies and aligned annotations for a wide range of vertebrates and model organisms, along with an integrated tool set for visualizing, comparing, analyzing and sharing both publicly available and user-generated genomic datasets.
100%

Ensembl


Ensembl aims to provide a centralized resource for geneticists, molecular biologists and other researchers studying the genomes of our own species and other vertebrates and model organisms. Ensembl is one of several well-known genome browsers for the ...
99%

Gene Expression Omnibus


The Gene Expression Omnibus (GEO) is a public repository that archives and freely distributes microarray, next-generation sequencing, and other forms of high-throughput functional genomic data submitted by the scientific community. In addition to dat ...
98%

Kyoto Encyclopedia of Genes and Genomes


KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from molecular-level information, especially large-scale molecular datasets generated by geno ...
86%

GenBank


GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. The complete release notes for the current version of GenBank are available on the NCBI ftp site. A new release is made every two months. G ...
80%

Database of Single Nucleotide Polymorphism


dbSNP contains human single nucleotide variations, microsatellites, and small-scale insertions and deletions along with publication, population frequency, molecular consequence, and genomic and RefSeq mapping information for both common variations an ...
73%

The International Genome Sample Resource


The International Genome Sample Resource (IGSR) was established to ensure the ongoing usability of data generated by the 1000 Genomes Project and to extend the data set. The 1000 Genomes Project ran between 2008 and 2015, creating the largest public ...
70%

Minimum Information About a Microarray Experiment


MIAME is intended to specify all the information necessary for an unambiguous interpretation of a microarray experiment, and potentially to reproduce it. MIAME defines the content but not the format for this information.
69%

The European Genome-phenome Archive


The European Genome-phenome Archive (EGA) allows you to explore datasets from genomic studies, provided by a range of data providers. Access to datasets must be approved by the specified Data Access Committee (DAC).
66%

FlyBase


Genetic, genomic and molecular information pertaining to the model organism Drosophila melanogaster and related sequences. This database also contains information relating to human disease models in Drosophila, the use of transgenic constructs contai ...
66%

Saccharomyces Genome Database


The Saccharomyces Genome Database (SGD) collects and organizes information about the molecular biology and genetics of the yeast Saccharomyces cerevisiae. SGD contains a variety of biological information and tools with which to search and analyze it.
64%

WormBase


WormBase is an international consortium of biologists and computer scientists dedicated to providing the research community with accurate, current, accessible information concerning the genetics, genomics and biology of C. elegans and related nematod ...
61%

BRENDA


BRENDA is the main collection of enzyme functional data available to the scientific community.
60%

Eukaryotic Pathogen Database Resources


EuPathDB is an integrated database covering the eukaryotic pathogens. While each of the taxonomic groups within this resource is supported by a taxon-specific database built upon the same infrastructure, the EuPathDB portal offers an entry point to a ...
60%

Integrated resource of protein families, domains and functional sites


InterPro is a resource that provides functional analysis of protein sequences by classifying them into families and predicting the presence of domains and important sites. To classify proteins in this way, InterPro uses predictive models, known as si ...
60%

NCBI Gene


The Entrez Global Query Cross-Database Search System is a federated search engine, or web portal that allows users to search many discrete health sciences databases at the National Center for Biotechnology Information (NCBI) website. Entrez can effic ...
59%

Ensembl Genomes


The Ensembl genome annotation system, developed jointly by EMBL-EBI and the Wellcome Trust Sanger Institute, has been used for the annotation, analysis and display of vertebrate genomes since 2000. Since 2009, the Ensembl site has been complemented b ...
59%

The Zebrafish Information Network


The Zebrafish Information Network, ZFIN, serves as the primary community database resource for the laboratory use of zebrafish. We develop and support integrated zebrafish genetic, genomic, developmental and physiological information and link this in ...
59%

Sequence Ontology


SO is a collaborative ontology project for the definition of sequence features used in biological sequence annotation. The Sequence Ontology is a set of terms and relationships used to describe the features and attributes of biological sequence. SO i ...
58%

European Nucleotide Archive


The European Nucleotide Archive (ENA) is a globally comprehensive data resource for nucleotide sequence, spanning raw data, alignments and assemblies, functional and taxonomic annotation and rich contextual data relating to sequenced samples and expe ...
58%

Phytozome


Phytozome, the Plant Comparative Genomics portal of the Department of Energy's Joint Genome Institute, provides JGI users and the broader plant science community a hub for accessing, visualizing and analyzing JGI-sequenced plant genomes, as well as s ...
57%

Minimum Information about a (Meta)Genome Sequence


MIGS/MIMS (Minimum Information About a (Meta)Genome Sequence) outlines a conceptual structure for extending the core information that has been traditionally captured by the INSDC (DDBJ/EMBL/GenBank) to describe genomic and metagenomic sequences. The ...
56%

Gramene: A curated, open-source, integrated data resource for comparative functional genomics in plants


Gramene's purpose is to provide added value to plant genomics data sets available within the public sector, which will facilitate researchers' ability to understand the plant genomes and take advantage of genomic sequence known in one species for ide ...
56%

Reference Sequence Database


The Reference Sequence (RefSeq) collection aims to provide a comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins.
56%

VectorBase


VectorBase is a web-accessible data repository for information about invertebrate vectors of human pathogens. VectorBase annotates and maintains vector genomes providing an integrated resource for the research community. Currently, VectorBase contain ...
56%

GeneCards: human genes, protein and diseases


GeneCards is a searchable, integrated database of human genes that provides concise genomic, transcriptomic, genetic, proteomic, functional and disease-related information on all known and predicted human genes.
55%

Clusters of Orthologous Groups of Proteins: Phylogenetic classification of proteins encoded in complete genomes


Clusters of Orthologous Groups of proteins (COGs) were delineated by comparing protein sequences encoded in complete genomes, representing major phylogenetic lineages. Each COG consists of individual proteins or groups of paralogs from at least 3 lin ...
55%

Rat Genome Database


The Rat Genome Database is the premier site for genetic, genomic, phenotype, and disease data generated from rat research. It provides easy access to corresponding human and mouse data for cross-species comparison and its comprehensive data and innov ...
55%

Integrated Microbial Genomes And Microbiomes


The Integrated Microbial Genomes (IMG/M) aims to support the annotation, analysis and distribution of microbial genome and microbiome datasets sequenced at DOE's Joint Genome Institute (JGI). It also serves as a community resource for analysis and an ...
55%

SUPERFAMILY


SUPERFAMILY is a database of structural and functional annotation for all proteins and genomes.
54%

Xenopus laevis and tropicalis biology and genomics resource


Xenbase is the model organism database for Xenopus laevis and X. (Silurana) tropicalis. It contains genomic and developmental data and community information for Xenopus research. It includes gene expression patterns that incorporate image data from the li ...
54%

Brassica Genome


This site hosts Brassica genome databases.
54%

MGnify


EBI Metagenomics has changed its name to MGnify to reflect a change in scope. This is a free-to-use resource aiming at supporting all metagenomics researchers. The service is an automated pipeline for the analysis and archiving of metagenomic data th ...
52%

ViralZone


ViralZone is a web resource for viral genes and families, providing detailed molecular and epidemiological information, along with virion and genome figures. Each virus or family page gives easy access to UniProtKB/Swiss-Prot viral protein entries.
52%

Database of genomic structural VARiation


dbVar is a database of genomic structural variation. It accepts data from all species and includes clinical data. It can accept diverse types of events, including inversions, insertions and translocations. Additionally, both germline and somatic vari ...
51%

ImMunoGeneTics Information System


IMGT is a high-quality integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), major histocompatibility complex (MHC) of human and other vertebrate species, and in the immunoglobulin superfamily (Ig ...
51%

Expressed Sequence Tags database


The dbEST contains sequence data and other information on "single-pass" cDNA sequences, or "Expressed Sequence Tags", from a number of organisms. NCBI is in the process of merging EST and GSS records into the Nucleotide database, and the process is e ...
50%

The Autism Chromosome Rearrangement Database


The Autism Chromosome Rearrangement Database is a collection of hand-curated breakpoints and other genomic features related to autism, taken from publicly available literature, databases and unpublished data.
50%

Functional ANnoTation Of the Mammalian genome Database


FANTOM (Functional ANnoTation Of the Mammalian genome) is a worldwide collaborative project aiming at identifying all functional elements in mammalian genomes.
50%

Restriction enzymes and methylases database


A collection of information about restriction enzymes and related proteins. It contains published and unpublished references, recognition and cleavage sites, isoschizomers, commercial availability, methylation sensitivity, crystal, genome, and sequen ...
50%

EcoCyc E. coli Database


EcoCyc is a model organism database for Escherichia coli K-12 MG1655. EcoCyc curation captures literature-based information on the functions of individual E. coli gene products, metabolic pathways, and regulation of E. coli gene expression. EcoCyc ha ...
49%

PLAZA


PLAZA is a platform for comparative, evolutionary, and functional genomics. The platform consists of multiple instances, where each instance contains additional genomes, improved genome annotations, new software tools, etc.
48%

TriTrypDB


TriTrypDB is one of the databases that can be accessed through the EuPathDB (http://EuPathDB.org; formerly ApiDB) portal, covering eukaryotic pathogens of the genera Cryptosporidium, Giardia, Leishmania, Neospora, Plasmodium, Toxoplasma, Trichomonas ...
48%

Homologene


HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
48%

Maize Genetics and Genomics Database


MaizeGDB is the maize research community's central repository for genetics and genomics information.
47%

SoyBase


SoyBase, the USDA-ARS soybean genetic database, is a comprehensive repository for professionally curated genetics, genomics and related data resources for soybean. SoyBase contains the most current genetic, physical and genomic sequence maps integrat ...
47%

PlasmoDB


PlasmoDB is a genome database for the genus Plasmodium, a set of single-celled eukaryotic pathogens that cause human and animal diseases, including malaria.
47%

Group II introns database


Database for identification and cataloguing of group II introns. All bacterial introns listed are full-length and appear to be functional, based on intron RNA and IEP characteristics. The database names the full-length introns, and provides informati ...
47%

DNA Data Bank of Japan


Annotated collection of all publicly available nucleotide and protein sequences. DDBJ collects sequence data mainly from Japanese researchers, as well as from researchers in other countries. DDBJ is part of the International Nucleotide Sequence Databa ...
47%

Genome Database for Rosaceae


The Genome Database for Rosaceae (GDR) is a curated and integrated web-based relational database providing centralized access to Rosaceae genomics and genetics data and analysis tools to facilitate cross-species utilization of data.
46%

Aspergillus Genome Database


The Aspergillus Genome Database is a resource for genomic sequence data as well as gene and protein information for Aspergilli. This publicly available repository is a central point of access to genome, transcriptome and polymorphism data for the fun ...
46%

Database of Sequence Tagged Sites


dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites.
46%

Sol Genomics Network


The Sol Genomics Network (SGN) is a database and website dedicated to the genomic information of the Solanaceae family, which includes species such as tomato, potato, pepper, petunia and eggplant.
46%

Arabidopsis Information Portal


The Arabidopsis Information Portal (Araport) is an open-access online community resource for Arabidopsis research. Araport enables biologists to navigate from the Arabidopsis thaliana Col-0 reference genome sequence to its associated annotation inclu ...
46%

Minimum Information about any (x) Sequence


The minimum information about any (x) sequence (MIxS) is an overarching framework of sequence metadata, that includes technology-specific checklists from the previous MIGS and MIMS standards, provides a way of introducing additional checklists such a ...
46%

Magnaporthe grisea Database


The Magnaporthe comparative genomics database provides access to multiple fungal genomes from the Magnaporthaceae family to facilitate comparative analysis. The project is a partnership between the International Rice Blast Genome Consortium, an ...
46%

Genome Properties


Genome properties is an annotation system whereby functional attributes can be assigned to a genome, based on the presence of a defined set of protein signatures within that genome. This is a reimplementation at EMBL-EBI of a resource previously host ...
46%

Influenza Virus Resource


Influenza Virus Resource presents data obtained from the NIAID Influenza Genome Sequencing Project as well as from GenBank, combined with tools for flu sequence analysis, annotation and submission to GenBank. In addition, it provides links to other r ...
45%

Candida Genome Database


The Candida Genome Database (CGD) provides access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans. It collects gene names and aliases, and assigns gene ontology term ...
45%

The ribosomal RNA operon copy number database


The ribosomal RNA operon copy number database is a publicly available, curated resource for ribosomal operon (rrn) copy number information for Bacteria and Archaea.
45%

MiST 3.0: Microbial Signal Transduction database


Bacteria and archaea employ dedicated signal transduction systems that modulate gene expression, second-messenger turnover, quorum sensing, biofilm formation, motility, host-pathogen and beneficial interactions. The updated Microbial Signal Transduct ...
45%

Variant Call Format


Variant Call Format (VCF) is a text file format, often stored in compressed form. It contains meta-information lines, a header line, and then data lines, each containing information about a position in the genome.
45%
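
The three-part layout described for VCF (meta-information lines, one header line, then data lines) can be illustrated with a minimal Python sketch. It assumes the usual VCF conventions that meta-information lines start with `##`, the header line starts with a single `#`, and fields are tab-separated. The function names `parse_vcf` and `open_vcf` and the sample record are hypothetical, invented here for illustration only. This is not a full or validating parser.

```python
import gzip
import io


def parse_vcf(lines):
    """Split VCF text lines into meta-information, header columns, and data records."""
    meta, header, records = [], None, []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line:
            continue  # skip blank lines
        if line.startswith("##"):
            # Meta-information line, e.g. "##fileformat=VCFv4.2"
            meta.append(line[2:])
        elif line.startswith("#"):
            # Header line naming the tab-separated columns
            header = line[1:].split("\t")
        else:
            if header is None:
                raise ValueError("data line found before the header line")
            # Data line: one genome position per line, keyed by column name
            records.append(dict(zip(header, line.split("\t"))))
    return meta, header, records


def open_vcf(path):
    """Open a plain or gzip-compressed VCF file as text (hypothetical helper)."""
    return gzip.open(path, "rt") if path.endswith(".gz") else open(path, "rt")


# Invented sample content, used only to exercise the parser.
SAMPLE = (
    "##fileformat=VCFv4.2\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n"
    "1\t12345\t.\tA\tG\t50\tPASS\tDP=10\n"
)

meta, header, records = parse_vcf(io.StringIO(SAMPLE))
```

Here `records[0]["POS"]` holds the position field of the first data line as a string. Real files carry many more meta-information lines and optional per-sample columns, which this sketch leaves uninterpreted.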

PhylomeDB


PhylomeDB is a public database for complete catalogs of gene phylogenies (phylomes). Researchers are able to use this resource to visualise the history of genes with the available phylogenetic trees and multiple sequence alignments.
44%

GWIPS-viz


GWIPS-viz is a freely available on-line genome browser which provides pre-populated ribosome profiling (Ribo-seq) and mRNA-seq tracks from published studies.
44%

Genome Sequence Archive


GSA is a data repository specialized for archiving raw sequence reads. It supports data generated from a variety of sequencing platforms ranging from Sanger sequencing machines to single-cell sequencing machines and provides data storing and sharing ...
44%

MGI Mouse Gene Expression Database


The Gene Expression Database (GXD) is a community resource for gene expression information from the laboratory mouse. GXD stores and integrates different types of expression data and makes these data freely available in formats appropriate for compre ...
44%

Toxoplasma Genomics Resource


ToxoDB is a genome database for the genus Toxoplasma, a set of single-celled eukaryotic pathogens that cause human and animal diseases, including toxoplasmosis.
44%

UniVec


UniVec is a database that can be used to quickly identify segments within nucleic acid sequences which may be of vector origin (vector contamination). In addition to vector sequences, UniVec also contains sequences for those adapters, linkers, and pr ...
44%

dictyBase


dictyBase is a single-access database for the complete genome sequence and expression data of four Dictyostelid species providing information on research, genome and annotations. There is also a repository of plasmids and strains held at the Dicty St ...
44%

BIG Data Center


The BIG Data Center at Beijing Institute of Genomics (BIG) of the Chinese Academy of Sciences provides a suite of database resources in support of worldwide research activities in both academia and industry. With the vast amounts of multi-omics data ...
43%

GeneDB


GeneDB is a genome database for prokaryotic and eukaryotic organisms and provides a portal through which data generated by the "Pathogen Genomics" group at the Wellcome Trust Sanger Institute and other collaborating sequencing centres can be accessed ...
43%

TAIR annotation data Format


At TAIR, we display Gene Ontology and Plant Ontology annotations made by TAIR curators and those made by the community including individual researchers and contributors to the GO Consortium. The GO annotations in TAIR are made using a combination of ...
43%

A CLAssification of Mobile genetic Elements


ACLAME is a database dedicated to the collection and classification of mobile genetic elements (MGEs) from various sources, comprising all known phage genomes, plasmids and transposons.
43%

NCBI Viral Genomes Resource


NCBI Viral Genomes Resource is a collection of virus genomic sequences that provides curated sequence data, related information and tools. It includes all complete viral genome sequences deposited in the International Nucleotide Sequence Database Col ...
43%

Ensembl Fungi


Ensembl Fungi is a browser for fungal genomes. A majority of these are taken from the databases of the International Nucleotide Sequence Database Collaboration (the European Nucleotide Archive at the EBI, GenBank at the NCBI, and the DNA Database of ...
43%

Ensembl Plants


Ensembl Plants holds the genomes of plants of significant interest. These include plants of agricultural importance, plants that support primary research, and plants of environmental interest. Ensembl Plants datasets are constructed in a direct collaborati ...
43%

Gene3D


Gene3D takes CATH domain families (from PDB structures) and assigns them to the millions of protein sequences with no PDB structures, using hidden Markov models generated from HMMER.
43%

GiardiaDB


A detailed study of Giardia lamblia's genome will provide insights into an early evolutionary stage of eukaryotic chromosome organization as well as other aspects of the prokaryotic / eukaryotic divergence.
42%

Dfam


The Dfam database is an open collection of DNA Transposable Element sequence alignments, hidden Markov Models (HMMs), consensus sequences, and genome annotations. Dfam represents a collection of multiple sequence alignments, each containing a set of r ...
42%

The Vertebrate Genome Annotation Database


The Vertebrate Genome Annotation (VEGA) database is a central repository for high quality manual annotation of vertebrate finished genome sequence.
42%

An INtegrated Data warehouse of mIcrobial GenOmes


INDIGO enables the integration of annotations for the exploration and analysis of newly sequenced microbial genomes.
42%

Global Genome Biodiversity Network Data Standard


The GGBN Data Standard is a set of terms and controlled vocabularies designed to represent sample facts. It does not cover e.g., scientific name, geography, or physiological facts. This allows combining the GGBN Data Standard with other complementary ...
42%

Human Endogenous Retrovirus database


This database is compiled from the human genome nucleotide sequences obtained mostly in the Human Genome Projects. The database makes it possible to continuously improve classification and characterization of retroviral families. The HERV database no ...
42%

Minimum Information About a Microarray Experiment involving Plants


MIAME/Plant is a standard describing which biological details should be captured for describing microarray experiments involving plants. Detailed information is required about biological aspects such as growth conditions, harvesting time or harvested ...
42%

BeetleBase


BeetleBase is a community resource for Tribolium genetics, genomics and developmental biology. The database is built on the Chado generic data model, and is able to store various types of data, ranging from genome sequences to mutant phenotypes.
42%

The Arabidopsis Gene Regulatory Information Server


The Arabidopsis Gene Regulatory Information Server (AGRIS) is a new information resource of Arabidopsis promoter sequences, transcription factors and their target genes. AGRIS currently contains two databases, AtcisDB (Arabidopsis thaliana cis-regula ...
41%

Human Genetic Variation Database


The Human Genetic Variation Database (HGVD) aims to provide a central resource to archive and display Japanese genetic variation and association between the variation and transcription level of genes. The database currently contains genetic variation ...
41%

Cryptosporidium Genomics Resource


CryptoDB is an integrated genomic and functional genomic database for the parasite Cryptosporidium. CryptoDB integrates whole genome sequence and annotation along with experimental data and environmental isolate sequences provided by community resear ...
41%

Network of Cancer Genes


The Network of Cancer Genes (NCG) contains information on duplicability, evolution, protein-protein and microRNA-gene interaction, function, expression and essentiality of cancer genes from manually curated publications. NCG also provides informatio ...
41%

Clinical Interpretation of Variants in Cancer


CIViC is an expert-crowdsourced knowledgebase for Clinical Interpretation of Variants in Cancer describing the therapeutic, prognostic, diagnostic and predisposing relevance of inherited and somatic variants of all types. Realizing precision medicine ...
41%

A Systematic Annotation Package


ASAP is a relational database and web interface developed to store, update and distribute genome sequence data and gene expression data. It was designed to facilitate ongoing community annotation of genomes and to grow with genome projects as they mo ...
41%

Hymenoptera Genome Database


The Hymenoptera Genome Database (HGD) is a genome informatics resource that supports the research of insects of the order Hymenoptera (e.g. bees, wasps, ants). HGD is divided into three main divisions: BeeBase, which hosts bee genomes and the Bee Pes ...
41%

The Chromosome 7 Annotation Project


The objective of this project is to generate the most comprehensive description of human chromosome 7 to facilitate biological discovery, disease gene research and medical genetic applications.
41%

HumanCyc


HumanCyc is a bioinformatics database that describes human metabolic pathways and the human genome. By presenting metabolic pathways as an organizing framework for the human genome, HumanCyc provides the user with an extended dimension for functional ...
41%

DNASU Plasmid Repository


DNASU is a central repository for plasmid clones and collections. Currently we store and distribute over 197,000 plasmids including 75,000 human and mouse plasmids, full genome collections, the protein expression plasmids from the Protein Structure I ...
41%

Escherichia coli strain K12 genome database


The EcoGene database contains updated information about the E. coli K-12 genome and proteome sequences, including extensive gene bibliographies. A major EcoGene focus has been the re-evaluation of translation start sites.
41%

cis-Regulatory Element Database


The cisRED database holds conserved sequence motifs identified by genome scale motif discovery, similarity, clustering, co-occurrence and coexpression calculations. Sequence inputs include low-coverage genome sequence data and ENCODE data.
41%

The Triticeae Toolbox


The Triticeae Toolbox (T3) is a repository for public wheat data generated by the Wheat Coordinated Agricultural Project (Wheat CAP). Funding is provided by the National Institute for Food and Agriculture (NIFA) and the United States Department of Ag ...
41%

Nottingham Arabidopsis Stock Centre Seeds Database


The Nottingham Arabidopsis Stock Centre (NASC) provides seed and information resources to the International Arabidopsis Genome Programme and the wider research community.
41%

TrichDB


TrichDB is one of the databases that can be accessed through the EuPathDB (http://EuPathDB.org; formerly ApiDB) portal, covering eukaryotic pathogens of the genera Cryptosporidium, Giardia, Leishmania, Neospora, Plasmodium, Toxoplasma, Trichomonas an ...
40%

MAPPER-2


This resource provides information primarily on the upstream non-coding sequence data of genes in 3 genomes, which gives insight into transcription factor binding sites (TFBSs). For each transcript, the region scanned extends from 10,000 bp upstre ...
40%

Ascidian Network for In Situ Expression and Embryological Data


Aniseed is a database designed to offer a representation of ascidian embryonic development at the level of the genome (cis-regulatory sequences, spatial gene expression, protein annotation), of the cell (cell shapes, fate, lineage) or of the whole em ...
40%

The DNA Replication Origin Database


This database summarizes our knowledge of replication origins in the budding yeast Saccharomyces cerevisiae. Each proposed origin site has been assigned a Status (Confirmed, Likely, or Dubious) expressing the confidence that the site genuinely corres ...
40%

Plant DNA C-values database


A database containing genome size (C-value) data for all groups of land plants and red, green and brown algae.
40%

Genomic Contextual Data Markup Language


The Genomic Contextual Data Markup Language (GCDML) is a core project of the Genomic Standards Consortium (GSC) that is a reference implementation of the Minimum Information about a Genome Sequence (MIGS/MIMS/MIMARKS), and the extensions the Minimum Inf ...
40%

Banana Genome Hub


The Banana Genome Hub centralises databases of genetic and genomic data for the Musa acuminata crop, and is the official portal for the Musa genome resources.
39%

MicrosporidiaDB


MicrosporidiaDB is one of the databases that can be accessed through the EuPathDB (http://EuPathDB.org; formerly ApiDB) portal, covering eukaryotic pathogens of the genera Cryptosporidium, Giardia, Leishmania, Neospora, Plasmodium, Toxoplasma, Tricho ...
39%

Comprehensive Yeast Genome Database


A major part of this information gets extracted by manual annotation from the yeast literature, and results of the systematic functional analysis projects as well as cross-references to other in-house or external databases (NCBI, PIR, PEDANT, EMBL) p ...
39%

National Microbial Pathogen Data Resource


The NMPDR provided curated annotations in an environment for comparative analysis of genomes and biological subsystems, with an emphasis on the food-borne pathogens Campylobacter, Listeria, Staphylococcus, Streptococcus, and Vibrio; as well as the ST ...
39%

Tomato Functional Genomics Database


The Tomato Functional Genomics Database integrates several prior databases including the Tomato Expression Database and Tomato Metabolite Database, and the Tomato Small RNA Database.
39%

Distributed Sequence Annotation System


The Distributed Annotation System (DAS) defines a communication protocol used to exchange annotations on genomic or protein sequences.
39%

BacMap


BacMap is a picture atlas of annotated bacterial genomes. It is an interactive visual database containing hundreds of fully labeled, zoomable, and searchable maps of bacterial genomes.
39%

Aphid Genomics Database


The Aphid Genome Database's aim is to improve the current pea aphid genome assembly and annotation, and to provide new aphid genome sequences as well as tools for analysis of these genomes.
39%

Genome3D


Genome3D is a resource that provides structural annotation and 3D models of genomes of model organisms such as human, yeast and E. coli. The database can be used to predict protein structures that have not yet been identified. Genome3D uses structural ...
39%

Networks of Functional Coupling of proteins


FunCoup is a framework to infer genome-wide functional couplings in 11 model organisms. Functional coupling, or functional association, is an unspecific form of association that encompasses direct physical interaction but also more general types of d ...
39%

Bovine Genome Database


The Bovine Genome Database project is developed to support the efforts of bovine genomics researchers by providing data mining, genome navigation and annotation tools for the bovine reference genome based on the Hereford cow, L1 Dominette 01449.
38%

GreenPhylDB: A phylogenomic database for plant comparative genomics


GreenPhylDB comprises 37 full genomes from the major phylum of plant evolution. Clustering of these genomes was performed to define a consistent and extensive set of homeomorphic plant families.
38%

Dog Genome SNP Database


Dog Genome SNP Database (DoGSD) is a data container for the variation information of dog/wolf genomes. It was designed and constructed as an SNP detection and visualization tool to provide the research community a useful resource for the study of dog ...
38%

GenoList Genome Browser


GenoList is an integrated environment for comparative exploration of microbial genomes. The current release integrates genome data for over 700 species (Genome Reviews). The query and navigation user interface includes specialized tools for subtracti ...
37%

MouseMine @ MGI


A database of integrated mouse data from MGI, powered by InterMine. MouseMine is a member of InterMOD, a consortium of model organism databases dedicated to making cross-species data analysis easier through ongoing coordination and collaborative system ...
37%

Evolutionary Annotation Database


Evola contains ortholog information of all human genes among vertebrates. Orthologs are a pair of genes in different species that evolved from a common ancestral gene by speciation. In Evola, orthologs were detected by comparative genomics and amino ...
37%

Chlamydiae Database


ChlamDB is a comparative genomics database covering the entire Chlamydiae phylum as well as their closest relatives belonging to the Planctomycetes-Verrucomicrobiae-Chlamydiae (PVC) superphylum. Genomes can be compared, analyzed and retrieved using a ...
37%

MapViewer


The Map Viewer is a tool of Entrez Genomes that provides special browsing capabilities for eukaryotic chromosomes. It allows the user to view and search an organism's complete genome, display chromosome maps, and zoom into progressively greater levels ...
37%

Daphnia Water Flea Genome Database


wFleaBase includes data from all species of the genus, yet the primary species are Daphnia pulex and Daphnia magna, because of the broad set of genomic tools that have already been developed for these animals.
36%

Shanghai Rapeseed Database


Shanghai RAPESEED Database: a resource for functional genomics studies of seed development and fatty acid metabolism of Brassica.
36%

StellaBase


StellaBase is the Nematostella vectensis genomics database.
36%

MycoBrowser tuberculosis


Mycobrowser is a resource that provides both in silico generated and manually reviewed information within databases dedicated to the complete genomes of Mycobacterium tuberculosis, Mycobacterium leprae, Mycobacterium marinum and Mycobacterium smegmat ...
36%

MycoBrowser leprae


Mycobrowser is a resource that provides both in silico generated and manually reviewed information within databases dedicated to the complete genomes of Mycobacterium tuberculosis, Mycobacterium leprae, Mycobacterium marinum and Mycobacterium smegmat ...
36%

MycoBrowser smegmatis


Mycobrowser is a resource that provides both in silico generated and manually reviewed information within databases dedicated to the complete genomes of Mycobacterium tuberculosis, Mycobacterium leprae, Mycobacterium marinum and Mycobacterium smegmat ...
36%

MycoBrowser marinum


Mycobrowser is a resource that provides both in silico generated and manually reviewed information within databases dedicated to the complete genomes of Mycobacterium tuberculosis, Mycobacterium leprae, Mycobacterium marinum and Mycobacterium smegmat ...
36%

Oryzabase


Oryzabase is a comprehensive rice science database established in 2000 by the rice researchers' committee in Japan. Oryzabase consists of five parts: (1) genetic resource stock information, (2) gene dictionary, (3) chromosome maps, (4) mutant ima ...
36%

The Human OligoGenome Resource: A Database for Customized Targeted Resequencing Covering the Human Genome


Oligonucleotides for targeted resequencing of the human genome
36%

Target Pathogen


The Target-Pathogen database is a bioinformatic approach to prioritize drug targets in pathogens. Available genomic data for pathogens has created new opportunities for drug discovery and development, including new species, resistant and multiresista ...
36%

GENI-ACT


GENI-ACT is a resource that allows the research community to collaboratively annotate bacterial genomes. Changes can be suggested to existing genomes and these alterations can be ported back to NCBI Genbank. GENI-ACT also has modules which can be use ...
36%

WheatGenome.info


An integrated database and portal for wheat genome information.
36%

Integrated Genomic Database of Non-Small Cell Lung Cancer


Integrated Genomic Database of Non-Small Cell Lung Cancer.
36%

ORTHOlogous MAmmalian Markers


ORTHOlogous MAmmalian Markers database (OrthoMaM) describes the evolutionary dynamics of orthologous genes in mammalian genomes using a phylogenetic framework.
36%

Genome Biology Ontology Language


To enable interoperability of genome annotations, we have developed the Genome Biology Ontology Language (GBOL) and associated stack (GBOL stack). GBOL is provenance centered and provides a consistent representation of genome derived automated predic ...
36%

SpodoBase


SpodoBase is an integrated database for the genomics of the Lepidoptera Spodoptera frugiperda. It is a publicly available structured database with insect pest sequences which will allow identification of a number of genes and comprehensive cloning of ...
36%

Comparative Fungal Genomics Platform


The CFGP (Comparative Fungal Genomics Platform) was designed for comparative genomics projects with diverse fungal genomes.
35%

Eukaryotic Genes


euGenes provides a common summary of gene and genomic information from eukaryotic organism databases including gene symbol and full name, chromosome, genetic and molecular map information, Gene Ontology (Function/Location/Process) and gene homology, ...
35%

Soybean Knowledge Base (SoyKB)


Soybean Knowledge Base (SoyKB) is a comprehensive all-inclusive web resource developed for soybean translational genomics and molecular breeding. SoyKB stores information about genes/proteins, miRNAs/sRNAs, metabolites, SNPs, plant introduction (PI ...
35%

Influenza Virus Database


IVDB hosts complete genome sequences of influenza A virus generated by BGI and curates all other published influenza virus sequences after expert annotations. IVDB provides a series of tools and viewers for analyzing the viral genomes, genes, genetic ...
35%

Drosophila Species Genomes


The D. melanogaster and eight other eukaryote model genomes, and gene predictions from several groups. Summaries of essential genome statistics include sizes, genes found and predicted, homology among genomes, phylogenetic trees of species, and compa ...
34%

Catalogue of Transmission Genetics in Arabs


The Centre for Arab Genomic Studies (CAGS) initiated the ambitious project to establish the CTGA (Catalogue of Transmission Genetics in Arabs) database for genetic disorders in Arabs with the aim to enlighten the scientific community and the public o ...
34%

YanHuang - YH1 Genome Database


The YH database presents the entire DNA sequence of a Han Chinese individual, as a representative of the Asian population. This genome, named YH, is the start of the YanHuang Project, which aims to sequence 100 Chinese individuals in 3 years. assembled bas ...
34%

TropGENE DB


TropGENE DB is a database that manages genetic and genomic information about tropical crops studied by Cirad. The database is organised into crop specific modules.
34%

Annmap


Annmap is a genome browser that includes mappings between genomic features and Affymetrix microarrays.
34%

Hardwood Genomics Project


The Hardwood Genomics Project is a database for expressed genes, genetic markers, genetic linkage maps, and reference populations. It provides lasting genomic and biological resources for the discovery and conservation of genes in hardwood trees for ...
34%

openSNP


A crowdsourced collection of personal genomics data. Includes SNP genotyping, exome sequencing data, phenotypic annotation and quantified self tracking data.
34%

Predictive Networks


Integration, navigation, visualization, and analysis of gene interaction networks. This record has been marked as "Uncertain" because its homepage no longer exists. If you have any information on the current status of this resource, please contact us ...
34%

proteomics BAM


The proteomics BAM (proBAM) file format is designed for storing and analyzing peptide spectrum matches (PSMs) within the context of the genome. proBAM is built upon the SAM format and its compressed binary version, BAM, with necessary modifications t ...
34%

Chicken Variation Database


The chicken Variation Database (ChickVD) is an integrated information system for storage, retrieval, visualization and analysis of chicken variation data.
34%

Insect Microsatellite Database


InSatDb, unlike many other microsatellite databases that cater largely to the needs of microsatellites as markers, presents an interactive interface to query information regarding microsatellite characteristics of five fully sequenced insect genomes ...
33%

Genetic Epidemiology Simulation Database


GESDB is a platform for sharing simulation data and discussion of simulation techniques for human genetic studies. The database contains simulation scripts, simulated data, and documentation from published manuscripts. The forum provides a platform ...
33%

Drosophila polymorphism database


Drosophila Polymorphism Database is a secondary database designed to provide a collection of all the existing polymorphic sequences in the Drosophila genus. It allows, for the first time, the search for any polymorphic set according to different par ...
33%

Ontology for Genetic Disease Investigations


This ontology is used to model scientific investigation, especially Genome-Wide Association Studies (GWAS), to discover genetic susceptibility factors to disease, such as Diabetes. It models the genetic variants, polymorphisms, statistical measuremen ...
33%

KnowPulse


KnowPulse is a breeder-focused web portal that integrates genetics and genomics of pulse crops with model genomes.
32%

Structural and functional annotation of Arabidopsis thaliana gene and protein families


GeneFarm is a database whose purpose is to store traceable annotations for Arabidopsis nuclear genes and gene products.
32%

Aspergillus Genomes


Aspergillus Genomes is a resource for viewing annotated genes arising from various Aspergillus sequencing and annotation projects.
32%

Transmembrane Helices in Genome Sequences


A web based database of Transmembrane Helices in Genome Sequences.
31%

Pig Genomic Informatics System


The Pig Genomic Informatics System (PigGIS) presents accurate pig gene annotations in all sequenced genomic regions. It integrates various available pig sequence data, including 3.84 million whole-genome shotgun (WGS) reads and 0.7 million Expressed ...
31%

LiceBase


LiceBase is a database for sea lice genomics. LiceBase provides the genome annotation of the Atlantic salmon louse Lepeophtheirus salmonis, a genome browser, BLAST functionality and access to related high-throughput genomics data.
31%

Chickpea Portal


This resource contains genome and gene sequences, features and isolated chromosome alignments, while functional annotation can be searched in GBrowse. Chickpea forms a critical component of the Australian and Indian farming system, offering offer ...
31%

Single Nucleotide Polymorphism Ontology


The SNP Ontology is a domain ontology that provides a formal representation (OWL-DL) of genomic variations. Despite its name, SNP-Ontology is not limited to the representation of SNPs but encompasses genomic variations in a broader sense. The S ...
31%

WheatMine


WheatMine integrates many types of data for Triticum aestivum including gene model, markers, and scaffolds. You can run flexible queries, export results and analyse lists of data.
31%

Life Science Database Archive


If a database is inadequate in terms of its description, unclear with respect to the terms of use, or is not downloadable, it may not be fully used, cited or rightly acknowledged by the (research) communities. This is even true for databases with hig ...
31%

RIKEN Arabidopsis Genome Encyclopedia II


RARGE II provides basic information about the Arabidopsis genome, such as information related to cDNA sequences and transposon insertion mutants. To create it, publicly available information for a total of 66,209 Arabidopsis mutant lines was used, in ...
31%

Cnidarian Evolutionary Genomics Database


CnidBase, the Cnidarian Evolutionary Genomics Database, is a tool for investigating the evolutionary, developmental and ecological factors that affect gene expression and gene function in cnidarians.
30%

Animal Genome Size Database


A comprehensive catalogue of animal genome size data where haploid DNA contents (C-values, in picograms) are currently available for 4972 species (3231 vertebrates and 1741 non-vertebrates) based on 6518 records from 669 published sources.
30%

Genome Variation Format


The Genome Variation Format (GVF) is a very simple file format for describing sequence alteration features at nucleotide resolution relative to a reference genome.
29%

bigWig Track Format


The bigWig format is for display of dense, continuous data that will be displayed in the Genome Browser as a graph. The bigWig files are in an indexed binary format. The main advantage of this format is that only those portions of the file needed to ...
29%

The Track Hub Registry


The Track Hub Registry is a global centralised collection of publicly accessible track hubs. Its goal is to allow third parties to advertise track hubs, and to make it easier for researchers around the world to discover and use track hubs containing ...
29%

Wiggle Track Format


The wiggle (WIG) format is an older format for display of dense, continuous data such as GC percent, probability scores, and transcriptome data. The bigWig format is the recommended format for almost all graphing track needs. For speed and efficiency ...
29%

The MOuse NOnCode Lung database


MONOCLdb is an integrative and interactive database designed to retrieve and visualize annotations and expression profiles of long non-coding RNAs (lncRNAs) expressed in Collaborative Cross (http://compgen.unc.edu/) founder mice in response to respir ...
29%

ENA Sequence Flat File Format


ENA Sequence Flat File Format is a standardised plain text format for nucleotide sequences. This format was previously called the EMBL Sequence Flat File Format.
29%

ENA Sequence XML Schema


ENA Sequence XML Schema is a standardised XML schema for nucleotide sequences. All assembled and annotated sequences must conform to this schema.
29%

Non-Redundant B.subtilis database


This server provides access to the complete genome of Bacillus subtilis. Additional data on gene mapping and codon usage have been added, as well as cross-references with the SWISS-PROT, ENZYME and HOBACGEN databases.
29%

Ontology of Genes and Genomes


OGG is a biological ontology in the area of genes and genomes. OGG uses the Basic Formal Ontology (BFO) as its upper level ontology. This OGG document contains the genes and genomes of a list of selected organisms. Each gene in OGG has over 10 annota ...
28%

Ontology of Genes and Genomes - Mouse


OGG-Mm is the OGG Mus musculus (mouse) subset. The OGG (Ontology of Genes and Genomes) is a formal ontology of genes and genomes of biological organisms. OGG is developed by following OBO Foundry principles and aligning with the BFO top-level ontolog ...
28%

Berkeley Drosophila Genome Project EST database


The goals of the Drosophila Genome Center are to finish the sequence of the euchromatic genome of Drosophila melanogaster to high quality and to generate and maintain biological annotations of this sequence.
28%

CanGEM


Gene copy number changes in cancer
28%

Genome Annotation File version 1


Annotation data is submitted to the GO Consortium in the form of gene association files, or GAFs. This standard lays out the format specification for GAF 1.0.
26%

Genome Annotation File version 2


Annotation is the process of assigning GO terms to gene products. The annotation data in the GO database is contributed by members of the GO Consortium, and the Consortium actively encourages new groups to start contributing annotation. Annotation da ...
26%

Organelle Genome Resource


The organelle genomes are part of the NCBI Reference Sequence (RefSeq) project that provides curated sequence data and related information for the community to use as a standard.
26%

Animal Genome Tracks on GBrowse


Genome track alignments using GBrowse on this site are featured with: (1) Annotated and predicted genes and transcripts; (2) QTL / SNP Association tracks; (3) OMIA genes; (4) Various SNP Chip tracks; (5) Other mapping features or elements that are ava ...
26%

Physical mapping data at Canada's Michael Smith Genome Sciences Centre - Data


FPC Mapping data files from species that have been fingerprinted at Canada's Michael Smith Genome Sciences Centre (BCGSC).
26%

Follicular Lymphoma Genome Data at Canada's Michael Smith Genome Sciences Centre (BCGSC)


Mapping, copy number analysis, sequence and gene expression data generated by the High Resolution Analysis of Follicular Lymphoma Genomes project. The data will be available for 24 patients with follicular lymphoma. All data will be made as widely an ...
26%

Xanthobase


Xanthobase provides information on Xanthomonas oryzae pv oryzae (Xoo), the rice (Oryza sativa) pathogenic bacterium in which genome sequencing has revealed very extensive race differentiation. The whole genome sequence of its native host has also bee ...
26%

NAGRP Blast Center


NAGRP Blast Center aggregates various sequence databases and makes them accessible via its website.
26%

net alignment annotation Format


The net file format is used to describe the axtNet data that underlie the net alignment annotations in the Genome Browser.
26%

Gene Prediction File Format


Gene Prediction File Format (genePred) is a table format commonly used for gene prediction tracks in the Genome Browser. Variations of genePred include standard format, extended format and a format which includes RefSeq genes with gene names.
26%

YeastCyc


YeastCyc is a Pathway/Genome Database of the model eukaryote Saccharomyces cerevisiae S288c. In addition to genomic information, the database contains metabolic pathway, reaction, enzyme, and compound information, which has been manually curated from ...
26%

ENCODE peak information Format


The ENCODE peak information Format is used to provide called regions of signal enrichment based on pooled, normalized (interpreted) data.
26%

Nematode Expression Pattern DataBase


The Kohara lab has been constructing an expression pattern map of the 100 Mb genome of the nematode Caenorhabditis elegans through EST analysis and systematic whole mount in situ hybridization. NEXTDB is the database to integrate all information from ...
26%

DragonDB


The NEW Antirrhinum majus (Snapdragon) genetic and genomic database
26%

DSPR


The Drosophila Synthetic Population Resource (DSPR) consists of a new panel of over 1700 recombinant inbred lines (RILs) of Drosophila melanogaster, derived from two highly recombined synthetic populations, each created by intercrossing a different s ...
26%

Minimum Information about a Stem Cell Experiment


MISCE recommends the standard information required to report a stem cell experiment, covering: study and experiment design, organism characterization, specimen isolation, cell isolation, cellular reprogramming, gene editing, cellular differentiation, ...
26%

.ACE format


The ACE file format is a specification for storing data about genomic contigs. The original ACE format was developed for use with Consed, a program for viewing, editing, and finishing DNA sequence assemblies. ACE files are generated by various assemb ...
26%

Parkinson Disease Mutation Database


The Parkinson disease Mutation Database (PDmutDB) aims at collecting all known mutations in the genes related to Parkinson disease (PD). Mutations are collected from the literature and from presentations at scientific meetings. In addition, mutations ...
26%

C. Elegans Gene Expression


Using serial analysis of gene expression (SAGE) and microarrays, we are examining total mRNA populations in all developmental stages, both in whole worms and in specific cells and tissues. In addition, we are building promoter::GFP constructs to moni ...
26%

National Omics Data Encyclopedia


The National Omics Data Encyclopedia (NODE) is a big data library with complete and integrative data storage, safe and efficiency-guaranteed data management as well as comprehensive and user-friendly data service functions. NODE stores raw sequence dat ...
26%

INTREPID bioinformatics


Intrepid Bioinformatics serves as a community for genetic researchers and scientific programmers who need to achieve meaningful use of their genetic research data – but can’t spend tremendous amounts of time or money in the process. The Intrepid Bioi ...
26%

Gene Product Information Format


The need for a way to represent genes/gene products separately from annotations, as well as the need to use the evidence ontology, has led to the creation of the GPAD (Gene Product Annotation Data) and GPI (Gene Product Information) formats, defined ...
26%

Bigelow National Center for Algae and Microbiota


The NCMA maintains the largest and most diverse collection of publicly available marine algal strains in the world. The algal strains in the collection have been obtained from all over the world, from polar to tropical waters, marine, freshwater, b ...
26%

Alzheimer Disease & Frontotemporal Dementia Mutation Database


The Alzheimer Disease & Frontotemporal Dementia Mutation Database (AD&FTDMDB) aims at collecting all known mutations in the genes related to Alzheimer disease (AD) and frontotemporal dementias (FTD). Mutations are collected from the literature and fr ...
26%

Pathogen Portal


Pathogen Portal is a repository linking to the Bioinformatics Resource Centers (BRCs) sponsored by the National Institute of Allergy and Infectious Diseases (NIAID) and maintained by The Virginia Bioinformatics Institute. The BRCs are providing web-b ...
26%

Australian Drosophila Ecology and Evolution Resource


The Australian Drosophila Ecology and Evolution Resource (ADEER) from the Hoffmann lab and other contributors is a nationally significant life science collection. The Drosophila Clinal Data Collection contains data on populations along the eastern c ...
26%

*ReputationScore indicates how established a given datasource is.



