Tag: dna sequences


Found 93 sources
Source Match ReputationScore*

UCSC Genome Browser database


Genome assemblies and aligned annotations for a wide range of vertebrates and model organisms, along with an integrated tool set for visualizing, comparing, analyzing and sharing both publicly available and user-generated genomic datasets.
100%

GenBank


GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. The complete release notes for the current version of GenBank are available on the NCBI ftp site. A new release is made every two months. G ...
80%

Sequence Read Archive


The Sequence Read Archive (SRA) stores raw sequencing data from the next generation of sequencing platforms Data submitted to SRA. It is organized using a metadata model consisting of six objects: study, sample, experiment, run, analysis and submissi ...
76%

ARLEQUIN Project Format


Arlequin ver 3.0 is a software package integrating several basic and advanced methods for population genetics data analysis, like the computation of standard genetic diversity indices, the estimation of allele and haplotype frequencies, tests of depa ...
67%

The European Genome-phenome Archive


The European Genome-phenome Archive (EGA) allows you to explore datasets from genomic studies, provided by a range of data providers. Access to datasets must be approved by the specified Data Access Committee (DAC).
66%

The UCSC Archaeal Genome Browser


The UCSC Archaeal Genome Browser is a window on the biology of more than 100 microbial species from the domain Archaea. Basic gene annotation is derived from NCBI Genbank/RefSeq entries, with overlays of sequence conservation across multiple species, ...
66%

European Variation Archive


The European Variation Archive is an open-access archive that accepts submission of, and provides access to, all types of genetic variation data from all species. All users are able to download any dataset, or query our study catalogue via our variat ...
63%

modMine


modMine is an integrated web resource of data and tools to browse and search modENCODE data and experimental details, download results and access the GBrowse genome browser.
63%

MEROPS


The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them.
60%

Ensembl Genomes


The Ensembl genome annotation system, developed jointly by EMBL-EBI and the Wellcome Trust Sanger Institute, has been used for the annotation, analysis and display of vertebrate genomes since 2000. Since 2009, the Ensembl site has been complemented b ...
59%

The Arabidopsis Information Resource


The Arabidopsis Information Resource (TAIR) maintains a database of genetic and molecular biology data for the model higher plant Arabidopsis thaliana.
58%

European Nucleotide Archive


The European Nucleotide Archive (ENA) is a globally comprehensive data resource for nucleotide sequence, spanning raw data, alignments and assemblies, functional and taxonomic annotation and rich contextual data relating to sequenced samples and expe ...
58%

Minimum Information about a (Meta)Genome Sequence


MIGS/MIMS (Minimum Information About a (Meta)Genome Sequence) outlines a conceptual structure for extending the core information that has been traditionally captured by the INSDC (DDBJ/EMBL/Genbank) to describe genomic and metagenomic sequences. The ...
56%

Gramene: A curated, open-source, integrated data resource for comparative functional genomics in plants


Gramene's purpose is to provide added value to plant genomics data sets available within the public sector, which will facilitate researchers' ability to understand the plant genomes and take advantage of genomic sequence known in one species for ide ...
56%

VectorBase


VectorBase is a web-accessible data repository for information about invertebrate vectors of human pathogens. VectorBase annotates and maintains vector genomes providing an integrated resource for the research community. Currently, VectorBase contain ...
56%

Integrated Microbial Genomes And Microbiomes


The Integrated Microbial Genomes (IMG/M) aims to support the annotation, analysis and distribution of microbial genome and microbiome datasets sequenced at DOE's Joint Genome Institute (JGI). It also serves as a community resource for analysis and an ...
55%

MGnify


EBI Metagenomics has changed its name to MGnify to reflect a change in scope. This is a free-to-use resource aiming at supporting all metagenomics researchers. The service is an automated pipeline for the analysis and archiving of metagenomic data th ...
52%

PCR Primer Database for Gene Expression Detection and Quantification


PrimerBank is a public resource for PCR primers. These primers are designed for gene expression detection or quantification (real-time PCR). PrimerBank contains over 306,800 primers covering most known human and mouse genes. There are several ways to ...
52%

PomBase


PomBase is a model organism database that provides organization of and access to scientific data for the fission yeast Schizosaccharomyces pombe. PomBase supports genomic sequence and features, genome-wide datasets and manual literature curation as w ...
52%

Synthetic Biology Open Language


The Synthetic Biology Open Language (SBOL) is a standard used for the in silico representation of genetic designs. SBOL is designed to allow synthetic biologists and genetic engineers to electronically exchange designs, send and receive genetic desig ...
51%

Comprehensive Antibiotic Resistance Database


A bioinformatic database of antimicrobial resistance genes, their products and associated phenotypes.
51%

TDR Targets


Identification and ranking of targets and bioactive compounds against neglected tropical diseases. TDR Targets integrates chemical and genomic information and allows users to prioritize targets and compounds to develop and repurpose new drugs and che ...
50%

Virulence Factor Database


VFDB is an integrated and comprehensive database of virulence factors for bacterial pathogens (also including Chlamydia and Mycoplasma).
49%

Minimum Information about a MARKer gene Sequence


MIMARKS is the metadata reporting standard of the Genomic Standards Consortium that covers marker gene sequences from environmental surveys or individual organisms
49%

Eukaryotic Promoter Database


The Eukaryotic Promoter Database (EPD) provides accurate transcription start site (TSS) information for promoters of 15 model organisms, from human to yeast to the malaria parasite Plasmodium falciparum. While the original database was a manually cur ...
48%

Progenetix - genomic copy number aberrations in cancer


The Progenetix database provides an overview of copy number abnormalities in human cancer from Comparative Genomic Hybridization (CGH) experiments. With 30817 cases from 1016 publications (Oct 2013), Progenetix is the largest curated database for who ...
47%

DNA Data Bank of Japan


Annotated collection of all publicly available nucleotide and protein sequences. DDBJ collects sequence data mainly from Japanese researchers, as well as researchers in any other countries. DDBJ is part of the International Nucleotide Sequence Databa ...
47%

Genome Database for Rosaceae


The Genome Database for Rosaceae (GDR) is a curated and integrated web-based relational database providing centralized access to Rosaceae genomics and genetics data and analysis tools to facilitate cross-species utilization of data.
46%

Database of Sequence Tagged Sites


dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites.
46%

Ensembl Protists


Ensembl Protists holds over 240 genomes of interest covering those involved in disease and of scientific interest. This includes genomes such as Plasmodium falciparum, Dictyostelium discoideum, Phytophthora infestans and Leishmania major. A majority ...
46%

Ensembl Metazoa


Ensembl Metazoa provides access to genomes of metazoans of interest in disease, environmental sciences, agriculture and economic concern. Extensive coverage exists of diptera, nematodes, lepidoptera and hymenoptera.
46%

Minimum Information about any (x) Sequence


The minimum information about any (x) sequence (MIxS) is an overarching framework of sequence metadata, that includes technology-specific checklists from the previous MIGS and MIMS standards, provides a way of introducing additional checklists such a ...
46%

Influenza Virus Resource


Influenza Virus Resource presents data obtained from the NIAID Influenza Genome Sequencing Project as well as from GenBank, combined with tools for flu sequence analysis, annotation and submission to GenBank. In addition, it provides links to other r ...
45%

Candida Genome Database


The Candida Genome Database (CGD) provides access to genomic sequence data and manually curated functional information about genes and proteins of the human pathogen Candida albicans. It collects gene names and aliases, and assigns gene ontology term ...
45%

Genetic and Genomic Information System


GnpIS is a multispecies integrative information system dedicated to plant and fungi pests. It bridges genetic and genomic data, allowing researchers access to both genetic information (e.g. genetic maps, quantitative trait loci, association genetics, ...
45%

Structure Function Linkage Database Archive


Structure Function Linkage Database (SFLD) is a database of enzymes classified by linking sequences to chemical function. A hierachical systems is used to classify enzymes by family or superfamily other category levels include functional domain, subg ...
44%

Rice Genome Annotation Project


This website provides genome sequence from the Nipponbare subspecies of rice and annotation of the 12 rice chromosomes. These data are available through search pages and the Genome Browser that provides an integrated display of annotation data.
44%

Regulatory Element Database for Drosophila


REDfly is a curated collection of known Drosophila transcriptional cis-regulatory modules (CRMs) and transcription factor binding sites (TFBSs). REDfly seeks to include all experimentally verified fly regulatory elements along with their DNA sequence ...
43%

Ensembl Bacteria


Ensembl Bacteria is a browser for bacterial and archaeal genomes. These are taken from the databases of the International Nucleotide Sequence Database Collaboration(the European Nucleotide Archive at the EBI, GenBank at the NCBI, and the DNA Database ...
43%

Ensembl Fungi


Ensembl Fungi is a browser for fungal genomes. A majority of these are taken from the databases of the International Nucleotide Sequence Database Collaboration (the European Nucleotide Archive at the EBI, GenBank at the NCBI, and the DNA Database of ...
43%

Ensembl Plants


Ensembl Plants holds the genomes of plants of significant interest. These range from those of agricultural importance, those which support primary research and of environmental interest. Ensembl Plants datasets are constructed in a direct collaborati ...
43%

Dfam


The Dfam database is a open collection of DNA Transposable Element sequence alignments, hidden Markov Models (HMMs), consensus sequences, and genome annotations. Dfam represents a collection of multiple sequence alignments, each containing a set of r ...
42%

Genetic Codes


NCBI takes great care to ensure that the translation for each coding sequence (CDS) present in GenBank records is correct. Central to this effort is careful checking on the taxonomy of each record and assignment of the correct genetic code for each o ...
42%

BeetleBase


BeetleBase is a community resource for Tribolium genetics, genomics and developmental biology. The database is built on the Chado generic data model, and is able to store various types of data, ranging from genome sequences to mutant phenotypes.
42%

Fungal and Oomycete genomics resource


FungiDB is an integrated genomic and functional genomic database for the kingdom Fungi. The database integrates whole genome sequence and annotation and also includes experimental and environmental isolate sequence data. The database includes compara ...
42%

The Chromosome 7 Annotation Project


The objective of this project is to generate the most comprehensive description of human chromosome 7 to facilitate biological discovery, disease gene research and medical genetic applications.
41%

CoryneRegNet 6.0 - Corynebacterial Regulation Network


Corynebacterial Regulation Network a reference database and analysis platform for corynebacterial transcription factors and gene regulatory networks.
40%

Human Gene and Protein Database


Human Gene and Protein Database (HGPD) presents SDS-PAGE patterns and other informations of human genes and proteins.
40%

Minimal Information About a Phylogenetic Analysis


The MIAPA (minimum information about a phylogenetic analysis) checklist details the list of metadata necessary for researchers to evaluate or reuse a published phylogeny.
39%

Prokaryotic Operon DataBase


The Prokaryotic Operon DataBase (ProOpDB) constitutes one of the most precise and complete repository of operon predictions in our days. Using our novel and highly accurate operon algorithm, we have predicted the operon structures of more than 1,200 ...
39%

BacMap


BacMap is a picture atlas of annotated bacterial genomes. It is an interactive visual database containing hundreds of fully labeled, zoomable, and searchable maps of bacterial genomes.
39%

ProPortal


ProPortal is a database containing genomic, metagenomic, transcriptomic and field data for the marine cyanobacterium Prochlorococcus. They provide a source of cross-referenced data across multiple scales of biological organization—from the genome to ...
39%

The Yeast Metabolome DataBase


The Yeast Metabolome Database (YMDB) is a manually curated database of small molecule metabolites found in or produced by Saccharomyces cerevisiae (also known as Baker’s yeast and Brewer’s yeast). This database covers metabolites described in textboo ...
39%

Minimal Information about a high throughput SEQuencing Experiment


MINSEQE describes the Minimum Information about a high-throughput nucleotide SEQuencing Experiment that is needed to enable the unambiguous interpretation and facilitate reproduction of the results of the experiment. By analogy to the MIAME guideline ...
38%

Generic Feature Format Version 3


The Generic Feature Format Version 3 (GFF3) format was developed after earlier formats, although widely used, became fragmented into multiple incompatible dialects. The GFF3 format addresses the most common extensions to GFF, while preserving backwar ...
38%

Minimal Metagenome Sequence Analysis Standard


A proposed set of minimal standard analyses necessary for proper interpretation of meta-omic data and to allow comparative metagenomics and metatranscriptomics. Please note: We cannot find an up-to-date website for this resource. As such, we have mar ...
38%

Central Aspergillus Data REpository


This project aims to support the international Aspergillus research community by gathering all genomic information regarding this significant genus into one resource - The Central Aspergillus REsource (CADRE). CADRE facilitates visualisation and anal ...
38%

LegumeIP


The LegumeIP 2.0 database hosts large-scale genomics and transcriptomics data and provides integrative bioinformatics tools for the study of gene function and evolution in legumes.
38%

euL1db, the European database of L1-HS retrotransposon insertions in humans


Retrotransposons, which comprises LINE, SINE and LTR-containing elements, accounts for almost half of our genome (Fig. 1). They are mobile genetics elements - also known as jumping genes - but only the L1-HS subfamily has retained the ability to jump ...
37%

CRAM


CRAM is a sequencing read file format that is highly space efficient by using reference-based compression of sequence data and offers both lossless and lossy modes of compression. Building on early proof-of-principle for reference-based compression ( ...
37%

Plant Natural Antisense Transcripts Database


Natural Antisense Transcripts (NATs), a kind of regulatory RNAs, occur prevalently in plant genomes and play significant roles in physiological and/or pathological processes. PlantNATsDB (Plant Natural Antisense Transcripts DataBase) is a platform fo ...
37%

MapViewer


The Map Viewer is a tool of Entrez Genomes that provides special browsing capabilities for eukaryotic chromosomes. It allows the user to view and search an organisms complete genome, display chromosome maps, and zoom into progressively greater levels ...
37%

StellaBase


StellaBase is the Nematostella vectensis genomics database.
36%

GENI-ACT


GENI-ACT is a resource that allows the research community to collaboratively annotate bacterial genomes. Changes can be suggested to existing genomes and these alterations can be ported back to NCBI Genbank. GENI-ACT also has modules which can be use ...
36%

VIRsiRNAdb


VIRsiRNAdb contains information on experimentally validated Viral siRNA/shRNA which target viral genome regions. It provides efficacy information where available, as well as the siRNA sequence, viral target and subtype, as well as the target genomic ...
36%

Human Disease-Related Viral Integration Sites


Dr.VIS collects and locates human disease-related viral integration sites. So far, about 600 sites covering 5 virus organisms and 11 human diseases are available. Integration sites in Dr.VIS are located against chromesome, cytoband, gene and refseq p ...
35%

Short Read Archive eXtensible Markup Language


The SRA data model contains the following objects: Study: information about the sequencing project Sample: information about the sequenced samples Experiment: information about the libraries, platform; associated with study, sample(s) and run(s) Run: ...
35%

Type IV Secretion system Resource


A web-based bacterial type IV secretion system resource for type IV secretion systems (T4SSs) and cognate effectors in bacteria.
35%

MAR databases


The MAR databases is a collection of manually curated marine microbial contextual and sequence databases, based at the Marine Metagenomics Portal. This was developed as a part of the ELIXIR EXCELERATE project in 2017 and is maintained by The Center f ...
35%

Alternative Poly(A) Sites database


APASdb can visualize the precise map and usage quantification of different APA isoforms for all genes. The datasets are deeply profiled by the sequencing alternative polyadenylation sites (SAPAS) method capable of high-throughput sequencing 3'-ends o ...
35%

Prokaryotic Glycoproteins Database


ProGlycProt (Prokaryotic Glycoproteins) is a manually curated, comprehensive repository of experimentally characterized eubacterial and archaeal glycoproteins, generated from an exhaustive literature search. This is the focused beginning of an effort ...
35%

Interrupted coding sequences


ICDS database is a database containing ICDS detected by a similarity-based approach. The definition of each interrupted gene is provided as well as the ICDS genomic localisation with the surrounding sequence.
35%

Oryza Tag Line


Oryza Tag Line consists in a searchable database developed under the Oracle management system integrating phenotypic data resulting from the evaluation of the Genoplante rice insertion line library.
34%

XenMine


XenMine has been created to view, search and analyze Xenopus data, and provides essential information on gene expression changes and regulatory elements present in the genome. It contains published genomic datasets from both Xenopus tropicalis and Xe ...
34%

Enzyme Structure Function Ontology


The ESFO provides a new paradigm for organizing enzyme sequence, structure, and function information, whereby specific elements of enzyme sequence and structure are mapped to specific conserved aspects of function, thus facilitating the functional an ...
33%

OryGenesDB: an interactive tool for rice reverse genetics


The aim of this Oryza sativa database was first to display sequence information such as the T-DNA and Ds flanking sequence tags (FSTs) produced in the framework of the French genomics initiative Genoplante and the EU consortium Cereal Gene Tags. This ...
33%

EcoliWiki: A Wiki-based community resource for Escherichia coli


Community-based resource for the annotation of all non-pathogenic E. coli, its phages, plasmids, and mobile genetic elements.
33%

BEDgraph


The bedGraph format allows display of continuous-valued data in track format. This display type is useful for probability scores and transcriptome data. This track type is similar to the wiggle (WIG) format, but unlike the wiggle format, data exporte ...
32%

Transmembrane Helices in Genome Sequences


A web based database of Transmembrane Helices in Genome Sequences.
31%

Bioconductor


Bioconductor provides tools for the analysis and comprehension of high-throughput genomic data. Bioconductor uses the R statistical programming language, and is open source and open development.
31%

Chickpea Portal


This resource contains genome and gene sequences, features and isolationed chromosome alignments, while functional annotation can be searched in GBrowse. Chickpea forms a critical component of the Australian and Indian farming system, offering offer ...
31%

BCL-2 Database


BCL2DB is a database designed to integrate data on BCL-2 family members and BH3-only proteins.
31%

Acytostelium Gene Database


Genome and transcriptome database of Acytostelium subglobosum
31%

Cnidarian Evolutionary Genomics Database


CnidBase, the Cnidarian Evolutionary Genomics Database, is a tool for investigating the evolutionary, developmental and ecological factors that affect gene expression and gene function in cnidarians.
30%

Bio-Mirror


A world bioinformatic public service for high-speed access to up-to-date DNA & protein biological sequence databanks.
30%

Access to Biological Collection Data DNA extension


ABCDDNA is a theme specific extension for ABCD (Access to Biological Collections Data) created to facilitate storage and exchange of data related to DNA collection units, such as DNA extraction specifics, DNA quality parameters, and data characterisi ...
29%

Multiple Alignment Format


The Multiple Alignment Format stores DNA level multiple alignments in an easily readable format between entire genomes. Unlike previous formats this resource can cope with forward and reverse strand directions, multiple pieces to the alignment, and s ...
29%

Genome Variation Format


The Genome Variation Format (GVF) is a very simple file format for describing sequence alteration features at nucleotide resolution relative to a reference genome.
29%

GenBank Sequence Format


GenBank Sequence Format (GenBank Flat File Format) consists of an annotation section and a sequence section. The start of the annotation section is marked by a line beginning with the word "LOCUS". The start of sequence section is marked by a line be ...
29%

Gene Transfer Format


The Gene transfer format (GTF) is a file format used to hold information about gene structure. It is a tab-delimited text format based on the general feature format (GFF), but contains some additional conventions specific to gene information. A signi ...
29%

Binary sequence information Format


A .2bit file stores multiple DNA sequences (up to 4 Gb total) in a compact randomly-accessible format. The file contains masking information as well as the DNA itself. The DNA sequence is represented as two bits per pixel with associated list of regi ...
26%

.ACE format


The ACE file format is a specification for storing data about genomic contigs. The original ACE format was developed for use with Consed, a program for viewing, editing, and finishing DNA sequence assemblies. ACE files are generated by various assemb ...
26%

National Omics Data Encyclopedia


The National Omics Data Encyclopedia (NODE) is big data library with complete and integrative data storage, safe and efficiency-guaranteed data management as well as comprehensive and user-friendly data service functions. NODE stores raw sequence dat ...
26%

*ReputationScore indicates how established a given datasource is. Find out more.




Need help integrating and/or managing biomedical data?