Tag: nucleic acid sequence


Found 185 sources
Source Match ReputationScore*

CLUSTAL-W Alignment Format


CLUSTAL-W Alignment Format is a simple text-based format, often with a *.aln file extension, used for the input and output of DNA or protein sequences into the Clustal suite of multiple alignment programs.
100%

GenBank


GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA sequences. The complete release notes for the current version of GenBank are available on the NCBI ftp site. A new release is made every two months. G ...
100%

Sequence Alignment Map


The Sequence Alignment/Map (SAM) format is a TAB-delimited text format consisting of a header section, which is optional, and an alignment section.
93%
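As an illustration of the layout described in the SAM entry above, here is a minimal Python sketch that separates the optional header section from the alignment section and splits each alignment line on TABs. It assumes header lines begin with `@` and that each alignment line carries the 11 mandatory SAM fields followed by optional tags. The function name `split_sam` and the `OPTIONAL` key are hypothetical, chosen only for this example; this is not a full parser.

```python
from typing import Dict, Iterable, List, Tuple

# Names of the 11 mandatory SAM alignment columns, in order.
SAM_FIELDS = (
    "QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
    "RNEXT", "PNEXT", "TLEN", "SEQ", "QUAL",
)


def split_sam(lines: Iterable[str]) -> Tuple[List[str], List[Dict[str, object]]]:
    """Split SAM text into header lines and parsed alignment records.

    Values are kept as strings; no type conversion or validation is done.
    """
    header: List[str] = []
    alignments: List[Dict[str, object]] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        if line.startswith("@"):
            header.append(line)  # header section (optional)
            continue
        cols = line.split("\t")  # alignment section is TAB-delimited
        record: Dict[str, object] = dict(zip(SAM_FIELDS, cols[:11]))
        record["OPTIONAL"] = cols[11:]  # hypothetical key for trailing tags
        alignments.append(record)
    return header, alignments


if __name__ == "__main__":
    demo = [
        "@HD\tVN:1.6\tSO:coordinate",
        "r001\t99\tref\t7\t30\t8M\t=\t37\t39\tTTAGATAA\t*\tNM:i:0",
    ]
    hdr, alns = split_sam(demo)
    print(len(hdr), "header line(s);", len(alns), "alignment(s)")
```

For real data, an established library is likely a better choice than hand-rolled splitting; the sketch only shows the two-section, TAB-delimited structure.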

Sequence Read Archive


The Sequence Read Archive (SRA) stores raw sequencing data from the next generation of sequencing platforms. Data submitted to SRA is organized using a metadata model consisting of six objects: study, sample, experiment, run, analysis and submissi ...
91%

FASTA Sequence Format


FASTA format is a text-based format for representing either nucleotide sequences or peptide sequences, in which nucleotides or amino acids are represented using single-letter codes. The format also allows for sequence names and comments to precede th ...
74%
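The FASTA entry above can be made concrete with a short Python sketch. It assumes the common convention that each record starts with a `>` line holding the sequence name and any comment, followed by one or more lines of single-letter codes. The function name `parse_fasta` is hypothetical, and the whole header text is used as the record key for simplicity.

```python
from typing import Dict, Iterable, List, Optional


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Read FASTA text into a {header: sequence} mapping.

    The header is the text after '>' (name plus any comment).
    Sequence lines belonging to one record are concatenated.
    """
    chunks: Dict[str, List[str]] = {}
    current: Optional[str] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            current = line[1:].strip()
            chunks[current] = []
        elif current is not None:
            chunks[current].append(line)
        # sequence lines before the first header are ignored in this sketch
    return {name: "".join(parts) for name, parts in chunks.items()}


if __name__ == "__main__":
    demo = [">seq1 demo nucleotide", "ACGT", "TTGA", ">seq2", "MKV"]
    for name, seq in parse_fasta(demo).items():
        print(name, len(seq))
```

Keying on the full header keeps the sketch short; a fuller reader would probably separate the identifier from the comment and handle duplicate names.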

MEROPS


The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them.
73%

European Nucleotide Archive


The European Nucleotide Archive (ENA) is a globally comprehensive data resource for nucleotide sequence, spanning raw data, alignments and assemblies, functional and taxonomic annotation and rich contextual data relating to sequenced samples and expe ...
72%

Gramene: A curated, open-source, integrated data resource for comparative functional genomics in plants


Gramene's purpose is to provide added value to plant genomics data sets available within the public sector, which will facilitate researchers' ability to understand the plant genomes and take advantage of genomic sequence known in one species for ide ...
72%

NCBI Gene


The Entrez Global Query Cross-Database Search System is a federated search engine, or web portal that allows users to search many discrete health sciences databases at the National Center for Biotechnology Information (NCBI) website. Entrez can effic ...
69%

JASPAR


JASPAR (http://jaspar.genereg.net) is an open-access database of curated, non-redundant transcription factor (TF)-binding profiles stored as position frequency matrices (PFMs) and TF flexible models (TFFMs) for TFs across multiple species in six taxo ...
68%

Insertion Sequence Finder


This database provides a list of insertion sequences (IS) isolated from bacteria and archaea. It is organized into individual files containing their general features (name, size, origin, family, etc.) as well as their DNA and potential protein sequence ...
64%

ConoServer


ConoServer is a database specializing in sequences and structures of peptides expressed by marine cone snails. The database gives access to protein sequences, nucleic acid sequences and structural information on conopeptides. ConoServer's data are fi ...
64%

ImMunoGeneTics Information System


IMGT is a high-quality integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), major histocompatibility complex (MHC) of human and other vertebrate species, and in the immunoglobulin superfamily (Ig ...
60%

BioSamples at the European Bioinformatics Institute


The BioSamples database aggregates sample information for reference samples (e.g. Coriell Cell lines) and samples for which data exist in one of the EBI's assay databases such as ArrayExpress, the European Nucleotide Archive or PRIDE. It provides lin ...
59%

Nucleic Acids Database


The Nucleic Acids Database contains information about experimentally-determined nucleic acids and complex assemblies. NDB can be used to perform searches based on annotations relating to sequence, structure and function, and to download, analyze, and ...
59%

SwissRegulon


The SwissRegulon database contains genome-wide annotations of regulatory sites. The predictions are based on Bayesian probabilistic analysis of a combination of input information including i) experimentally determined binding sites reported in the li ...
56%

NCBI BioSample


The NCBI BioSample database stores submitter-supplied descriptive information, or metadata, about the biological materials from which data stored in NCBI’s primary data archives are derived. NCBI’s archives host data from diverse types of samples fro ...
56%

Yeast Searching for Transcriptional Regulators and Consensus Tracking


YEASTRACT (Yeast Search for Transcriptional Regulators And Consensus Tracking) is a curated repository of more than 48333 regulatory associations between transcription factors (TF) and target genes in Saccharomyces cerevisiae, based on more than 1200 ...
55%

Animal Transcription Factor Database


AnimalTFDB is a comprehensive animal transcription factor database. The resource provides a classification of transcription factors from 50 genomes, from species including Homo sapiens and Caenorhabditis elegans. The database also has information on co-transc ...
55%

HOmo sapiens transcription factor COmprehensive MOdel COllection


HOmo sapiens COmprehensive MOdel COllection (HOCOMOCO) v10 provides transcription factor (TF) binding models for 601 human and 396 mouse TFs. In addition to basic mononucleotide position weight matrices (PWMs), HOCOMOCO provides a set of dinucleotide ...
54%

Universal PBM Resource for Oligonucleotide Binding Evaluation


The UniPROBE (Universal PBM Resource for Oligonucleotide Binding Evaluation) database hosts data generated by universal protein binding microarray (PBM) technology on the in vitro DNA binding specificities of proteins.
54%

Dfam


The Dfam database is an open collection of DNA transposable element sequence alignments, hidden Markov models (HMMs), consensus sequences, and genome annotations. Dfam represents a collection of multiple sequence alignments, each containing a set of r ...
54%

Genome Sequence Archive


GSA is a data repository specialized for archiving raw sequence reads. It supports data generated from a variety of sequencing platforms ranging from Sanger sequencing machines to single-cell sequencing machines and provides data storing and sharing ...
54%

DataBase of Transcriptional Start Sites


This database includes TSS data from adult and embryonic human tissue. DBTSS now contains 491 million TSS tag sequences collected from a total of 20 tissues and 7 cell cultures.
53%

The ribosomal RNA operon copy number database


The ribosomal RNA operon copy number database is a publicly available, curated resource for ribosomal operon (rrn) copy number information for Bacteria and Archaea.
53%

Database of Sequence Tagged Sites


dbSTS is an NCBI resource that contains sequence data for short genomic landmark sequences or Sequence Tagged Sites.
53%

VISTA Enhancer Browser


Despite the known existence of distant-acting cis-regulatory elements in the human genome, only a small fraction of these elements has been identified and experimentally characterized in vivo. This paucity of enhancer collections with defined activit ...
53%

Open Regulatory Annotation


The Open REGulatory ANNOtation database (ORegAnno) is an open database for the curation of known regulatory elements from scientific literature. Annotation is collected from users worldwide for various biological assays and is automatically cross-ref ...
52%

Polymorphism in microRNAs and their TargetSites


PolymiRTS (Polymorphism in microRNAs and their TargetSites) is a database of naturally occurring DNA variations in microRNA (miRNA) seed regions and miRNA target sites. MicroRNAs pair to the transcripts of protein-coding genes and cause translational ...
51%

UniVec


UniVec is a database that can be used to quickly identify segments within nucleic acid sequences which may be of vector origin (vector contamination). In addition to vector sequences, UniVec also contains sequences for those adapters, linkers, and pr ...
51%

A CLAssification of Mobile genetic Elements


ACLAME is a database dedicated to the collection and classification of mobile genetic elements (MGEs) from various sources, comprising all known phage genomes, plasmids and transposons.
50%

Regulatory Element Database for Drosophila


REDfly is a curated collection of known Drosophila transcriptional cis-regulatory modules (CRMs) and transcription factor binding sites (TFBSs). REDfly seeks to include all experimentally verified fly regulatory elements along with their DNA sequence ...
50%

European Mouse Mutant Archive


The European Mouse Mutant Archive (EMMA) is a non-profit repository for the collection, archiving (via cryopreservation) and distribution of relevant mutant strains essential for basic biomedical research. The laboratory mouse is the most important m ...
50%

LncBook


LncBook is a curated knowledgebase of human lncRNAs that features a comprehensive collection of human lncRNAs and systematic curation of lncRNAs by multi-omics data integration, functional annotation and disease association. It integrates multi-omics ...
50%

RNAcentral


RNAcentral is a free, public resource that offers integrated access to a comprehensive and up-to-date set of non-coding RNA sequences provided by a collaborating group of databases representing a broad range of organisms and RNA types.
50%

Genetic Codes


NCBI takes great care to ensure that the translation for each coding sequence (CDS) present in GenBank records is correct. Central to this effort is careful checking on the taxonomy of each record and assignment of the correct genetic code for each o ...
49%

The Arabidopsis Gene Regulatory Information Server


The Arabidopsis Gene Regulatory Information Server (AGRIS) is a new information resource of Arabidopsis promoter sequences, transcription factors and their target genes. AGRIS currently contains two databases, AtcisDB (Arabidopsis thaliana cis-regula ...
48%

Super-Enhancer Archive


SEA (Super-Enhancer Archive) is a web-based comprehensive resource focusing on the collection, storage and online analysis of super-enhancers. It focuses on integrating super-enhancers in multiple species and annotating their potential roles in the r ...
48%

PROkariotIC Database Of Gene-Regulation


PRODORIC is a comprehensive database about gene regulation and gene expression in prokaryotes. It includes a manually curated and unique collection of transcription factor binding sites.
48%

TransmiR


TransmiR is a database for transcription factor-microRNA regulations, which is free for academic usage.
47%

IRESite


The IRESite database presents information about experimentally studied IRES (Internal Ribosome Entry Site) segments. IRES regions are known to attract the eukaryotic ribosomal translation initiation complex and thus promote translation initiation ind ...
47%

cis-Regulatory Element Database


The cisRED database holds conserved sequence motifs identified by genome scale motif discovery, similarity, clustering, co-occurrence and coexpression calculations. Sequence inputs include low-coverage genome sequence data and ENCODE data.
47%

PAZAR


PAZAR is a software framework for the construction and maintenance of regulatory sequence data annotations; a framework which allows multiple boutique databases to function independently within a larger system (or information mall). The goal of PAZAR ...
47%

Gene Transcription Regulation Database


Gene Transcription Regulation Database (GTRD) is a database of transcription factor binding sites (TFBSs) identified by ChIP-seq experiments for human and mouse.
46%

MAPPER-2


This resource provides information primarily on the upstream non-coding sequence data of genes in 3 genomes, which gives insight into transcription factor binding sites (TFBSs). For each transcript, the region scanned extends from 10,000 bp upstre ...
46%

dbSUPER


dbSUPER is the first integrated and interactive database of super-enhancers, which contains 82234 super-enhancers in 102 human and 25 mouse tissue/cell types.
46%

IPD-KIR - Killer-cell Immunoglobulin-like Receptors


The database provides a centralised repository for human KIR sequences. Killer-cell Immunoglobulin-like Receptors (KIR) have been shown to be highly polymorphic at the allelic and haplotypic level. KIRs are members of the immunoglobulin superfamily ( ...
46%

CoryneRegNet 6.0 - Corynebacterial Regulation Network


Corynebacterial Regulation Network is a reference database and analysis platform for corynebacterial transcription factors and gene regulatory networks.
46%

IPD-MHC - Major Histocompatibility Complex Database


The IPD-MHC Database provides a centralised repository for sequences of the Major Histocompatibility Complex (MHC) from a number of different species. Through a number of international collaborations IPD is able to provide the MHC sequences of differ ...
46%

The Improved Database Of Chimeric Transcripts and RNA-Seq Data


The ESTs and mRNAs from GenBank have been used to identify chimeric RNAs of two or more different genes. By analyzing thousands of chimeric ESTs by RNA sequencing, we found that the expression level of chimeric ESTs is generally low and they are high ...
46%

GyDB


Gypsy database of mobile genetic elements
46%

CollecTF


CollecTF is a database of transcription factor binding sites (TFBS) in the Bacteria domain. It aims at becoming a reference, highly-accessed database by relying on its ability to customize navigation and data extraction, its relevance to the communit ...
45%

4DNucleome Data Portal


The 4D Nucleome Data Portal (4DN) hosts data generated by the 4DN Network and other reference nucleomics data sets, and an expanding tool set for open data processing and visualization. It is a platform to search, visualize, and download nucleomics d ...
45%

NRG-CING


Validated NMR structures of proteins and nucleic acids.
45%

RegPrecise


Predicted regulons in prokaryotic genomes
45%

FlyFactorSurvey


Drosophila transcription factor binding specificities determined using the bacterial one-hybrid system
45%

Transcription Factor Class


TFClass is a resource that classifies eukaryotic transcription factors (TFs) according to their DNA-binding domains. Combining information from different resources, manually checking the retrieved mammalian TF sequences and applying extensive phyloge ...
45%

A database for spatially resolved transcriptomes


Spatially resolved transcriptomics providing gene expression profiles with positional information is key to tissue function and fundamental to disease pathology. SpatialDB is the first public database that specifically curates spatially resolved tran ...
45%

Telomerase Database


The Telomerase Database is a Web-based tool for the study of structure, function, and evolution of the telomerase ribonucleoprotein. The objective of this database is to serve the research community by providing a comprehensive compilation of informa ...
45%

APPRIS


Annotates variants with biological data such as protein structural information, functionally important residues, conservation of functional domains and evidence of cross-species conservation.
44%

euL1db, the European database of L1-HS retrotransposon insertions in humans


Retrotransposons, which comprise LINE, SINE and LTR-containing elements, account for almost half of our genome (Fig. 1). They are mobile genetic elements - also known as jumping genes - but only the L1-HS subfamily has retained the ability to jump ...
44%

Genome Variation Map


The Genome Variation Map (GVM) is a public data repository of genome variations, including single nucleotide polymorphisms (SNP) and small insertions and deletions (INDEL), with particular focuses on human as well as cultivated plants and domesticate ...
44%

Saccharomyces cerevisiae Transcription Factor Database


ScerTF is a database of position weight matrices (PWMs) for transcription factors in Saccharomyces species. It identifies a single matrix for each TF that best predicts in vivo data, providing metrics related to the performance of that matrix in accu ...
43%

ChimerDB


ChimerDB is a database of fusion sequences encompassing bioinformatics analysis of mRNA and EST sequences in GenBank, manual collection of literature data and integration with other well-known databases. Fusion transcripts with nonoverlapping ali ...
43%

MethBank


MethBank stores DNA methylome data across a variety of species. MethBank integrates consensus reference methylomes (CRMs) compiled from healthy human samples at different ages, single-base resolution methylomes (SRMs) of both plant and animal species ...
43%

T-psi-C


T-psi-C is a database of tRNA sequences and 3D tRNA structures. The T-psi-C database can be continuously updated by any member of the scientific community.
43%

Tandem Repeats Database


Tandem Repeats Database (TRDB) is a public repository of information on tandem repeats in genomic DNA and contains a variety of tools for their analysis.
43%

DDBJ/ENA/GenBank Feature Table


The GenBank, EMBL, and DDBJ nucleic acid sequence data banks have from their inception used tables of sites and features to describe the roles and locations of higher order sequence domains and elements within the genome of an organism. In February, ...
42%

Annotated regulatory Binding Sites from Orthologous Promoters


ABS: A database of Annotated regulatory Binding Sites from known binding sites identified in promoters of orthologous vertebrate genes.
42%

Short Read Archive eXtensible Markup Language


The SRA data model contains the following objects: Study (information about the sequencing project); Sample (information about the sequenced samples); Experiment (information about the libraries, platform; associated with study, sample(s) and run(s)); Run: ...
41%

WebGeSTer DB


WebGeSTer Database (DB) is the largest compilation of intrinsic terminators of transcription. It comprises >2,200,000 bacterial terminators identified from a total of 2036 chromosomes and 1508 plasmids. The database is the storehouse for algorithm ...
41%

COXPRESdb


Coexpressed genes and networks in human and mouse
41%

FlyTF


FlyTF (v2) is a manually curated catalogue of Drosophila site-specific transcription factors (TFs). It integrates proteins identified as DNA-binding TFs by computational prediction based on structural domain assignments, and experimentally verified T ...
41%

Real-time PCR Data Markup Language


The RDML file format is developed by the RDML consortium (http://www.rdml.org) and can be used free of charge. The RDML file format was created to encourage the exchange, publication, revision and re-analysis of raw qPCR data. The core of an RDML fil ...
41%

Ligand Expo


Ligand Expo is a data resource for finding information about small molecules bound to proteins and nucleic acids. Tools are provided to search the PDB dictionary for chemical components, to identify structure entries containing particular small molec ...
40%

TcoF-DB


Database for Human Transcription Co-Factors
40%

3D-Footprint


Estimates of DNA-binding specificity for protein-DNA complexes in PDB
40%

NGSmethDB


Next-generation sequencing single-cytosine-resolution DNA methylation data
40%

DBASS5/3


Database of Aberrant Splice Sites: sequences flanking cryptic and de novo 3' and 5' splice sites
40%

Hardwood Genomics Project


The Hardwood Genomics Project is a database for expressed genes, genetic markers, genetic linkage maps, and reference populations. It provides lasting genomic and biological resources for the discovery and conservation of genes in hardwood trees for ...
40%

BEI Resource Repository


BEI Resources provides reagents, tools and information for studying Category A, B, and C priority pathogens, emerging infectious disease agents, non-pathogenic microbes and other microbiological materials of relevance to the research community.
40%

ODB - Operon database


ODB (Operon DataBase) is a database of known operons among the many complete genomes. Additionally, putative operons that are conserved in terms of known operons are also provided. The first release of ODB contains about 2000 known operons and 13,000 ...
39%

ASPicDB


ASPicDB is a database designed to provide access to reliable annotations of the alternative splicing pattern of human genes, obtained by ASPic algorithm (Castrignano' et al. 2006), and to the functional annotation of predicted isoforms.
39%

Feature Annotation Location Description Ontology


The Feature Annotation Location Description Ontology (FALDO) describes the positions of annotated features on linear and circular sequences for data resources represented in RDF and/or OWL. FALDO can be used to describe nucleotide features in sequ ...
39%

R-loopDB


R-loopDB is a collection of R-loop forming sequences (RLFS) predicted computationally in the human genome based on a quantitative model of RLFS (QmRLFS). The database additionally includes chromosome coordinates and annotation of many hundred thousand ...
38%

Insect Microsatellite Database


InSatDb, unlike many other microsatellite databases that cater largely to the needs of microsatellites as markers, presents an interactive interface to query information regarding microsatellite characteristics of five fully sequenced insect genomes ...
38%

TassDB


TassDB (TAndem Splice Site DataBase) stores extensive data about alternative splice events at GYNGYN donors and NAGNAG acceptors. These splice events are of subtle nature since they mostly result in the insertion/deletion of a single amino acid or th ...
38%
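The GYNGYN and NAGNAG patterns named in the TassDB entry above are written with IUPAC nucleotide codes (N is any base, Y is C or T). As a rough illustration only, the Python sketch below turns such a motif into a regular expression and reports every occurrence in a sequence, including overlapping ones. The helper names `iupac_to_regex` and `find_motif` are hypothetical, only the six codes needed here are mapped, and a string match says nothing about whether a site is actually used for splicing.

```python
import re
from typing import List, Tuple

# Subset of IUPAC nucleotide codes needed for these motifs.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "[ACGT]",  # any base
    "Y": "[CT]",    # pyrimidine
}


def iupac_to_regex(motif: str) -> re.Pattern:
    """Compile an IUPAC motif into a regex with a capturing lookahead.

    The zero-width lookahead lets finditer report overlapping matches.
    """
    body = "".join(IUPAC[ch] for ch in motif.upper())
    return re.compile(f"(?=({body}))")


def find_motif(sequence: str, motif: str) -> List[Tuple[int, str]]:
    """Return (0-based start, matched text) for each motif occurrence."""
    pattern = iupac_to_regex(motif)
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    print(find_motif("CCAAGCAGTT", "NAGNAG"))
    print(find_motif("AAGTAGTCAA", "GYNGYN"))
```

The lookahead form is used because tandem motifs can overlap one another; a plain pattern would skip matches that start inside an earlier hit.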

GenomeTraFaC


GenomeTraFaC is a database of conserved regulatory elements obtained by systematically analyzing the orthologous set of human and mouse genes. It mainly focuses on all of the high-quality mRNA entries of mouse and human genes in the Reference Sequenc ...
38%

SpliceInfo


The database provides a means of investigating alternative splicing and can be used for identifying alternative splicing - related motifs, such as the exonic splicing enhancer (ESE), the exonic splicing silencer (ESS) and other intronic splicing moti ...
38%

Databases of Orthologous Promoters


DoOP is a database of eukaryotic promoter sequences (upstream regions), aiming to facilitate the recognition of regulatory sites conserved between species. Based on the Arabidopsis thaliana and Homo sapiens genome annotation, this resource is also a ...
38%

TFBSshape


TFBSshape provides DNA shape features for transcription factor binding sites (TFBSs) that, in addition to sequence features, usually in the form of position weight matrices (PWMs), characterize DNA binding specificities of transcription factors (TFs) f ...
38%

MetaSRA


MetaSRA is a database of normalized SRA human sample-specific metadata following a schema inspired by the metadata organization of the ENCODE project. This schema involves mapping samples to terms in biomedical ontologies, labeling each sample with a ...
38%

DNAtraffic


A database for systems biology of DNA dynamics during the cell life.
37%

OGRDB


OGRDB is a curated database of immunoglobulin and T cell receptor sequences inferred from immune receptor repertoires, together with supporting information describing the repertoires from which they were derived. Researchers can submit sequences and ...
37%

EBI patent sequences


Non-redundant databases of patent DNA and protein sequences
36%

Pseudogene


This ontology is about human pseudogenes, extending the existing SO framework to incorporate additional attributes. Relationships between pseudogenes and segmental duplications are defined in this standard. To answer research questions and to annotat ...
36%

INSD sequence record XML


The International Nucleotide Sequence Database Collaboration (INSDC) is a long-standing foundational initiative that operates between DDBJ, EMBL-EBI and NCBI. INSDC covers the spectrum of data from raw reads, through alignments and assemblies, to functional ...
36%

Big Data Nucleic Acid Simulations Database


Atomistic Molecular Dynamics Simulation Trajectories and Analyses of Nucleic Acid Structures. BIGNASim is a complete platform to hold and analyse nucleic acids simulation data, based on two noSQL database engines: Cassandra to hold trajectory data an ...
34%

ENA Sequence XML Schema


ENA Sequence XML Schema is a standardised XML schema for nucleotide sequences. All assembled and annotated sequences must conform to this schema.
34%

Genome Variation Format


The Genome Variation Format (GVF) is a very simple file format for describing sequence alteration features at nucleotide resolution relative to a reference genome.
34%

ENA Sequence Flat File Format


ENA Sequence Flat File Format is a standardised plain text format for nucleotide sequences. This format was previously called the EMBL Sequence Flat File Format.
34%

DDBJ Sequence Read Archive


DDBJ Sequence Read Archive (DRA) is an archive database for output data generated by next-generation sequencing machines including Roche 454 GS System®, Illumina Genome Analyzer®, Applied Biosystems SOLiD® System, and others. DRA is a member of the I ...
32%

Ontology for Genetic Interval


Using BFO (Basic Formal Ontology) as its upper-level ontology, the Ontology for Genetic Interval (OGI) represents a gene as an entity with its 3D shape, topography, and primary DNA sequence as the foundation for its 3D structure. There is no official h ...
32%

DiProDB


Database for dinucleotide properties
32%

PANDIT


PANDIT is a collection of multiple sequence alignments and phylogenetic trees covering many common protein domains. It contains the seed protein sequence alignments from the Pfam-A (curated families) database; nucleotide sequence alignments derived f ...
32%

UTRdb/UTRsite


The 5' and 3' untranslated regions of eukaryotic mRNAs may play a crucial role in the regulation of gene expression controlling mRNA localization, stability and translational efficiency. For this reason we developed UTRdb, a specialized database of 5 ...
32%

DBD


DBD provides transcription factor predictions for more than 150 completely sequenced genomes available for browsing and download. Predictions are based on presence of sequence specific DNA binding domain assignments using hidden Markov models from th ...
32%

SpliceAid-F


A comprehensive knowledge of all the factors involved in splicing, both proteins and RNAs, and of their interaction network is crucial for reaching a better understanding of this process and its functions. A large part of relevant information is buri ...
32%

CEGA


CEGA (Conserved Elements from Genomic Alignments) is a database of conserved vertebrate elements. This database provides access to precomputed sets of conserved sequences from different species and at different levels of the vertebrate phylogeny.
32%

AREsite


AU-rich elements in vertebrate mRNA UTR sequences
32%

IUPAC-IUB Commission on Biochemical Nomenclature - Abbreviations and Symbols for Nucleic Acids, Polynucleotides and their Constituents


The Abbreviations and Symbols for Nucleic Acids, Polynucleotides and their Constituents, created by the IUPAC-IUB Commission on Biochemical Nomenclature, formalizes the naming scheme for simple nucleotides; nucleotide coenzymes and related substance ...
30%

Greglist


G-quadruplex motifs and potentially G-quadruplex regulated genes
30%

GISSD


Group I Intron Sequence and Structure Database
30%

YeTFaSCo


Yeast Transcription Factor binding Site sequence Collection
30%

TESS


TESS (Transcription Element Search System, http://www.cbil.upenn.edu/tess) is a web-based service that searches DNA sequence for transcription factor binding sites. It integrates three databases of transcription factors and binding site models, and p ...
30%

lncRNASNP2


30%

CORG - A database for COmparative Regulatory Genomics


Sequence conservation in non-coding, upstream regions of orthologous genes from man and mouse is likely to reflect common regulatory DNA sites. Motivated by this assumption we have delineated a catalogue of conserved non-coding sequence blocks and pr ...
30%

PRODORIC2


30%

TIGR Plant Transcript Assembly database


The TIGR Plant Transcript Assemblies (TA) database (http://plantta.tigr.org) uses expressed sequences collected from the NCBI GenBank Nucleotide database for the construction of transcript assemblies. The sequences collected include expressed Sequenc ...
30%

Spliceosome Database


30%

TrSDB


Transcription factor database
30%

Hollywood


Exon annotation database
30%

SINEBase


A database of short interspersed elements (SINEs)
30%

Factorbook


Human transcription factor binding data from ChIP-seq
30%

L1Base


Functional annotation and prediction of LINE-1 elements
30%

ARED-Plus


30%

ECgene


Genome annotation for alternative splicing
30%

RetrOryza


With the availability of the complete genomic sequence of rice, the identification and annotation of LTR-Retrotransposons has become a necessity as they comprise an important part of plant genomes (1). RetrOryza is a database that aims at providing t ...
30%

TTSMI


Triplex Target DNA Sites in the human genome
30%

MachiBase


Drosophila melanogaster 5' mRNA transcription start site database
30%

DoriC


oriC regions in bacterial and archaeal genomes
30%

SNP2TFBS


Regulatory SNPs affecting predicted transcription factor binding sites
30%

RetNet


RetNet provides tables of genes and loci causing inherited retinal diseases, such as retinitis pigmentosa, macular degeneration and Usher syndrome, and related information. This information is provided to the research community and other interested i ...
30%

TiProD


TiProD is a database of human promoter sequences for which some functional features are known. It allows a user to query individual promoters and the expression pattern they mediate, gene expression signatures of individual tissues, and to retrieve s ...
30%

GBshape


DNA shape analysis has been established in recent years as an approach that reveals protein-DNA binding specificity determinants beyond nucleotide sequence. GBshape provides DNA shape annotations of entire genomes. The database currently contains annot ...
30%

ExtraTrain


ExtraTrain is a new database for exploring Extragenic and Transcriptional information in prokaryotic organisms. Transcriptional regulation processes are the principal mechanisms of adaptation in prokaryotes. In these processes, the regulatory signals ...
30%

Synthetic Gene Database


The Synthetic Gene Database (http://www.evolvingcode.net/codon/sgdb/index.php) is a resource that has collected together sequence information on synthetic genes (i.e. genes that were designed conceptually, rather than built from an initial, physical ...
30%

OriDB - The DNA Replication Origin Database


OriDB provides a web-based catalogue of confirmed and predicted DNA replication origin sites. At present this is limited to budding yeast (S. cerevisiae). Each proposed or confirmed origin site appears as a record in OriDB, with each record comprisin ...
30%

U12DB


U12-type introns are spliced by the U12-dependent spliceosome and are present in the genomes of many higher eukaryotic lineages including plants, chordates and some invertebrates. The resource described here, the U12 Intron Database (U12DB), aims to ...
30%

RegTransBase


RegTransBase is a manually curated database of regulatory interactions in prokaryotes that captures the knowledge in public scientific literature using a controlled vocabulary. Although several databases describing interactions between regulatory pro ...
30%

PReMod


The PReMod database describes more than 100,000 computationally predicted transcriptional regulatory modules within the human genome. These modules represent the regulatory potential for 229 transcription factor families and are the first genome-wide/ ...
30%

SNPSTR


The SNPSTR database contains SNP-STR/microsatellite compound markers in the five model species for which sufficient SNP information exists in both the NCBI and Ensembl databases. These species are human (Homo sapiens), mouse (Mus musculus), rat ( ...
30%

CMGSDB


Computational models for gene silencing in C. elegans
30%

CTCF Binding Site Database


Experimentally identified and predicted CTCF binding sites
30%

Plant Stress-Responsive Gene Catalog


Stress-responsive genes in various plant species
30%

ProSAS


Protein Structure and Alternative Splicing: effects of alternative splicing events on protein structure
30%

ooTFD


ooTFD (1) is a database of transcription factors maintained in object-oriented and object-relational database systems. There are, at the time of this writing, about 7500 TF binding site entries in this database, from both prokaryotic and eukaryotic ...
30%

ACTIVITY


ACTIVITY, a database on DNA site sequences with known activity magnitudes, measurement systems and sequence-activity relationships under fixed experimental conditions is additionally adapted to applications to the phylogenetic footprints of known sit ...
30%

SELEXdb


SELEX_DB is an online resource containing both the experimental data on in vitro selected DNA/RNA oligomers (aptamers) and the applets for these oligomers recognition. In vitro selection of oligomers binding target proteins is a novel technology inte ...
30%

PlantProm


Plant promoter sequences
30%

SCPD - Saccharomyces cerevisiae promoter database


A database of yeast promoters
30%

SpliceNest


A tool for visualizing splicing of genes from EST data
30%

Plant repeat database


Repetitive sequences in plant genomes
30%

SKY/M-FISH and CGH


The NCI and NCBI SKY/M-FISH and CGH Database is a repository of publicly submitted data from Spectral Karyotyping (SKY), Multiplex Fluorescence In Situ Hybridization (M-FISH), and Comparative Genomic Hybridization (CGH), which are complementary fluor ...
30%

EDAS - EST-Derived Alternative Splicing Database


EDAS is a database of alternative splicing derived from the analysis of genomic, protein, mRNA and EST data. It provides classification of elementary alternatives into main types, combined searches for specific alternative variants over tissues and d ...
30%

Ciliate IES-MDS database


Macro- and micronuclear genes in spirotrichous ciliates
30%

NPRD - Nucleosome Positioning Region Database


Nucleosome positioning region database
30%

TRACTOR db


Experimental data on the Escherichia coli transcriptional regulatory system has been used in the past years to predict new regulatory elements (promoters, Transcription Factors (TFs), TFs' binding sites, operons) within its genome. As more genomes of ...
30%

TRED - Transcriptional Regulatory Element Database


Transcriptional regulatory element database
30%

HTPSELEX


Transcription factor binding site sequences obtained using high-throughput SELEX method
30%

STIFDB2


Various genes get upregulated in plants during adverse environmental conditions, which alter the metabolic functions to mitigate the stress effects for adaptation. Therefore, it is important to know the regulatory motifs of stress-induced genes for g ...
30%

UCNEbase


A database of ultraconserved non-coding elements and gene regulatory blocks
30%

HEXEvent


Human Exon Splicing Events
30%

ChIPBase


ChIPBase v2.0 is an open database for studying the transcription factor binding sites and motifs, and decoding the transcriptional regulatory networks of lncRNAs, miRNAs, other ncRNAs and protein-coding genes from ChIP-seq data. Our database currentl ...
30%

uORFdb


Upstream ORFs and their effect on translation of downstream CDSs
30%

BloodChIP


Transcription factor binding profiles in human haematopoietic stem/progenitor cells
30%

DPRP


A database of phenotype-specific regulatory programs derived from transcription factor binding data
30%

OnTheFly


DNA-binding specificities of transcription factors in Drosophila
30%

JuncDB


Exon-exon Junction database
30%

MethSMRT


DNA methylation data from single molecule, real-time sequencing
30%

TFBSbank


Transcription factor binding profiles deduced from ChIP-seq or ChIP-chip data
30%

VectorDB


Data available for download from the SGD site; be aware that the data date from 1997
30%

ASPD


ASPD is a new curated database that incorporates data on full-length proteins, protein domains and peptides that were obtained through in vitro directed evolution process (mainly by means of phage display technique). ASPD database is being compiled b ...
30%

S/MARt DB


The nuclear organization of metaphase and interphase cells has been studied over several decades, and increasing evidence supports the concept that eukaryotic chromatin is organized in the form of functionally independent loop domains [1; 2]. ...
30%

GeneNet


The GeneNet system is designed for collection and analysis of the data on gene and metabolic networks, signal transduction pathways, and kinetic characteristics of elementary processes. In the past two years, the GeneNet structure was considerably im ...
30%

TRANSFAC®


The TRANSFAC® database has been constructed to model the interaction of eukaryotic transcription factors with their DNA-binding sites and how this affects gene expression. At its core are the three tables FACTOR, SITE, and GENE. A link between FACTOR ...
30%

TRANSPATH®


TRANSPATH® is a database on signal transduction pathways that are modeled as bipartite graphs with molecules and reactions as node classes [1,2,3,4,5]. The molecule entries include polypeptides, modified forms, multicomponent complexes, high-order ab ...
30%

Yeast Intron Database


This searchable database contains information about the location, structure, and function of spliceosomal introns in the nuclear genome of Saccharomyces cerevisiae. Searches produce reports for each intron satisfying the search criteria, showing key ...
30%

TRANSCompel®


The TRANSCompel® database is devoted to the particular aspect of gene transcriptional regulation [1-7]. It contains information about composite elements - the basic structures of combinatorial gene regulation [7]. Composite regulatory elements consis ...
30%

UgMicroSatdb


UniGene MicroSatellite database: short tandem repeats from various eukaryotic genomes
30%

UTRome


3'UTRs and their functional elements in C. elegans
30%

ECRbase


Evolutionary conservation of DNA sequences provides a tool for the identification of functional elements in genomes. We have created a database of evolutionary conserved regions in vertebrate genomes, entitled ECRbase, which is constructed from a col ...
30%

UCbase and miRfunc


Ultraconserved sequences (UCRs) were first described by Bejerano et al. in 2004. They are highly conserved genome regions that share 100% identity among human, mouse and rat. UCRs are 481 sequences longer than 200 bases. They are frequently located a ...
30%

European Genome-phenome Archive (EGA)


The European Genome–phenome Archive (EGA) is a permanent repository for all types of potentially identifiable genetic and phenotypic data from biomedical research projects. The EGA contains data collected from individuals who have given consen ...
30%

TranspoGene


Influence of transposed elements on the transcriptome of seven vertebrates and invertebrates
30%

*ReputationScore indicates how established a given datasource is.


