sequence sites, features and motifs

Loading, please wait

Source	ReputationScore*
Pfam The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Pfam also generates higher-level groupings of related entries, known as clans. A clan is a collection of Pf ...	100%
PROSITE PROSITE is a database of protein families and domains. PROSITE consists of documentation entries describing protein domains, families and functional sites as well as associated patterns and profiles to identify them.	67%
TIGRFAMs TIGRFAMs is a collection of manually curated protein families focusing primarily on prokaryotic sequences.It consists of hidden Markov models (HMMs), multiple sequence alignments, Gene Ontology (GO) terminology, Enzyme Commission (EC) numbers, gene s ...	52%
PRINTS PRINTS is a collection of groups of conserved protein motifs, called fingerprints, used to define a protein family. A fingerprint is a group of conserved motifs used to characterize a protein family. Usually, the motifs do not overlap, though they ma ...	51%
GlyTouCan The international glycan structure repository for glycans published in the literature. Any glycan structure, ranging in resolution from monosaccharide composition to fully defined structures can be registered and have an accession number assigned as ...	50%
IMGT/LIGM-DB IMGT/LIGM-DB is the IMGT® comprehensive database of immunoglobulin (IG) and T cell receptor (TR) nucleotide sequences, from human and other vertebrate species, with translation for fully annotated sequences, created in 1989 by LIGM (http://www.imgt.o ...	50%
The human DEPhOsphorylation Database DEPOD - the human DEPhOsphorylation Database is a manually curated database collecting human active and inactive phosphatases, their experimentally verified protein and non-protein substrates, and dephosphorylation site information, and pathways in w ...	46%
Major Intrinsic Proteins Modification Database This is a database of comparative protein structure models of the MIP (Major Intrinsic Protein) family of proteins. The MIPs have been identified from the completed genome sequence of organisms available at NCBI.	45%
PHOSIDA Phosphorylation sites in various species identified by mass spectrometry	45%
IRESite The IRESite database presents information about experimentally studied IRES (Internal Ribosome Entry Site) segments. IRES regions are known to attract the eukaryotic ribosomal translation initiation complex and thus promote translation initiation ind ...	45%
cis-Regulatory Element Database The cisRED database holds conserved sequence motifs identified by genome scale motif discovery, similarity, clustering, co-occurrence and coexpression calculations. Sequence inputs include low-coverage genome sequence data and ENCODE data.	44%
VDJdb: a curated database of T-cell receptors with known antigen specificity The primary goal of VDJdb is to facilitate access to existing information on T-cell receptor antigen specificities, i.e. the ability to recognize certain epitopes in certain MHC contexts. Our mission is to both aggregate the scarce TCR specificity in ...	43%
GyDB Gypsy database of mobile genetic elements	43%
Affymetrix NetAffx Analysis Center Allows correlation of GeneChip array results with array design and annotation information; provides access to array content information, including probe sequences and gene annotations; free registration is required.	42%
ParameciumDB ParameciumDB is a new model organism database for Paramecium, built using components of the Generic Model Organism Database (http://www.gmod.org) construction set (Chado relational database schema, Turnkey generic web framework and Gbrowse). The data ...	40%
SpliceAid Experimental RNA target motifs bound by splicing proteins in humans.	37%
RepTar Predicted targets of host and viral miRNAs	37%
TMPD The Tobacco Markers & Primers Database.	34%
RiboDB Fast and easy retrieval of r-protein sequences from publicly available complete prokaryotic genome sequences.	33%
ConsRM ConsRM is a collection and large-scale prediction of the evolutionarily conserved RNA methylation sites, with implications for the functional epitranscriptome.	33%
Conservation Archive A database portal containing esources that can be used to infer baseline species conditions.	32%
TUPDB TUPDB (Target-Unrelated Peptide Data Bank) is a comprehensive database of target-unrelated peptides (TUPs) and TUP motifs. It contains extensive information extracted from research articles and public databases.	31%
Expansin Engineering Database Expansin Engineering Database integrates information on sequence, structure and function of expansins.	31%
PopTargs PopTargs is a database for studying population evolutionary genetics of human microRNA target sites. These are the scripts used to create the MySQL database that is used by PopTargs.essex.ac.uk. The pipeline can be altered to create similar database ...	31%
UPObase an online database of unspecific peroxygenases. UPObase: Unspecific Peroxygenase Database : Homepage. Unspecific Peroxygenase Database (UPObase) is a genome mining pipeline based database which consists of all the sequences of fungal unspecific per ...	31%
Crispi A CRISPR Interactive database.	28%
BMC Caller A webtool to identify and analyze bacterial microcompartment types in sequence data.	28%
LjaFGD LjaFGD is the Lonicera japonica functional genomics database..	28%
ColabFold ColabFold databases are MMseqs2 expandable profile databases to generate diverse multiple sequence alignments to predict protein structures.	28%
ImitateDB ImitateDB is a comprehensive database for information about molecular mimicry candidates represented as DMPs and MMPs for each experimentally validated unique host pathogen protein-protein interaction.	28%
DESSO-DB A web database for sequence and shape motif analyses and identification.	28%
GlycoPathDB A database of monosaccharide biosynthesis pathways. Monosaccharide Biosynthesis Pathways Database. Pathways for monosaccharide biosynthesis. Welcome to MonoPathDB, a database of biosynthesis pathways and enzymes for monosaccharides.	28%
GPCR-SSFE GPCR-Sequence-Structure-Feature-Extractor (SSFE). Provides template suggestions and homology models of Class A GPCRs. Identifies key sequence and structural motifs in Class A GPCRs to guide template selection and build homology models.	28%
B-AMP B-AMP is an Antimicrobial Peptide (AMP) repository for biofilms, consisting of a vast library of 5544 structural AMP models, AMPs annotated to relevant biofilm literature, and protein-peptide interaction models with potential biofilm targets.	28%
CANT-HYD Calgary approach to ANnoTating HYDrocarbon degradation genes (CANT-HYD), a database of 37 HMMs of marker genes involved in anaerobic and aerobic degradation pathways of aliphatic and aromatic hydrocarbons.	28%

*ReputationScore indicates how established a given datasource is. Find out more.

Tag: sequence sites, features and motifs