Tag: gene and protein families


Found 95 sources
Source Match ReputationScore*

Human Protein Atlas


The Human Protein Atlas is program started with the aim to map of all the human proteins in cells, tissues and organs using integration of various omics technologies. It consists of three parts: Tissue Atlas showing the distribution of proteins acros ...
100%

Pfam


The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Pfam also generates higher-level groupings of related entries, known as clans. A clan is a collection of Pf ...
98%

Conserved Domain Database


The Conserved Domain Database (CDD) brings together several collections of multiple sequence alignments representing conserved domains, including NCBI-curated domains, which use 3D-structure information to explicitly to define domain boundaries and p ...
79%

Simple Modular Architecture Research Tool


SMART (Simple Modular Architecture Research Tool) is a web resource providing simple identification and extensive annotation of protein domains and the exploration of protein domain architectures. It allows the identification and annotation of geneti ...
75%

Structural Classification Of Proteins


The SCOP database is a curated both manually and with the use of automated tools. This freely available resource aims to provide a comprehensive description of the structural and evolutionary relationships between all proteins whose structure is know ...
70%

MEROPS


The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them.
68%

Integrated resource of protein families, domains and functional sites


InterPro is a resource that provides functional analysis of protein sequences by classifying them into families and predicting the presence of domains and important sites. To classify proteins in this way, InterPro uses predictive models, known as si ...
66%

PROSITE


PROSITE is a database of protein families and domains. PROSITE consists of documentation entries describing protein domains, families and functional sites as well as associated patterns and profiles to identify them.
66%

Transporter Classification Database


This freely accessible database details a comprehensive IUBMB approved classification system for membrane transport proteins known as the Transporter Classification (TC) system. The TC system is analogous to the Enzyme Commission (EC) system for clas ...
64%

Phytozome


Phytozome, the Plant Comparative Genomics portal of the Department of Energy's Joint Genome Institute, provides JGI users and the broader plant science community a hub for accessing, visualizing and analyzing JGI-sequenced plant genomes, as well as s ...
63%

PRotein Ontology


Protein Ontology (PRO) provides an ontological representation of protein-related entities by explicitly defining them and showing the relationships between them. Each PRO term represents a distinct class of entities (including specific modified forms ...
61%

SUPERFAMILY


SUPERFAMILY is a database of structural and functional annotation for all proteins and genomes.
60%

Rfam


The Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs). The families in Rfam break down into three broad functional classes: non-coding RNA genes ...
58%

ProDom


ProDom is a comprehensive set of protein domain families automatically generated from the UniProt Knowledge Database.
56%

UniGene gene-oriented nucleotide sequence clusters


Each UniGene entry is a set of transcript sequences that appear to come from the same transcription locus (gene or expressed pseudogene), together with information on protein similarities, gene expression, cDNA clone reagents, and genomic location.
54%

Group II introns database


Database for identification and cataloguing of group II introns. All bacterial introns listed are full-length and appear to be functional, based on intron RNA and IEP characteristics. The database names the full-length introns, and provides informati ...
52%

Extracellular Matrix Interaction Database


MatrixDB stores experimental data established by full-length proteins, matricryptins, glycosaminoglycans, lipids and cations. MatrixDB reports interactions with individual polypeptide chains or with multimers (e.g. collagens, laminins, thrombospondin ...
52%

PRINTS


PRINTS is a collection of groups of conserved protein motifs, called fingerprints, used to define a protein family. A fingerprint is a group of conserved motifs used to characterize a protein family. Usually, the motifs do not overlap, though they ma ...
51%

Database of Orthologous Groups


OrthoDB presents a catalog of eukaryotic orthologous protein-coding genes. Orthology refers to the last common ancestor of the species under consideration, and thus OrthoDB explicitly delineates orthologs at each radiation along the species phylogeny ...
51%

H-Invitational Database


H-Invitational Database (H-InvDB) is an integrated database of human genes and transcripts. By extensive analyses of all human transcripts, we provide curated annotations of human genes and transcripts that include gene structures, alternative splici ...
51%

TIGRFAMs


TIGRFAMs is a collection of manually curated protein families focusing primarily on prokaryotic sequences.It consists of hidden Markov models (HMMs), multiple sequence alignments, Gene Ontology (GO) terminology, Enzyme Commission (EC) numbers, gene s ...
50%

MiST 3.0: Microbial Signal Transduction database


Bacteria and archaea employ dedicated signal transduction systems that modulate gene expression, second-messenger turnover, quorum sensing, biofilm formation, motility, host-pathogen and beneficial interactions. The updated Microbial Signal Transduct ...
49%

Online GEne Essentiality database


Online GEne Essentiality database
49%

HAMAP database of microbial protein families


HAMAP is a system, based on manual protein annotation, that identifies and semi-automatically annotates proteins that are part of well-conserved families or subfamilies: the HAMAP families. HAMAP is based on manually created family rules and is appli ...
47%

A CLAssification of Mobile genetic Elements


ACLAME is a database dedicated to the collection and classification of mobile genetic elements (MGEs) from various sources, comprising all known phage genomes, plasmids and transposons.
47%

Condel


Method to assess the outcome of non-synonymous SNVs using a CONsensus DELeteriousness score that combines various tools.
47%

ProtoNet


This resource is a hierarchical clustering of UniProt protein sequences into hierarchical trees. This resource allows for the study of sub-family and super-family of a protein, using UniRef50 clusters.
46%

AceView Worm Genome


AceView provides a curated, comprehensive and non-redundant sequence representation of all public mRNA sequences (mRNAs from GenBank or RefSeq, and single pass cDNA sequences from dbEST and Trace). These experimental cDNA sequences are first co-align ...
46%

MicrobesOnline


A suite of Web-based tools for comparative genomics
46%

Bacterial protein tYrosine Kinase database


The Bacterial protein tYrosine Kinase database (BYKdb) contains computer-annotated BY-kinase sequences. The database web interface allows static and dynamic queries and provides integrated analysis tools including sequence annotation.
46%

Database of Arabidopsis Transcription Factors


The Database of Arabidopsis Transcription Factors (DATF) collects all Arabidopsis transcription factors (totally 1922 Loci; 2290 Gene Models) and classifies them into 64 families.The Version 2 of DATF was updated at July 2006. It is based on the Arab ...
46%

Human Endogenous Retrovirus database


This database is compiled from the human genome nucleotide sequences obtained mostly in the Human Genome Projects. The database makes it possible to continuously improve classification and characterization of retroviral families. The HERV database no ...
46%

Major Intrinsic Proteins Modification Database


This is a database of comparative protein structure models of the MIP (Major Intrinsic Protein) family of proteins. The MIPs have been identified from the completed genome sequence of organisms available at NCBI.
45%

PlnTFDB


The Plant Transcription Factor Database is an integrative database that provides putatively complete sets of transcription factors (TFs) and other transcriptional regulators (TRs) in plant species whose genomes have been completely sequenced and anno ...
45%

PIR SuperFamily


The PIR SuperFamily concept is being used as a guiding principle to provide comprehensive and non-overlapping clustering of UniProtKB sequences into a hierarchical order to reflect their evolutionary relationships.
44%

HOGENOM


HOGENOM is a phylogenomic database providing families of homologous genes and associated phylogenetic trees (and sequence alignments) for a wide set sequenced organisms.
44%

CleanEx


CleanEx is a database which provides access to public gene expression data via unique approved gene symbols and which represents heterogeneous expression data produced by different technologies in a way that facilitates joint analysis and cross-datas ...
43%

UniProtKB Swiss-Prot


Manually annotated component of UniProtKB (produced by the UniProt consortium). It contains manually-annotated (reviewed) records with information extracted from the literature and curator-evaluated computational analysis.
43%

GreenPhylDB: A phylogenomic database for plant comparative genomics


GreenPhylDB comprises 37 full genomes from the major phylum of plant evolution. Clustering of these genomes was performed to define a consistent and extensive set of homeomorphic plant families.
42%

PeroxiBase


Peroxibase provides access to peroxidase sequences from all kingdoms of life, and provides a series of bioinformatics tools and facilities suitable for analysing these sequences.
42%

GermOnline


GermOnline is a cross-species database gateway focusing on high-throughput expression data relevant for germline development, the meiotic cell cycle and mitosis in healthy versus malignant cells. The portal provides access to the Saccharomyces Genomi ...
41%

LegumeIP


The LegumeIP 2.0 database hosts large-scale genomics and transcriptomics data and provides integrative bioinformatics tools for the study of gene function and evolution in legumes.
41%

Homologous Vertebrate Genes Database


HOVERGEN is a database of homologous vertebrate genes that allows one to select sets of homologous genes among vertebrate species, and to visualize multiple alignments and phylogenetic trees.
41%

Mouse Atlas of Gene Expression


The Mouse Atlas of Gene Expression is a quantitative and comprehensive atlas of gene expression in mouse development. Gene expression levels from 198 tissue samples was measured using 202 Serial Analysis of Gene Expression (SAGE). Emphasis was on mou ...
40%

Complete Arabidopsis transcriptome database


This database contains data on 105 Arabidopsis transcriptome projects (3262 hybridized samples) in public access out of a total of 172 projects (~6440 hybridized samples).
39%

PREX


PeroxiRedoxin classification indEX
39%

phiSITE


Gene regulation in bacteriophages
39%

tRNADB-CE


tRNA Gene DataBase Curated by Experts
39%

BGMUT


Blood Group Antigen Gene Mutation Database
39%

HOMOLENS


Database of homologous genes from Ensembl organisms and Ensembl families, structured under ACNUC sequence database management system.
38%

Protegen


Protective antigen database and analysis system
38%

P2CS


Prokaryotic 2-Component Systems database
38%

CuticleDB


Structural proteins of Arthropod cuticle
38%

PLANT-PIs


Plant protease inhibitors (PIs) can be counted among the defensive proteins that plants display to minimize the adverse effects deriving from the attack of phytophagous insects. They are usually present in seeds and storage tissues, but are also expr ...
38%

ThYme


Thioester-active enzymes
37%

CUBE-DB


Detection of functional divergence in human protein families. Cube-DB is a database of pre-evaluated conservation and specialization scores for residues in paralogous proteins belonging to multi-member families of human proteins. Protein family class ...
37%

4DXpress


Database for cross species expression pattern comparisons
37%

FunShift


Functional divergence between the subfamilies of a protein domain family
36%

FunSimMat


Gene Ontology-based functional similarity values for proteins and protein families
36%

FusariumDB


Comparative genomics of Fusarium strains
36%

fPOP


fPOP (footprinting Pockets Of Proteins) is a database of the protein functional surfaces identified by shape analysis. In this relational database, we collected the spatial patterns of protein binding sites including both holo and apo forms from more ...
36%

Structural and functional annotation of Arabidopsis thaliana gene and protein families


GeneFarm is a database whose purpose is to store traceable annotations for Arabidopsis nuclear genes and gene products.
35%

HMS-ICS


The Hyperlink Management System (HMS) automatically updates and maintains hyperlinks among major databases using various data IDs (e.g. HUGO Gene Symbols, IDs from PDB, UniProt). The ID Converter System (ICS) supports the conversion of data IDs using ...
34%

EBI patent sequences


Non-redundant databases of patent DNA and protein sequences
34%

BiG-FAM


Biosynthetic gene cluster families
34%

PhEVER


Phylogenetic Exploration of Viruses and their Evolutionary Relationships
34%

Protein Naming Utility


Repository of protein names and naming rules
34%

Laminin Database


Laminins (LM) correspond to a large number of heretotrimeric glycoproteins, playing and a major role in several cell functions, including differentiation, proliferation, adhesion, and migration [1-3]. In addition to binding to other extracellular mat ...
33%

SilkDB 3.0


SilkDB is an open-accessibility database and powerful platform that provides comprehensive information on the silkworm (Bombyx mori) genome. Since SilkDB 2.0 was released 10 years ago, vast quantities of data about multiple aspects of the silkworm ha ...
33%

GPSDB


Collection of gene and protein names, organized by species that can be used to search for a given gene/protein name, retrieve all synonyms for this entity, and query Medline with a set of user-selected terms.
32%

UniParc


The UniProt archive (UniParc), part of the UniProt databases, is an archival protein sequence collection from all major publicly accessible resources. New and revised protein sequences are added daily into UniParc while not deleting the previous vers ...
32%

PSC


Database of proteins classified by their functional surfaces.
32%

ARGminer


ARGminer is a web platform dedicated to the validation of ARGs by using sequence alignment and evidence from multiple sources.
32%

EvolClust


EvolClust predicts groups of genes that are conserved in terms of gene order across different species distinguishing it from the background gene order conservation found between species. We define a cluster as a group of homologous proteins that are ...
32%

PADS Arsenal


A database of prokaryotic defense systems related genes. Procaryotic Antiviral Defense System. procaryotic database, defense system database, pangenome database. PADS Arsenal A Database of Prokaryotic Defense Systems Related Genes.
32%

CpGDB


CpGDB (Chloroplast Genome Database) is a user friendly, freely available and dynamic relational database which allows search and download of complete chloroplast genome sequences, individual gene sequences and feature records of different plant speci ...
31%

ProtoBug


Database and resource of protein families in Arthropod genomes.
31%

RiceRelativesGD


a genomic database of rice relatives for rice research | Welcome to RiceRelativesGD, a genomic database of rice relatives for rice research! | The cultivated rice (Oryza sativa) is one of the most important crops in the world
31%

GTDB


an integrated resource for glycosyltransferase sequences and annotations. Glycosyltransferases (GTs), a large class of carbohydrate-active enzymes, adds glycosyl moieties to various substrates to generate multiple bioactive compounds, including natu ...
31%

Cyanolyase


Sequences and motifs of the phycobilin lyase protein family
30%

Genomicus


The Spirochetes Genome Browser
30%

RRMdb


Evolutionary-oriented database of RNA recognition motif sequences.
28%

SulfAtlas


SulfAtlas describes the family and sub-families of structurally-related sulfatases. Sub-families are created based on phylogenetic analyses and essentially correspond to different substrate specificities.
28%

ImmunoDB


A simple access point to view gene family assignments, annotations and phylogenetic data for insect immune-related genes and gene families.
28%

Prokaryotic protein phosphatase database (PptaseDB)


Prokaryotic protein phosphatases.
28%

Entrez Protein Clusters


A collection of related protein sequences (clusters) consists of proteins derived from the annotations of whole genomes, organelles and plasmids. It currently limited to Archaea, Bacteria, Plants, Fungi, Protozoans, and Viruses.
28%

Cell cycle database (CellCycleDB)


A collection of genes and proteins involved in human and yeast cell cycle.
28%

MoonDB


MoonDB is a database containing predicted Extreme Multifunctional (EMF) proteins (i.e. proteins with several unrelated functions), as well as a set of manually curated moonlighting proteins. Moonlighting proteins are a subclass of multifunctional pro ...
28%

Phymyco-DB


A database designed to help the mycologist community to access to fungal DNA markers data for phylogenetic analyses.
28%

Genomicus-fungi


Genomicus-fungi is a genome browser that enables users to navigate in fungi genomes in several dimensions: linearly along chromosome axes, transversaly across different species, and chronologicaly along evolutionary time.
28%

Genomicus-protists


Genomicus-protists is a genome browser that enables users to navigate in protists genomes in several dimensions: linearly along chromosome axes, transversaly across different species, and chronologicaly along evolutionary time.
28%

Genomicus-metazoa


Genomicus-metazoa is a genome browser that enables users to navigate in metazoa genomes in several dimensions: linearly along chromosome axes, transversaly across different species, and chronologicaly along evolutionary time.
28%

Genomicus-tunicates


Genomicus-tunicates is a genome browser that enables users to navigate in tunicates genomes in several dimensions: linearly along chromosome axes, transversaly across different species, and chronologicaly along evolutionary time. It uses ANISEED (htt ...
28%

MoonProt 3.0


An update of the moonlighting proteins database. MoonProt – A Database for Moonlighting Proteins. A Database for Moonlighting Proteins. A moonlighting protein is a single protein that has multiple functions that are not due to gene fusions, multip ...
28%

Genomicus-plants


Genomicus-Plants is a genome browser that enables users to navigate in plants genomes in several dimensions: linearly along chromosome axes, transversaly across different species, and chronologicaly along evolutionary time.
28%

*ReputationScore indicates how established a given datasource is. Find out more.




Need help integrating and/or managing biomedical data?