iUUCD

The ubiquitin and ubiquitin-like (Ub/Ubl) conjugation is one of the most important post- translational modifications (PTMs) in proteins, and regulates a large number of cellular processes, such as cell cycle, signal transduction, apoptosis and autophagy, while aberrant modification is implicated in numerous pathologies, such as neurodegenerative disorders, inflammatory diseases and cancers. Identification and annotation of regulators in Ub/Ubl systems is fundamental for understanding the molecular mechanisms of Ub/Ubl conjugations, and provides a highly useful reservoir for discovering disease biomarkers and drug targets. In 2013, we reported a family-based database of UUCD 1.0 for ubiquitin and ubiquitin-like conjugations, containing 738 ubiquitin-activating enzyme (E1s), 2,937 ubiquitin-conjugating enzymes (E2s), 46,631 ubiquitin-protein ligases (E3s) and 6,647 deubiquitinating enzymes (DUBs) in 70 eukaryotes (1), whereas proteins with ubiquitin domains (UBDs) and ubiquitin-like domains (ULDs) were not included at that time (2,3). In this update, we greatly improved our previous database, and provided a much more comprehensive resource of iUUCD 2.0 (http://iuucd.biocuckoo.org/). Firstly, 27 E1s, 109 E2s, 1,153 E3s, 164 DUBs, 396 UBDs and 183 ULDs were collected from scientific literatures and classified into 1, 4, 23, 8, 27, 11 families, respectively. Secondly, HMM identification and ortholog search were adopted to detect the potential regulators. Ultimately, the iUUCD 2.0 totally contains 136,512 regulators in ubiquitin and ubiquitin-like conjugation systems, including 1,230 E1s, 5,636 E2s, 93,343 E3s, 9,548 DUBs, 30,173 UBDs and 11,099 ULDs in 148 eukaryotic species. In particular, we provided rich annotations for regulators of eight model organisms, especially in humans, by compiling and integrating the knowledge from nearly 70 widely used public databases that cover (i) Cancer Mutations, including ICGC, COSMIC, TCGA, CGAP and IntOGen; (ii) Single Nucleotide Polymorphisms (SNPs), such as dbSNP; (iii) mRNA Expression, including GEO, ArrayExpress, GXD, FFGED, TCGA, ICGC, COSMIC, HUMAN PROTEOME MAP and The Human Protein Atlas; (iv) DNA and RNA Elements, including UTRdb, AREsite, JASPAR CORE, circBase, circRNADb, CircNet, Circ2Traits, miRTarBase, microRNA.org, TRANSFAC, miRWalk, TargetScan, miRecords, RepTar, miRNAMap, SomamiR DB 2.0, miRcode, RAID v2.0 and LncRNADisease; (v) Protein- protein Interactions (PPIs), including IID, iRefIndex, PINA, HINT, Mentha, SZDB and InWeb_IM; (vi) Protein 3D Structures, including PDB, MMDB and SCOP; (vii) Disease-associated information, including ClinVar, OMIM, GWASdb and GWAS CENTRAL; (viii) Drug-target Relations, including DrugBank, TTD, KPID, CARLSBAD, SuperTarget, GRAC and PDTD; (ix) Post-translational Modifications (PTMs), including CPLM, dbPAF, dbPPT, phosSNP, PhosphositePlus, Phospho.ELM, dbPTM, PHOSIDA, BioGRID, HPRD, UniProt, O-GlycBase, PhosphoBase and mUbiSiDa; (x) DNA Methylation, including MethyCancer, TCGA, ICGC and COSMIC; (xi) Protein Expression/Proteomics, including The Human Protein Atlas, Human Proteome Map and GPMDB. Compared with our previously developed UUCD 1.0 (~0.41 GB), iUUCD 2.0 has a size of ~32.1 GB of data with a > 75-fold increase in data volume. We expect that iUUCD 2.0 can be more useful for further study of ubiquitin and ubiquitin-like conjugations.

Webpage:
http://iuucd.biocuckoo.org/

Publications:

Tags:

protein sequence protein family

More to explore:

1/20



Need help integrating and/or managing biomedical data?