Tag: natural language processing

Found 47 sources

Source	Match	ReputationScore*
DisGeNET: a knowledge base for disease genomics DisGeNET is a discovery platform containing one of the largest collections available of genes and variants involved in human diseases. DisGeNET integrates data from expert curated repositories, GWAS catalogues, animal models, and the scientific liter ...		100%
TISSUES TISSUES is a weekly updated web resource that integrates evidence on tissue expression from manually curated literature, proteomics and transcriptomics screens, and automatic text mining. All evidence to common protein identifiers and Brenda Tissue O ...		58%
DISEASES DISEASES is a weekly updated web resource that integrates evidence on disease-gene associations from automatic text mining, manually curated literature, cancer mutation data, and genome-wide association studies. We further unify the evidence by assig ...		54%
COMPARTMENTS COMPARTMENTS is a weekly updated web resource that integrates evidence on protein subcellular localization from manually curated literature, high-throughput screens, automatic text mining, and sequence-based prediction methods. We map all evidence to ...		53%
Natural Products Atlas An Open Access Knowledge Base for Microbial Natural Products Discovery. The Natural Products Atlas Network views of chemical space. The Natural Products Atlas provides a unique tools for exploring natural products chemical space, offering perspecti ...		50%
CancerResource Cancer-relevant proteins and compound interactions		49%
Arabidopsis Hormone Database Plant hormones are small organic molecules that influence almost every aspect of plant growth and development. Genetic and molecular studies have revealed a large number of genes that are involved in responses to numerous plant hormones, including au ...		48%
COVID-19 Research Collaborations The COVID-19 Research Collaborations database stores information on researchers and institutions for the purposes of identifying potential research experts or collaborators in areas related to the coronavirus epidemic across basic science, translatio ...		48%
ORGANISMS ORGANISMS is a weekly updated web resource that facilitates taxonomy-aware search and retrieval of articles. To this end, the the resource performs named entity recognition of terms from the NCBI Taxonomy on PubMed abstracts. The resource further pro ...		48%
CancerMine Literature-mined resource for drivers, oncogenes and tumor suppressors in cancer.		47%
MetaBioME Metagenomic BioMining Engine: homologs of commercially useful enzymes in metagenomic datasets		47%
Clinical Trials Ontology The Clinical Trials Ontology (CTO) is also known as the Clinical Trial Ontology-Neurodegenerative Diseases (CTO-NDD), and describes clinical trials in the field of neurodegeneration. This resource has been created for use in the IMI-funded AETIONOMY ...		46%
UKPMC UK PubMed Central is a full-text article database that extends the functionality of the original PubMed Central (PMC) repository. Now includes both a UKPMC and PubMed search, as well as access to other records such as Agricola, Patents and recent bio ...		44%
Blood Exposome Database Generating the Blood Exposome Database Using a Comprehensive Text Mining and Database Fusion Approach \| The exposome represents the sum of all exposures during the life-span of an organism (from chemicals to microbes, viruses, radiation and other sou ...		44%
CIViCmine CIViCmine is a literature-mined database of clinically relevant cancer biomarkers from data from PubMed and Pubmed Central Open Access subset.		43%
COVID-19 Ontology The COVID-19 ontology covers the role of molecular and cellular entities in virus-host-interactions, in the virus life cycle, as well as a wide spectrum of medical and epidemiological concepts linked to COVID-19.		43%
ReGEO Restructured version of Gene Expression Omnibus (GEO) that provides a user friendly interface for curating GEO database		42%
DES-TOMATO DES-TOMATO is a topic-specific literature exploration system developed to allow the exploration of information related to tomato. The information provided in DES-TOMATO is obtained through the text-mining of available scientific literature, namely fu ...		42%
COVID-19-CT-CXR COVID-19-CT-CXR is a public database of COVID-19 CXR and CT images, which are automatically extracted from COVID-19-relevant articles from the PubMed Central Open Access (PMC-OA) Subset.		41%
GIDB Knowledge database for the automated curation and multidimensional analysis of molecular signatures in gastrointestinal (GI) cancer.		40%
DeepKG An End-to-End Deep Learning-Based Workflow for Biomedical Knowledge Graph Extraction, Optimization and Applications.		39%
CRAFT The Colorado Richly Annotated Full Text Corpus (CRAFT) is a manually annotated corpus consisting of 67 full-text biomedical journal articles from the PubMed Central Open Access Subset.		39%
R-loopBase A knowledgebase for genome-wide R-loop formation and regulation.		39%
HFIP Heart Failure Integrated Platform (HFIP) is an integrated multi-omics data and knowledge platform for the precision medicine of heart failure.		39%
COVID-19 preVIEW COVID-19 preVIEW is a tool for semantic search to explore COVID-19 research preprints.		39%
HmtVar A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information. The main web resource to explore human mitochondrial variability data and their pathological correlation. HmtVar is a manually-curate ...		39%
BRONCO Berlin-Tübingen-Oncology corpus (BRONCO) is a large and freely available corpus of shuffled sentences from German oncological discharge summaries annotated with diagnosis, treatments, medications, and further attributes including negation and specula ...		39%
GEREDB Gene expression regulation database curated by mining abstracts from literature. Gene Expression Regulation Database. Gene Expression Regulation Database (GREDB) has been developed to facilitate systems-level analyses that will provide insights int ...		39%
Ontology of Language Disorder in Autism Language terms used in the domain of autism. The language terms were obtained via text mining and automatic retrieval of terms from the corpus of PubMed abstracts.		38%
Ocular Immune-Mediated Inflammatory Diseases Ontology Application ontology for Ocular Immune-mediated Inflammatory Diseases, built from domain experts in ophthalmology, clinical guidelines, and enhanced with patient-preferred terms.		38%
InflamNat Web-Based Database and Predictor of Anti-Inflammatory Natural Products.		36%
Modelzoo Model Zoo curates and provides a platform for deep learning researchers to easily find pre-trained models for a variety of platforms and uses.		36%
KOMBAT Knowledgebase of Microbes’ Battling Agents for Therapeutics.		36%
CDCDB A large and continuously updated drug combination database.		36%
PHILM2Web A high-throughput database of macromolecular host-pathogen interactions on the Web.		36%
BRHD Brain Research Hotspot Database (BRHD) a A comprehensive platform for the latest advances in brain research		36%
FamPlex It is an effective resource for improving named entity recognition, grounding, and relationship resolution in automated reading of biomedical text.		36%
emiRIT emiRIT is a text-mining based database for microRNA information.		36%
FRCD FRCD (Food Risk Component Database) is a comprehensive food risk component database with molecular scaffold, chemical diversity, toxicity, and biodegradability analysis.		36%
M3Cs MicroRNA childhood Cancer Catalog (M3Cs) is a high-quality curated collection of published miRNA research studies on 16 pediatric cancer diseases.		36%
CoKE COVID-19 Knowledge Extractor (COKE) is a tool and a web portal to extract drug - target protein associations from the CORD-19 corpus of scientific publications on COVID-19.		36%
MEDI-2 MEDI (MEDication Indication) is an ensemble medication indication resource for primary and secondary uses of electronic medical record (EMR) data.		36%
BioRED a biomedical relation extraction dataset (BioRED) with multiple entity types (e.g. gene/protein, disease, chemical) and relation pairs (e.g. gene-disease; chemical-chemical) at the document level, on a set of 600 PubMed abstracts.		36%
LTM-TCM A comprehensive database for the linking of Traditional Chinese Medicine with modern medicine at molecular and phenotypic levels.		36%
CNV-ETLAI A scalable artificial intelligence platform that automatically finds copy number variations (CNVs) in journal articles and transforms them into a database.		36%
EpiGraphDB EpiGraphDB is an analytical platform and database to support data mining in epidemiology. The platform incorporates a graph of causal estimates generated by systematically applying Mendelian randomization to a wide array of phenotypes, and augments t ...		36%
CONSORT-TM CONSORT-TM is a corpus annotated with CONSORT checklist items, and studied baseline sentence classification methods as well as their combinations to recognize a subset of these items.		36%

*ReputationScore indicates how established a given datasource is. Find out more.

Need help integrating and/or managing biomedical data?