BioThesaurus

BioThesaurus is a web-based system that maps a comprehensive collection of protein and gene names to protein entries in the UniProt Knowledgebase (UniProtKB). Currently covering more than two million protein sequences, BioThesaurus consists of over 2.8 million names extracted from multiple molecular biology databases according to the database cross-references provided in iProClass (Wu et al, 2004). The BioThesaurus web site allows the retrieval of synonymous names of given protein entries and the identification of protein entries sharing the same names. The BioThesaurus dataset can be used for automatic protein named entity recognition. It is updated monthly and can be freely downloaded at http://pir.georgetown.edu/iprolink/biothesaurus/data/thesaurus.

Webpage:

http://pir.georgetown.edu/iprolink/biothesaurus/

Publications:

Publications
More detailed information about this field from each metasource.

http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/1/103
metasource: Nucleic Acid Research database catalogue
version: extracted_at: 2022-11-04T11:16:25.468657

http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/1/103

Tags:

Tags
More detailed information about this field from each metasource.

genomics databases (non-vertebrate)
metasource: Nucleic Acid Research database catalogue
version: extracted_at: 2022-11-04T11:16:25.468657

genome annotation terms, ontologies and nomenclature
metasource: Nucleic Acid Research database catalogue
version: extracted_at: 2022-11-04T11:16:25.468657

genomics genome annotation terms, ontologies and nomenclature

More to explore:

1/20

Previous Next

Need help integrating and/or managing biomedical data?

BioThesaurus

Webpage:

Publications:

Publications
More detailed information about this field from each metasource.

http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/1/103
metasource: Nucleic Acid Research database catalogue
version: extracted_at: 2022-11-04T11:16:25.468657

More to explore:

1/20

iProLINK

PIR - Protein Information Resource

PIR SuperFamily

UniProt Knowledgebase

NeXO

DGA

IGRhCellID

GONUTS

Protein Naming Utility

RESID Database of Protein Modifications

Gene Ontology Annotation Database

The Protein Database

PRotein Ontology

MIRIAM Registry

MetaBase

UniProt Taxonomy

Named Entity Recognition Ontology

IUBMB Nomenclature database

iProClass

Jenalib: Jena Library of Biological Macromolecules

BioThesaurus

Webpage:

Publications: Publications More detailed information about this field from each metasource. × http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/1/103 metasource: Nucleic Acid Research database catalogue version: extracted_at: 2022-11-04T11:16:25.468657 Close

More to explore:

1/20

iProLINK

PIR - Protein Information Resource

PIR SuperFamily

UniProt Knowledgebase

NeXO

DGA

IGRhCellID

GONUTS

Protein Naming Utility

RESID Database of Protein Modifications

Gene Ontology Annotation Database

The Protein Database

PRotein Ontology

MIRIAM Registry

MetaBase

UniProt Taxonomy

Named Entity Recognition Ontology

IUBMB Nomenclature database

iProClass

Jenalib: Jena Library of Biological Macromolecules

Publications:

Publications
More detailed information about this field from each metasource.

http://bioinformatics.oxfordjournals.org/cgi/content/abstract/22/1/103
metasource: Nucleic Acid Research database catalogue
version: extracted_at: 2022-11-04T11:16:25.468657