SBASE (http://www.icgeb.trieste.it/sbase) is an on-line collection of protein domain sequences and related computational tools designed to facilitate the detection of domain homologies by a simple database search. The tenth, "jubilee" release of the SBASE library of protein domain sequences contains 1,052,904 annotated structural, functional, ligand-binding and topogenic segments of proteins, clustered into more than 6,000 domain groups. Domain identification and functional prediction are based on comparing BLAST search outputs with a knowledge base of biologically significant similarities within the known domain groups. The knowledge base is generated automatically for each domain group by comparing within-group (“self”) and out-of-group (“non-self”) similarities. This is a memory-based approach in which group-specific similarity functions are learned automatically from the database [Stanfill and Waltz, 1986].
protein sequence, protein domains, classification
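
The idea of learning a group-specific similarity function from "self" and "non-self" similarities can be illustrated with a minimal Python sketch. It is not the SBASE implementation: the `GroupProfile` structure, the midpoint threshold rule, the `min_hits` parameter, and the example group names and scores are all assumptions introduced here for illustration only. The sketch stores, for each domain group, the lowest within-group score and the highest out-of-group score, and then accepts a query hit only if its score falls on the "self" side of the midpoint between the two.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class GroupProfile:
    """Score range stored for one domain group (hypothetical structure)."""
    min_self: float      # lowest score observed between members of the group
    max_non_self: float  # highest score observed against non-members

    @property
    def threshold(self) -> float:
        # Midpoint between the two populations: an assumed, simplistic rule.
        return (self.min_self + self.max_non_self) / 2


def learn_profiles(self_scores, non_self_scores):
    """Build one profile per group from stored similarity scores."""
    profiles = {}
    for group, scores in self_scores.items():
        # Groups with no recorded out-of-group hits default to a score of 0.
        others = non_self_scores.get(group, [0.0])
        profiles[group] = GroupProfile(min(scores), max(others))
    return profiles


def assign_domains(hits, profiles, min_hits=2):
    """Return the groups whose hits resemble within-group similarities.

    hits: iterable of (group, score) pairs, e.g. parsed from a BLAST report.
    Groups are ranked by the mean score of their accepted hits.
    """
    accepted = {}
    for group, score in hits:
        profile = profiles.get(group)
        if profile is not None and score >= profile.threshold:
            accepted.setdefault(group, []).append(score)
    supported = {g: mean(s) for g, s in accepted.items() if len(s) >= min_hits}
    return sorted(supported, key=supported.get, reverse=True)


# Invented example data: scores are arbitrary numbers, not real BLAST output.
profiles = learn_profiles(
    {"SH3": [62.0, 75.5, 58.0], "kinase": [210.0, 180.0]},
    {"SH3": [31.0, 28.5], "kinase": [45.0, 52.0]},
)
hits = [("SH3", 66.0), ("SH3", 59.0), ("kinase", 60.0), ("SH3", 30.0)]
result = assign_domains(hits, profiles)
print(result)
```

In this toy setting the query is assigned only to the group for which at least two hits exceed the learned threshold. Because the threshold is derived separately for each group, a score that is significant for a short, divergent domain can still be rejected for a long, well-conserved one, which is the sense in which the similarity function is group-specific.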