The PEDANT genome database provides exhaustive automatic analysis of genomic sequences by a large variety of established bioinformatics tools through a comprehensive Web-based user interface. Nearly 3000 completely sequenced, publicly available eukaryotic, eubacterial, archaeal and viral genomes with more than 4.5 million proteins have been processed so far. In particular, all completely sequenced genomes from the NCBI's Reference Sequence collection (RefSeq) (1) are covered. The PEDANT processing pipeline has been sped up by an order of magnitude through the use of precalculated similarity information stored in the similarity matrix of proteins (SIMAP) database (2), making it possible to process newly sequenced genomes as soon as they become available. For programmatic access, Web Services are available at


