A sample information file accompanying a .bed binary genotype table.
biological sample phenotype
Extended variant information file accompanying a .bed binary genotype table.
Original standard text format for sample pedigree information and genotype calls.
Variant information file accompanying a .ped text pedigree + genotype table.
The Food and Agriculture Microdata (FAM) Catalogue provides an inventory of datasets collected through farm and household surveys which contain information related to agriculture, food security, and n ...
Biosynthetic gene cluster families
The BioSamples database aggregates sample information for reference samples (e.g. Coriell Cell lines) and samples for which data exist in one of the EBI's assay databases such as ArrayExpress, the Eur ...
MIAPPE is a reporting guideline for plant phenotyping experiments. It comprises a checklist, i.e., a list of attributes to describe an experiment so that it is understandable and replicable. It should ...
MIRAGE (Minimum Information Required for A Glycomics Experiment) was created to improve the quality of glycomics data in the scientific literature. The sample preparation guidelines are designed to in ...
MetaSRA is a database of normalized SRA human sample-specific metadata following a schema inspired by the metadata organization of the ENCODE project. This schema involves mapping samples to terms in ...
A CT (Connectivity Table) file contains secondary structure information for an RNA sequence.
The ICM binary file format is used in databases pertaining to structural biology and protein families. It can be used for the graphical representation of RNA, DNA, and protein interactions.
The BPMAP file contains information relating to the design of the Affymetrix tiling arrays. It is a binary file with data stored in big-endian format.
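Because the entry above states only that the data are stored big-endian, the following is a minimal sketch of what that byte order means when decoding a binary field, not a BPMAP reader. The helper name `read_uint32_be` and the four sample bytes are hypothetical and chosen for illustration; the actual field layout of a BPMAP file is not described here and is not assumed.

```python
import struct


def read_uint32_be(buf: bytes, offset: int = 0) -> int:
    """Decode a 4-byte unsigned integer stored most-significant byte first.

    Hypothetical helper for illustration; it assumes nothing about the
    real BPMAP field layout.
    """
    # ">" selects big-endian byte order, "I" a 4-byte unsigned integer.
    (value,) = struct.unpack_from(">I", buf, offset)
    return value


# Made-up sample bytes, not taken from any real file.
raw = bytes([0x00, 0x00, 0x01, 0x02])

as_big_endian = read_uint32_be(raw)
# The same bytes decoded with the wrong (little-endian) byte order.
(as_little_endian,) = struct.unpack("<I", raw)

print(as_big_endian)     # 258
print(as_little_endian)  # 33619968
```

Reading the same four bytes with the wrong byte order yields a very different number, which is why the byte order has to be specified explicitly rather than left to the platform default.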
PDBx/mmCIF is a data dictionary for archiving macromolecular crystallographic experiments and their results.
This is a data table format used in the PUMAdb workflow to transform a .pcl file into a .cdt (clustered data table) file, which contains the original data reordered to reflect the clustering.
The FAANG metadata sample specification document describes the principles and structure for the FAANG metadata guidance. The main goal of the FAANG standards is to ensure all FAANG samples are well de ...
The COG-UK Consortium has published a dataset containing over 20K SARS-CoV-2 viral genome sequences, available as open access.
The EBI BioSamples JSON Format is a JSON schema used for submitting data to the EBI BioSamples database. This format can also be used to update or curate existing samples.
The acronym CIF is used both for the Crystallographic Information File, the data exchange standard file format of Hall, Allen & Brown (1991) (see Documentation), and for the Crystallographic Informati ...
The NCBI BioSample database stores submitter-supplied descriptive information, or metadata, about the biological materials from which data stored in NCBI’s primary data archives are derived. NCBI’s ar ...
Clusters of Orthologous Groups of proteins (COGs) were delineated by comparing protein sequences encoded in complete genomes, representing major phylogenetic lineages. Each COG consists of individual ...