OBD:Similarity Statistics

From NCBO Wiki

OBD has various capabilities for computing the similarity between entities based on ontology annotations.

OBD is flexible about which kinds of annotations and ontologies are used. For example, it can help answer questions such as:

  • Based on phenotype annotations, which mouse genes (or genotypes) are similar to the human gene EPB41?
  • Based on GO annotations, how well does similarity of functional annotations correlate with sequence similarity?
  • Based on annotations of microbes to environmental habitats, which organisms have profiles similar to that of a particular microbe?

Here we will use annotations of genes, alleles and genotypes to phenotypes for illustrative purposes, but the methods are generic.

Use of reasoner

All similarity statistics are computed over the pre-reasoned database, so inferred annotations are included alongside the asserted ones. If a gene affects the hippocampus, it also affects the brain. If a mutation affects the permeability of a mitochondrial membrane, it also affects the permeability of a membrane. This ensures that annotations made at different levels of granularity and specificity can be compared and that their commonalities are found.
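The effect of pre-reasoning can be illustrated with a minimal sketch. It assumes a toy ontology held in a dictionary; the term names, the `PARENTS` table, and the helper functions are hypothetical and are not part of the OBD API. Each direct annotation is expanded to include all of its ancestors, after which two entities annotated to different specific terms turn out to share more general ones.

```python
# Toy ontology: each term maps to its direct parents (is_a / part_of).
# Terms and edges are illustrative only, not loaded from a real ontology.
PARENTS = {
    "hippocampus": ["brain"],
    "cerebellum": ["brain"],
    "brain": ["organ"],
    "organ": [],
}

def ancestors(term, parents=PARENTS):
    """Return the term plus everything reachable by following parent links."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, []))
    return seen

def inferred_profile(direct_terms, parents=PARENTS):
    """Expand a set of direct annotations to the full inferred set."""
    profile = set()
    for term in direct_terms:
        profile |= ancestors(term, parents)
    return profile

# Two hypothetical genes annotated to different parts of the brain.
gene_a = inferred_profile({"hippocampus"})
gene_b = inferred_profile({"cerebellum"})

# Without inference the direct annotations do not overlap at all;
# with inference the shared, more general terms become visible.
shared = gene_a & gene_b
print(sorted(shared))  # ['brain', 'organ']
```

In a pre-reasoned database this expansion has already been done ahead of time, so similarity queries can compare the expanded profiles directly.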

Inter-species comparisons

In cases where we want to compare anatomical terms across species, we reason using Uberon.
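As a rough illustration of why a shared anatomy ontology helps, the sketch below replaces species-specific terms with a common class before comparing profiles. This is a simplification: OBD obtains the cross-species links by reasoning, not from a lookup table, and every identifier here (`mouse_anatomy:...`, `human_anatomy:...`, `shared:...`) is a made-up placeholder, not a real ontology ID.

```python
# Hypothetical placeholder identifiers; real ontologies use their own IDs.
TO_SHARED = {
    "mouse_anatomy:hippocampus": "shared:hippocampus",
    "human_anatomy:hippocampus": "shared:hippocampus",
    "mouse_anatomy:tail": "shared:tail",
}

def to_shared_terms(species_terms, mapping=TO_SHARED):
    """Translate species-specific terms to shared classes.

    Terms without a mapping are kept unchanged.
    """
    return {mapping.get(term, term) for term in species_terms}

mouse_gene = to_shared_terms({"mouse_anatomy:hippocampus", "mouse_anatomy:tail"})
human_gene = to_shared_terms({"human_anatomy:hippocampus"})

# The species-specific terms never match each other directly,
# but their shared-class counterparts do.
print(mouse_gene & human_gene)  # {'shared:hippocampus'}
```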

Statistics

See the javadoc for SimilarityPair.
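The javadoc remains the authoritative description of the statistics OBD computes. Purely as a generic illustration of a set-based similarity score over annotation profiles, the sketch below computes the Jaccard index (shared terms divided by all terms). The `jaccard` function and the example profiles are assumptions made for this example and do not reflect the SimilarityPair API.

```python
def jaccard(profile_a, profile_b):
    """Jaccard index of two annotation sets: |A and B| / |A or B|."""
    union = profile_a | profile_b
    if not union:
        # Two empty profiles have nothing in common to measure.
        return 0.0
    return len(profile_a & profile_b) / len(union)

# Hypothetical profiles, already expanded with inferred ancestor terms.
profile_a = {"hippocampus", "brain", "organ"}
profile_b = {"cerebellum", "brain", "organ"}

print(jaccard(profile_a, profile_b))  # 0.5
```

Here two of the four distinct terms are shared, giving a score of 0.5; identical profiles score 1.0 and disjoint profiles score 0.0.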