The Evidence Rank

The Evidence Rank indicates the quality of the annotations used at source to make the predictions. The Gene Ontology Dictionary comes with an evidence code for every annotation. If this evidence is good (for example, experimentally verified rather than electronically assigned), the reliability of the prediction made from such annotation is correspondingly higher.

GO evidence codes and their assigned ranks

Evidence descriptionCodeRank
1Inferred by curatorIC0
2Traceable author statementTAS1
3Inferred from direct assayIDA1
4Inferred from mutant phenotypeIMP2
5Inferred from genetic interactionIGI2
6Inferred from physical interactionIPI2
7Inferred from sequence or structural similarityISS3
8Inferred from expression patternIEP3
9Inferred from Reviewed Computational AnalysisRCA3
10Non-traceable author statementNAS4
11Inferred from electronic annotationIEA5
12No dataND6
13No recordNR6

Please note that the evidence codes are arranged in descending order of their reliability. The order is somewhat arbitrary and merely reflects a belief that reliability of functional inferences observed from such evidences may be interpreted with the stated level of confidence.

The Evidence Rank is defined by the formula:

Evidence Rank (ER) is basically an cumulative average of the individual evidences (IE) of the annotations.

ER = IE_1, if only one GO annotation is used; ER = (IE_1+IE_2)/2 if two GO annotations are used; ER = (((IE_1+IE_2)/2) + IE_3)/2 if three GO annotations are used, and so on.

Please note that if the Evidence Rank is low, then it is a better value than high.
For example, an Evidence Rank with a value 5 would mean that the prediction was derived from all electronic annotations. Therefore even if the Bayesian weight of the prediction is high, the confidence in the result has to be interpreted with adequate caution.