Sim_Relevance_2006
Sim_Relevance_2006(
dag,
terms,
IC_method = "IC_annotation",
verbose = simona_opt$verbose
)
The IC method is fixed to IC_annotation
.
If thinking Lin_1998 is a measure of how close term a
and b
to their MICA term c
, the relevance method corrects it by multiplying
a factor which considers the specificity of how c
brings the information. The factor is calculated as 1-p(c)
where p(c)
is the annotation-based
probability p(c) = k/N
where k
is the number of items annotated to c
and N
is the total number of items annotated to the DAG. Then
the Relevance semantic similarity is calculated as:
= (1 - p(c)) * IC_Lin
sim = (1 - p(c)) * 2*IC(c)/(IC(a) + IC(b))
Paper link: doi:10.1186/1471-2105-7-302 .