IC_Sanchez_2011
IC_Sanchez_2011(
dag,
use_cache = simona_opt$use_cache,
verbose = simona_opt$verbose
)
It measures the average contribution of term t
on leaf terms. First denote zeta
as the number of leaf terms that
can be reached from term t
(or t
's offspring that are leaves.). Since all t
's ancestors can also
reach t
's leaves, the contribution of t
on leaf terms is scaled by n_ancestors
which is the number of t
's ancestor terms.
The final information content is normalized by the total number of leaves in the DAG, which is the possible maximal value of zeta
.
The complete definition of information content is:
= -log( (zeta/n_ancestor) / n_all_leaves) IC
Paper link: doi:10.1016/j.knosys.2010.10.001 .