학술논문

Equitability, mutual information, and the maximal information coefficient.

Document Type

Journal

Author

Kinney, Justin B. (1-CSHL) AMS Author Profile; Atwal, Gurinder S. (1-CSHL) AMS Author Profile

Source

Proceedings of the National Academy of Sciences of the United States of America (Proc. Natl. Acad. Sci. USA) (20140101), 111, no. 9, 3354-3359. ISSN: 0027-8424 (print).eISSN: 1091-6490.

Subject

62 Statistics -- 62A Foundational and philosophical topics
62A99 None of the above, but in this section

Language

English

Abstract

Summary: ``How should one quantify the strength of association betweentwo random variables without bias for relationships of a specific form?Despite its conceptual simplicity, this notion of statistical`equitability' has yet to receive a definitive mathematicalformalization. Here we argue that equitability is properly formalizedby a self-consistency condition closely related to Data ProcessingInequality. Mutual information, a fundamental quantity in informationtheory, is shown to satisfy this equitability criterion. These findingsare at odds with the recent work of D. N. Reshef et al. [Science {\bf 334}(2011), no. 6062, 1518--1524, \doi{10.1126/science.1205438}], which proposed analternative definition of equitability and introduced a new statistic,the `maximal information coefficient' (MIC), said to satisfyequitability in contradistinction to mutual information. Theseconclusions, however, were supported only with limited simulationevidence, not with mathematical arguments. Upon revisiting theseclaims, we prove that the mathematical definition of equitabilityproposed by Reshef et al. cannot be satisfied by any (nontrivial)dependence measure. We also identify artifacts in the reportedsimulation evidence. When these artifacts are removed, estimates ofmutual information are found to be more equitable than estimates ofMIC. Mutual information is also observed to have consistently higherstatistical power than MIC. We conclude that estimating mutualinformation provides a natural (and often practical) way to equitablyquantify statistical associations in large datasets.''

Online Access

Full Text (JSTOR) Full Text (PNAS) Full Text (JSTOR) Web of Science JCR 저널정보 Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송