节点文献
分类间题中基于信息论度量的客观评价研究(英文)
Information-theoretic Measures for Objective Evaluation of Classifications
【摘要】 This work presents a systematic study of objective evaluations of abstaining classifications using information-theoretic measures (ITMs). First, we define objective measures as the ones which do not depend on any free parameter. According to this definition, technical simplicity for examining "objectivity" or "subjectivity" is directly provided for classification evaluations. Second, we propose 24 normalized ITMs for investigation, which are derived from either mutual information, divergence, or cross-entropy. Contrary to conventional performance measures that apply empirical formulas based on users intuitions or preferences, the ITMs are theoretically more general for realizing objective evaluations of classifications. They are able to distinguish "error types" and "reject types" in binary classifications without the need to inputting data of cost terms. Third, to better understand and select the ITMs, we suggest three desirable features for classification assessment measures, which appear more crucial and appealing from the viewpoint of classification applications. Using these features as "meta-measures", we can reveal the advantages and limitations of ITMs from a higher level of evaluation knowledge. Numerical examples are given to demonstrate our claims and compare the differences among the proposed measures. The best measure is selected in terms of the meta-measures, and its specific properties regarding error types and reject types are analytically derived.
【Abstract】 This work presents a systematic study of objective evaluations of abstaining classifications using information-theoretic measures (ITMs). First, we define objective measures as the ones which do not depend on any free parameter. According to this definition, technical simplicity for examining "objectivity" or "subjectivity" is directly provided for classification evaluations. Second, we propose 24 normalized ITMs for investigation, which are derived from either mutual information, divergence, or cross-entropy. Contrary to conventional performance measures that apply empirical formulas based on users intuitions or preferences, the ITMs are theoretically more general for realizing objective evaluations of classifications. They are able to distinguish "error types" and "reject types" in binary classifications without the need to inputting data of cost terms. Third, to better understand and select the ITMs, we suggest three desirable features for classification assessment measures, which appear more crucial and appealing from the viewpoint of classification applications. Using these features as "meta-measures", we can reveal the advantages and limitations of ITMs from a higher level of evaluation knowledge. Numerical examples are given to demonstrate our claims and compare the differences among the proposed measures. The best measure is selected in terms of the meta-measures, and its specific properties regarding error types and reject types are analytically derived.
【Key words】 Abstaining classifications; error types; reject types; entropy; similarity; objectivity; information-theoretic measures (ITMs);
- 【文献出处】 自动化学报 ,Acta Automatica Sinica , 编辑部邮箱 ,2012年07期
- 【分类号】TP18;O236
- 【被引频次】6
- 【下载频次】137