Four feature selection methods are examined: Document Frequency (DF), Mutual Information (MI), the χ² statistic (CHI), and the Correlation Coefficient (CC). Their effect on text categorization accuracy is then compared using the K-nearest-neighbor algorithm.
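As a rough illustration of how the four scores relate, the sketch below computes them for one term and one category from a 2x2 document-count table. The sentence above does not give formulas, so the definitions used here are the commonly cited textbook forms and should be read as an assumption; the function name `term_scores` and the count names `a`, `b`, `c`, `d` are hypothetical.

```python
import math

def term_scores(a, b, c, d):
    """Score one term against one category from document counts.

    a: documents in the category that contain the term
    b: documents outside the category that contain the term
    c: documents in the category that lack the term
    d: documents outside the category that lack the term
    Formulas are the commonly cited forms (an assumption, not taken from the source).
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)

    # DF: number of documents containing the term.
    df = a + b
    # MI: log of observed co-occurrence over what independence would predict.
    mi = math.log(a * n / ((a + c) * (a + b))) if a > 0 else float("-inf")
    # CHI: chi-square statistic of the 2x2 table.
    chi = n * (a * d - c * b) ** 2 / denom if denom else 0.0
    # CC: signed square root of CHI, so negative association scores below zero.
    cc = math.sqrt(n) * (a * d - c * b) / math.sqrt(denom) if denom else 0.0

    return {"DF": df, "MI": mi, "CHI": chi, "CC": cc}

scores = term_scores(40, 10, 10, 40)
```

Under these definitions CC squared equals CHI, so the two differ only in that CC keeps the sign of the association. In a pipeline like the one described, the top-scoring terms under each method would be kept as features before running the K-nearest-neighbor classifier.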