Random shapley forests: cooperative game based random forests with consistency

Jianyuan Sun, Hui Yu, Guoqiang Zhong*, Junyu Dong*, Shu Zhang, Hongchuan Yu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

87 Downloads (Pure)

Abstract

The original random forests algorithm has been widely used and has achieved excellent performance for the classification and regression tasks. However, the research on the theory of random forests lags far behind its applications. In this paper, to narrow the gap between the applications and theory of random forests, we propose a new random forests algorithm, called random Shapley forests (RSFs), based on the Shapley value. The Shapley value is one of the well-known
solutions in the cooperative game, which can fairly assess the power of each player in a game. In the construction of RSFs, RSFs uses the Shapley value to evaluate the importance of each feature at each tree node by computing the dependency among the possible feature coalitions. In particular, inspired by the existing consistency theory, we have proved the consistency of the proposed random forests algorithm. Moreover, to verify the effectiveness of the proposed algorithm, experiments on eight UCI benchmark datasets and four real-world datasets have been conducted. The results show that RSFs perform better than or at least comparable with the existing consistent random forests, the original random forests and a classic classifier, support vector machines.
Original languageEnglish
Number of pages10
JournalIEEE Transactions on Cybernetics
Early online date23 Mar 2020
DOIs
Publication statusEarly online - 23 Mar 2020

Keywords

  • Random forests
  • feature evaluation
  • Shapley value
  • consistency

Fingerprint

Dive into the research topics of 'Random shapley forests: cooperative game based random forests with consistency'. Together they form a unique fingerprint.

Cite this