0

On the Trade-off between the Number of Nodes and the Number of Trees in a Random Forest

In this paper, we focus on the prediction phase of a random forest and study the problem of representing a bag of decision trees using a smaller bag of decision trees, where we only consider binary decision problems on the binary domain and simple decision trees in which an…

Year
2023
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2312.11540ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

In this paper, we focus on the prediction phase of a random forest and study the problem of representing a bag of decision trees using a smaller bag of decision trees, where we only consider binary decision problems on the binary domain and simple decision trees in which an internal node is limited to querying the Boolean value of a single variable. As a main result, we show that the majority function of n variables can be represented by a bag of T (< n) decision trees each with polynomial size if n-T is a constant, where n and T must be odd (in order to avoid the tie break). We also show that a bag of n decision trees can be represented by a bag of T decision trees each with polynomial size if n-T is a constant and a small classification error is allowed. A related result on the k-out-of-n functions is presented too.