# Asymptotic Properties of High-Dimensional Random Forests

The empirical success and popularity of random forests raise the natural question of how to understand their underlying mechanisms from a theoretical perspective.

recent work on the consistency of random forests:

- some of the earlier consistency results considered simplified versions of the original random forests algorithm, where **the splitting rules are assumed to be independent of the response.**
- some work contributes to the consistency of the original version of the random forests algorithm in the classical setting of a fixed-dimensional ambient feature space.
- some consistency results give rates of convergence in terms of the number of informative features in sparse models, obtained by assuming a simplified version of the random forests algorithm.
- additional theoretical results on random forests include pointwise consistency, asymptotic distributions, and confidence intervals for random forests predictions.

it remains unclear how to characterize the consistency rate for the original version of the random forests algorithm in a general high-dimensional nonparametric regression setting.

main contributions:

- characterize this consistency rate for random forests with non-fully-grown trees
- the random forests estimator can be consistent with a rate of some polynomial order of sample size
- the bias analysis reveals how the bias depends on the sample size, column subsampling parameter, and tree height.
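
One way to read "consistent with a rate of some polynomial order of sample size" is as a bound on the mean squared prediction error. The notation below is assumed for illustration and is not taken from these notes: $m$ is the true regression function in the model $Y = m(X) + \varepsilon$, $\widehat{m}_n$ is the random forests estimate built from $n$ observations, and $c$ is an unspecified positive constant.

$$
\mathbb{E}\big[(\widehat{m}_n(X) - m(X))^2\big] = O\big(n^{-c}\big) \quad \text{for some constant } c > 0.
$$

The value of the exponent $c$ and the conditions under which such a bound holds are not stated here.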

