Regression conformal prediction with random forests
2014 (English)In: Machine Learning, ISSN 0885-6125, E-ISSN 1573-0565, Vol. 97, no 1-2, p. 155-176Article in journal (Refereed) Published
Abstract [en]
Regression conformal prediction produces prediction intervals that are valid, i.e., the probability of excluding the correct target value is bounded by a predefined confidence level. The most important criterion when comparing conformal regressors is efficiency; the prediction intervals should be as tight (informative) as possible. In this study, the use of random forests as the underlying model for regression conformal prediction is investigated and compared to existing state-of-the-art techniques, which are based on neural networks and k-nearest neighbors. In addition to their robust predictive performance, random forests allow for determining the size of the prediction intervals by using out-of-bag estimates instead of requiring a separate calibration set. An extensive empirical investigation, using 33 publicly available data sets, was undertaken to compare the use of random forests to existing stateof- the-art conformal predictors. The results show that the suggested approach, on almost all confidence levels and using both standard and normalized nonconformity functions, produced significantly more efficient conformal predictors than the existing alternatives.
Place, publisher, year, edition, pages
Springer, 2014. Vol. 97, no 1-2, p. 155-176
Keywords [en]
Conformal prediction, Random forests, Regression, Machine learning, Data mining
National Category
Computer Sciences Computer and Information Sciences
Identifiers
URN: urn:nbn:se:hj:diva-38084DOI: 10.1007/s10994-014-5453-0Scopus ID: 2-s2.0-84906946396Local ID: 0;0;miljJAILOAI: oai:DiVA.org:hj-38084DiVA, id: diva2:1163349
Note
Sponsorship:
This work was supported by the Swedish Foundation for Strategic Research through the project High-Performance
Data Mining for Drug Effect Detection (IIS11-0053) and the Knowledge Foundation through the project Big
Data Analytics by Online Ensemble Learning (20120192).
2017-12-062017-12-062019-08-23Bibliographically approved