Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Interpretable regression trees using conformal prediction
Jönköping University, School of Engineering, JTH, Computer Science and Informatics. Jönköping University, School of Engineering, JTH. Research area Computer Science and Informatics. Department of Information Technology, University of Borås, Sweden.ORCID iD: 0000-0003-0412-6199
Department of Information Technology, University of Borås, Sweden.
Jönköping University, School of Engineering, JTH. Research area Computer Science and Informatics. Department of Information Technology, University of Borås, Sweden.ORCID iD: 0000-0003-0274-9026
School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Sweden.
2018 (English)In: Expert systems with applications, ISSN 0957-4174, E-ISSN 1873-6793, Vol. 97, p. 394-404Article in journal (Refereed) Published
Abstract [en]

A key property of conformal predictors is that they are valid, i.e., their error rate on novel data is bounded by a preset level of confidence. For regression, this is achieved by turning the point predictions of the underlying model into prediction intervals. Thus, the most important performance metric for evaluating conformal regressors is not the error rate, but the size of the prediction intervals, where models generating smaller (more informative) intervals are said to be more efficient. State-of-the-art conformal regressors typically utilize two separate predictive models: the underlying model providing the center point of each prediction interval, and a normalization model used to scale each prediction interval according to the estimated level of difficulty for each test instance. When using a regression tree as the underlying model, this approach may cause test instances falling into a specific leaf to receive different prediction intervals. This clearly deteriorates the interpretability of a conformal regression tree compared to a standard regression tree, since the path from the root to a leaf can no longer be translated into a rule explaining all predictions in that leaf. In fact, the model cannot even be interpreted on its own, i.e., without reference to the corresponding normalization model. Current practice effectively presents two options for constructing conformal regression trees: to employ a (global) normalization model, and thereby sacrifice interpretability; or to avoid normalization, and thereby sacrifice both efficiency and individualized predictions. In this paper, two additional approaches are considered, both employing local normalization: the first approach estimates the difficulty by the standard deviation of the target values in each leaf, while the second approach employs Mondrian conformal prediction, which results in regression trees where each rule (path from root node to leaf node) is independently valid. An empirical evaluation shows that the first approach is as efficient as current state-of-the-art approaches, thus eliminating the efficiency vs. interpretability trade-off present in existing methods. Moreover, it is shown that if a validity guarantee is required for each single rule, as provided by the Mondrian approach, a penalty with respect to efficiency has to be paid, but it is only substantial at very high confidence levels.

Place, publisher, year, edition, pages
Elsevier, 2018. Vol. 97, p. 394-404
Keyword [en]
Conformal prediction, Interpretability, Predictive regression, Regression trees, Economic and social effects, Efficiency, Forestry, Query processing, Regression analysis, Conformal predictions, Conformal predictors, Empirical evaluations, Level of difficulties, State-of-the-art approach, Forecasting
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:hj:diva-38624DOI: 10.1016/j.eswa.2017.12.041ISI: 000425074100030Scopus ID: 2-s2.0-85040125577OAI: oai:DiVA.org:hj-38624DiVA, id: diva2:1176515
Available from: 2018-01-22 Created: 2018-01-22 Last updated: 2018-03-20Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records BETA

Johansson, UlfLöfström, Tuve

Search in DiVA

By author/editor
Johansson, UlfLöfström, Tuve
By organisation
JTH, Computer Science and InformaticsJTH. Research area Computer Science and Informatics
In the same journal
Expert systems with applications
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 52 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf