Additive model

In statistics, an additive model (AM) is a nonparametric regression method. It was suggested by Jerome H. Friedman and Werner Stuetzle (1981)[1] and is an essential part of the ACE algorithm. The AM uses a one-dimensional smoother to build a restricted class of nonparametric regression models. Because each component function is estimated in one dimension, the AM is less affected by the curse of dimensionality than, for example, a p-dimensional smoother. Furthermore, the AM is more flexible than a standard linear model, while being more interpretable than a general regression surface, at the cost of approximation error when the true regression function is not additive. As with many other machine-learning methods, problems with the AM include model selection, overfitting, and multicollinearity.

Description

Given a data set $\{y_i, x_{i1}, \ldots, x_{ip}\}_{i=1}^{n}$ of $n$ statistical units, where $\{x_{i1}, \ldots, x_{ip}\}_{i=1}^{n}$ represent predictors and $y_i$ is the outcome, the additive model takes the form

$$E[y_i \mid x_{i1}, \ldots, x_{ip}] = \beta_0 + \sum_{j=1}^{p} f_j(x_{ij})$$

or

$$Y = \beta_0 + \sum_{j=1}^{p} f_j(X_j) + \varepsilon$$

where $E[\varepsilon] = 0$, $\operatorname{Var}(\varepsilon) = \sigma^2$ and $E[f_j(X_j)] = 0$. The functions $f_j(x_{ij})$ are unknown smooth functions fit from the data. Fitting the AM (i.e. the functions $f_j(x_{ij})$) can be done using the backfitting algorithm proposed by Andreas Buja, Trevor Hastie and Robert Tibshirani (1989).[2]
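
The backfitting idea can be illustrated with a minimal Python sketch. It is not taken from the cited paper: the Gaussian kernel (Nadaraya-Watson) smoother, the bandwidth, the iteration cap, the tolerance and the simulated data are all assumptions made for illustration, and the function names `kernel_smooth` and `backfit` are hypothetical. Each pass smooths the partial residuals of one predictor while holding the other component functions fixed, then re-centres the result so that the sample analogue of E[f_j(X_j)] = 0 holds.

```python
import numpy as np


def kernel_smooth(x, r, bandwidth):
    """Nadaraya-Watson smoother: Gaussian-weighted average of r at each x."""
    diffs = (x[:, None] - x[None, :]) / bandwidth
    weights = np.exp(-0.5 * diffs**2)
    return weights @ r / weights.sum(axis=1)


def backfit(X, y, bandwidth=0.3, n_iter=50, tol=1e-8):
    """Estimate beta0 and each f_j evaluated at the training points."""
    n, p = X.shape
    beta0 = y.mean()          # intercept estimate
    f = np.zeros((n, p))      # column j holds f_j(x_ij) for i = 1..n
    for _ in range(n_iter):
        f_old = f.copy()
        for j in range(p):
            # Partial residual: remove the intercept and all other components.
            partial = y - beta0 - f.sum(axis=1) + f[:, j]
            f[:, j] = kernel_smooth(X[:, j], partial, bandwidth)
            f[:, j] -= f[:, j].mean()  # keep each component centred at zero
        if np.max(np.abs(f - f_old)) < tol:
            break
    return beta0, f


# Simulated data from an additive truth (an assumption for this demo).
rng = np.random.default_rng(0)
n = 300
X = rng.uniform(-2.0, 2.0, size=(n, 2))
y = 1.0 + np.sin(2.0 * X[:, 0]) + X[:, 1] ** 2 + rng.normal(scale=0.2, size=n)

beta0, f = backfit(X, y)
fitted = beta0 + f.sum(axis=1)
residual_variance = np.mean((y - fitted) ** 2)
```

Because every component is fitted with a one-dimensional smoother, each column of `f` can be plotted against its own predictor to inspect the estimated shape of that effect. In practice the bandwidth (or the smoother itself) would be chosen by a model-selection procedure rather than fixed in advance.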

Notes and References

  1. [Friedman, J.H.]
  2. Buja, A., Hastie, T., and Tibshirani, R. (1989). "Linear Smoothers and Additive Models", The Annals of Statistics 17(2):453–555.