Newey–West estimator explained

A Newey–West estimator is used in statistics and econometrics to provide an estimate of the covariance matrix of the parameters of a regression-type model where the standard assumptions of regression analysis do not apply.[1] It was devised by Whitney K. Newey and Kenneth D. West in 1987, although there are a number of later variants.[2] [3] [4] [5] The estimator is used to try to overcome autocorrelation (also called serial correlation), and heteroskedasticity in the error terms in the models, often for regressions applied to time series data. The abbreviation "HAC," sometimes used for the estimator, stands for "heteroskedasticity and autocorrelation consistent."[2] There are a number of HAC estimators described in, and HAC estimator does not refer uniquely to Newey–West. One version of Newey–West Bartlett requires the user to specify the bandwidth and usage of the Bartlett kernel from Kernel density estimation

Regression models estimated with time series data often exhibit autocorrelation; that is, the error terms are correlated over time. The heteroscedastic consistent estimator of the error covariance is constructed from a term

X\operatorname{T

}\Sigma X, where

X

is the design matrix for the regression problem and

\Sigma

is the covariance matrix of the residuals. The least squares estimator

b

is a consistent estimator of

\beta

. This implies that the least squares residuals

ei

are "point-wise" consistent estimators of their population counterparts

Ei

. The general approach, then, will be to use

X

and

e

to devise an estimator of

X\operatorname{T

}\Sigma X.[6] This means that as the time between error terms increases, the correlation between the error terms decreases. The estimator thus can be used to improve the ordinary least squares (OLS) regression when the residuals are heteroscedastic and/or autocorrelated.

X\operatorname{T

}\Sigma X=\frac \sum^T_ e_t^2 x_t x^_t + \frac \sum^L_ \sum^T_ w_\ell e_t e_(x_t x^_ + x_ x^_t)

w\ell=1-

\ell
L+1

where T is the sample size,

et

is the

tth

residual and

xt

is the

tth

row of the design matrix, and

w\ell

is the Bartlett kernel [7] and can be thought of as a weight that decreases with increasing separation between samples. Disturbances that are farther apart from each other are given lower weight, while those with equal subscripts are given a weight of 1. This ensures that second term converges (in some appropriate sense) to a finite matrix. This weighting scheme also ensures that the resulting covariance matrix is positive semi-definite.[2] L = 0 reduces the Newey–West estimator to Huber–White standard error.[8] L specifies the "maximum lag considered for the control of autocorrelation. A common choice for L" is

T1/4

.[9]

Software implementations

In Julia, the CovarianceMatrices.jl package [10] supports several types of heteroskedasticity and autocorrelation consistent covariance matrix estimation including Newey–West, White, and Arellano.

In R, the packages sandwich[11] and plm[12] include a function for the Newey–West estimator.

In Stata, the command newey produces Newey–West standard errors for coefficients estimated by OLS regression.[13]

In MATLAB, the command hac in the Econometrics toolbox produces the Newey–West estimator (among others).[14]

In Python, the statsmodels[15] module includes functions for the covariance matrix using Newey–West.

In Gretl, the option --robust to several estimation commands (such as ols) in the context of a time-series dataset produces Newey–West standard errors.[16]

In SAS, the Newey–West corrected standard errors can be obtained in PROC AUTOREG and PROC MODEL [17]

See also

Further reading

Notes and References

  1. Web site: Newey West estimator – Quantitative Finance Collector. 2009-05-18. https://web.archive.org/web/20180624175743/http://www.mathfinance.cn/newey-west-estimator/. 2018-06-24.
  2. 10.2307/1913610 . Newey . Whitney K . West . Kenneth D . 1987 . A Simple, Positive Semi-definite, Heteroscedasticity and Autocorrelation Consistent Covariance Matrix . 1913610. Econometrica . 55 . 3. 703–708 .
  3. 10.2307/2938229 . Andrews . Donald W. K. . Donald Andrews . 1991 . Heteroskedasticity and autocorrelation consistent covariance matrix estimation . 2938229. Econometrica . 59 . 3. 817–858 .
  4. 10.2307/2297912 . Newey . Whitney K. . West . Kenneth D. . 1994 . Automatic lag selection in covariance matrix estimation . 2297912. Review of Economic Studies . 61 . 4. 631–654 .
  5. Smith . Richard J. . 2005 . Automatic positive semidefinite HAC covariance matrix and GMM estimation . Econometric Theory . 21 . 1. 158–170 . 10.1017/S0266466605050103 .
  6. Book: Greene, William H. . 1997 . Econometric Analysis . registration . 3rd .
  7. Web site: time series – Bartlett Kernel (Newey West Covariance Matrix) . 2022-09-15 . Cross Validated . en.
  8. Web site: Verallgemeinerte Kleinst-Quadrate-Schätzung . Generalized Least Squares estimation . www.uni-kassel.de. Uni-Kassel. 2023-09-21.
  9. Book: Greene, William H. . Econometric analysis . 2012 . Pearson . 978-0-273-75356-8 . 7th . Boston . 726074601.
  10. Web site: CovarianceMatrices.jl package .
  11. Web site: sandwich: Robust Covariance Matrix Estimators . CRAN .
  12. Web site: plm: Linear Models for Panel Data . CRAN .
  13. Web site: Regression with Newey–West standard errors . Stata Manual .
  14. Web site: Heteroscedasticity and autocorrelation consistent covariance estimators . Econometrics Toolbox .
  15. Web site: statsmodels: Statistics . statsmodels .
  16. Web site: Robust covariance matrix estimation . Gretl User's Guide, chapter 22 .
  17. Web site: Usage Note 40098: Newey–West correction of standard errors for heteroscedasticity and autocorrelation.