Abstract
This paper concerns selection of the optimal subset of variables
in a lenear regression setting. The posed problem is
combinatiorial and the globally best subset can only be found in
exponential time. We define a cost function for the subset
selection problem by adding the penalty term to the usual least
squares criterion. We propose an optimization technique for the
posed probelm based on a modified version of the Newton-Raphson
iterations, combined with a backward elimination type algorithm.
THe Newton-Raphson modification concerns iterative approximations
to the non-convex cost function of the subset selection problem so
as to guarantee positive definiteness of the Hessian term, hence
avoiding numerical instability. The backward Elemination type
algorithm attempts to improve the results upon termination of the
modified Newton-Raphson search by sing the current solution as an
initial guess. The efficiency of the method is illustrated by a
numerical example with highly correlated explanatory variables for
which the commonly used techiques such as forward
selection/backward elimination perform poorly.
| Original language | English |
|---|---|
| Title of host publication | Symposium i Anvendt Statistik |
| Place of Publication | Copenhagen |
| Publisher | AKF - SFI SUrvey |
| Publication date | 1999 |
| Pages | 220-230 |
| Publication status | Published - 1999 |
| Event | 21st Symposium in Applied Statistics - Handelshøjskolen (CBS), Copenhagen, Denmark Duration: 25 Jan 1999 → 27 Jan 1999 Conference number: 21 |
Conference
| Conference | 21st Symposium in Applied Statistics |
|---|---|
| Number | 21 |
| Location | Handelshøjskolen (CBS) |
| Country/Territory | Denmark |
| City | Copenhagen |
| Period | 25/01/1999 → 27/01/1999 |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver