Multivariate adaptive regression splines

In statistics, multivariate adaptive regression splines (MARS) is a form of regression analysis introduced by Jerome H. Friedman in 1991. It is a non-parametric regression technique and can be seen as an extension of linear models that automatically models nonlinearities and interactions between variables. In statistics, multivariate adaptive regression splines (MARS) is a form of regression analysis introduced by Jerome H. Friedman in 1991. It is a non-parametric regression technique and can be seen as an extension of linear models that automatically models nonlinearities and interactions between variables. The term 'MARS' is trademarked and licensed to Salford Systems. In order to avoid trademark infringements, many open-source implementations of MARS are called 'Earth'. This section introduces MARS using a few examples. We start with a set of data: a matrix of input variables x, and a vector of the observed responses y, with a response for each row in x. For example, the data could be: Here there is only one independent variable, so the x matrix is just a single column. Given these measurements, we would like to build a model which predicts the expected y for a given x. A linear model for the above data is The hat on the y ^ {displaystyle {widehat {y}}} indicates that y ^ {displaystyle {widehat {y}}} is estimated from the data. The figure on the right shows a plot of this function: a line giving the predicted y ^ {displaystyle {widehat {y}}} versus x, with the original values of y shown as red dots. The data at the extremes of x indicates that the relationship between y and x may be non-linear (look at the red dots relative to the regression line at low and high values of x). We thus turn to MARS to automatically build a model taking into account non-linearities. MARS software constructs a model from the given x and y as follows The figure on the right shows a plot of this function: the predicted y ^ {displaystyle {widehat {y}}} versus x, with the original values of y once again shown as red dots. The predicted response is now a better fit to the original y values. MARS has automatically produced a kink in the predicted y to take into account non-linearity. The kink is produced by hinge functions. The hinge functions are the expressions starting with max {displaystyle max } (where max ( a , b ) {displaystyle max(a,b)} is a {displaystyle a} if a > b {displaystyle a>b} , else b {displaystyle b} ). Hinge functions are described in more detail below.

[ "Bayesian multivariate linear regression", "Polynomial regression", "Logistic regression", "Nonparametric regression" ]
Parent Topic
Child Topic
    No Parent Topic