Processing math: 100%

 

 

 

Regression analysis, overarching aims

Regression modeling deals with the description of the sampling distribution of a given random variable y and how it varies as function of another variable or a set of such variables x=[x0,x1,,xn1]T. The first variable is called the dependent, the outcome or the response variable while the set of variables x is called the independent variable, or the predictor variable or the explanatory variable, or simply just the inputs.

A regression model aims at finding a likelihood function p(y|x) or in the more traditional sense a function y(x), that is the conditional distribution for y with a given x. The estimation of p(y|x) is made using a data set with

  • n cases i=0,1,2,,n1
  • Response (target, dependent or outcome) variable yi with i=0,1,2,,n1
  • p so-called explanatory (independent or predictor or feature) variables xi=[xi0,xi1,,xip1] with i=0,1,2,,n1 and explanatory variables running from 0 to p1. See below for more explicit examples.

The goal of the regression analysis is to extract/exploit relationship between y and x in order to infer specific dependencies, approximations to the likelihood functions, functional relationships and to make predictions, making fits and many other things.