We have explored the use of logistic regression for when the dependent variable has two classes,...

Question

Question

We have explored the use of logistic regression for when the dependent variable has two classes,...

We have explored the use of logistic regression for when the dependent variable has two classes, Yes or No, Admit or Not, etc. Suppose that there are m (>2) classes. For example, Buy, Sell, or Hold. How will you model this using logistic regression? You don't have to solve it, but provide a strategy to model this.

math Statistics-And-Probability

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

ogistic regression is generally used where the dependent variable is Binary or Dichotomous. That means the dependent variable can take only two possible values such as “Yes or No”, “Default or No Default”, “Living or Dead”, “Responder or Non Responder”, “Yes or No” etc. Independent factors or variables can be categorical or numerical variables.

Please note that even though logistic (logit) regression is frequently used for binary variables (2 classes), it can be used for categorical dependent variables with more than 2 classes. In this case it’s called Multinomial Logistic Regression.

Here we will focus on Logistic Regression with binary dependent variables as it is most commonly used.

Applications of Logistic Regression-

Logistic regression is used for prediction of output which is binary, as stated above. For example, if a credit card company is going to build a model to decide whether to issue a credit card to a customer or not, it will model for whether the customer is going to “Default” or “Not Default” on this credit card. This is called “Default Propensity Modeling” in banking lingo.

Similarly an ecommerce company that is sending out costly advertisement / promotional offer mails to customers, will like to know whether a particular customer is likely to respond to the offer or not. In Other words, whether a customer will be “Responder” or “Non Responder”. This is called “Propensity to Respond Modeling”

Using insights generated from the logistic regression output, companies may optimize their business strategies to achieve their business goals such as minimize expenses or losses, maximize return on investment (ROI) in marketing campaigns etc.

Underlying Algorithm and Assumptions

The underlying algorithm of Maximum Likelihood Estimation (MLE) determines the regression coefficient for the model that accurately predicts the probability of the binary dependent variable. The algorithm stops when the convergence criterion is met or maximum number of iterations are reached. Since the probability of any event lies between 0 and 1 (or 0% to 100%), when we plot the probability of dependent variable by independent factors, it will demonstrate an ‘S’ shape curve.

Let’s take an example- here we are predicting the probability of a given candidate to get admission in a school of his or her choice by the score candidates receives in the admission test. Since the dependent variable is binary/dichotomous- “Admission “or “No Admission”, we can use a logistic regression model to predict the probability of getting the “Admission”. Let’s first plot the data and analyse the shape to confirm that this is following an ‘S’ shape.

main-qimg-e629686458444e7d077b4cd2886a05

Since the relationship between the Score and Probability of Selection is not linear but shows an ‘S’ shape, we can’t use a linear model to predict probability of selection by a score. We need to do Logit transformation of the dependent variable to make the correlation between the predictor and dependent variable linear.

Logit Transformation is defined as follows-

Logit = Log (p/1-p) = log (probability of event happening/ probability of event not happening) = log (Odds)

main-qimg-b5b692c33021912b96bdda8533ddf6

Now we can model using regression to predict the probability of a certain outcome of the dependent variable. The regression equation that the model will try to come out is-

Log (p/1-p) = b0+ b1*x1+b2*x2+ e

Where b0 is the Y intercept, e is the error in the model, b1 is the coefficient (slope) for independent factor x1, and b2 is the coefficient (slope) for independent factor x2 and so on…

In the above example, the regression equation will look like this-

Log (p/1-p) = b0 + b1*Score+ e

The model will generate the coefficients b0 and b1 that gives us the best model in terms of key metrics that we will be discussing later.

Add a comment

Answer 2

We have explored the use of logistic regression for when the dependent variable has two classes,...

Homework Answers

Add Answer to:
We have explored the use of logistic regression for when the dependent variable has two classes,...

Post as a guest

Earn Coins

When evaluating a multiple regression model, for example when we regress dependent variable Y on two...

Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when...

Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when...

Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when...

Suppose we have the following values for a dependent variable, Y, and three independent variables, X1,...

Let the random variable Y represent hourly wages and the random variable X represent education. Suppose we have the...

Suppose this model turns out to be our best model when working with apartments: Least Squares...

Please use DataAnalysis A study was conducted to build a regression model to predict miles per gallon (MPG) of vehicles...

2.4 We have defined the simple linear regression model to be y =B1 + B2x+e. Suppose...

Need help with stats true or false questions Decide (with short explanations) whether the following statements are true or false a) We consider the model y-Ao +A(z) +E. Let (-0.01, 1.5) be a 95% con...

We have explored the use of logistic regression for when the dependent variable has two classes,...

Homework Answers

Add Answer to: We have explored the use of logistic regression for when the dependent variable has two classes,...

Post as a guest

Earn Coins

Add Answer to:
We have explored the use of logistic regression for when the dependent variable has two classes,...