Question

We have explored the use of logistic regression for when the dependent variable has two classes,...

We have explored the use of logistic regression for when the dependent variable has two classes, Yes or No, Admit or Not, etc. Suppose that there are m (>2) classes. For example, Buy, Sell, or Hold. How will you model this using logistic regression? You don't have to solve it, but provide a strategy to model this.

0 0
Add a comment Improve this question Transcribed image text
Answer #1

ogistic regression is generally used where the dependent variable is Binary or Dichotomous. That means the dependent variable can take only two possible values such as “Yes or No”, “Default or No Default”, “Living or Dead”, “Responder or Non Responder”, “Yes or No” etc. Independent factors or variables can be categorical or numerical variables.

Please note that even though logistic (logit) regression is frequently used for binary variables (2 classes), it can be used for categorical dependent variables with more than 2 classes. In this case it’s called Multinomial Logistic Regression.

Here we will focus on Logistic Regression with binary dependent variables as it is most commonly used.

Applications of Logistic Regression-

Logistic regression is used for prediction of output which is binary, as stated above. For example, if a credit card company is going to build a model to decide whether to issue a credit card to a customer or not, it will model for whether the customer is going to “Default” or “Not Default” on this credit card. This is called “Default Propensity Modeling” in banking lingo.

Similarly an ecommerce company that is sending out costly advertisement / promotional offer mails to customers, will like to know whether a particular customer is likely to respond to the offer or not. In Other words, whether a customer will be “Responder” or “Non Responder”. This is called “Propensity to Respond Modeling”

Using insights generated from the logistic regression output, companies may optimize their business strategies to achieve their business goals such as minimize expenses or losses, maximize return on investment (ROI) in marketing campaigns etc.

Underlying Algorithm and Assumptions

The underlying algorithm of Maximum Likelihood Estimation (MLE) determines the regression coefficient for the model that accurately predicts the probability of the binary dependent variable. The algorithm stops when the convergence criterion is met or maximum number of iterations are reached. Since the probability of any event lies between 0 and 1 (or 0% to 100%), when we plot the probability of dependent variable by independent factors, it will demonstrate an ‘S’ shape curve.

Let’s take an example- here we are predicting the probability of a given candidate to get admission in a school of his or her choice by the score candidates receives in the admission test. Since the dependent variable is binary/dichotomous- “Admission “or “No Admission”, we can use a logistic regression model to predict the probability of getting the “Admission”. Let’s first plot the data and analyse the shape to confirm that this is following an ‘S’ shape.

main-qimg-e629686458444e7d077b4cd2886a05

Since the relationship between the Score and Probability of Selection is not linear but shows an ‘S’ shape, we can’t use a linear model to predict probability of selection by a score. We need to do Logit transformation of the dependent variable to make the correlation between the predictor and dependent variable linear.

Logit Transformation is defined as follows-

Logit = Log (p/1-p) = log (probability of event happening/ probability of event not happening) = log (Odds)

main-qimg-b5b692c33021912b96bdda8533ddf6

Now we can model using regression to predict the probability of a certain outcome of the dependent variable. The regression equation that the model will try to come out is-

Log (p/1-p) = b0+ b1*x1+b2*x2+ e

Where b0 is the Y intercept, e is the error in the model, b1 is the coefficient (slope) for independent factor x1, and b2 is the coefficient (slope) for independent factor x2 and so on…

In the above example, the regression equation will look like this-

Log (p/1-p) = b0 + b1*Score+ e

The model will generate the coefficients b0 and b1 that gives us the best model in terms of key metrics that we will be discussing later.

Add a comment
Know the answer?
Add Answer to:
We have explored the use of logistic regression for when the dependent variable has two classes,...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • When evaluating a multiple regression model, for example when we regress dependent variable Y on two...

    When evaluating a multiple regression model, for example when we regress dependent variable Y on two independent variables X1 and X2, a commonly used goodness of fit measure is: A. Correlation between Y and X1 B. Correlation between Y and X2 C. Correlation between X1 and X2 D. Adjusted-R2 E. None of the above

  • Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when...

    Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when there is no intercept in the model: There is no intercept zo in the model. Suppose we have n different samples. Then answer the following questions: (a) Write the design matrix X for our model, using the subscript notation we introduce in class.(10 pts) (b) Write the explicit solution of βί and ß2, in terms of Σ'al ril, Ση! x2, Σ¡al 2ilxi2Σ aily, and...

  • Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when...

    Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when there is no intercept in the model: There is no intercept x0 in the model. Suppose we have n different samples. Then answer the following questions: (b) Write the explícit solution of βι and函, in terms of Ση 1 гг, Σί.1 za, Σ-1 zazi2Σ㈡ raVi and Σ-1Equ.(30 pts) (Hint: You can refer to the SLR example in slides, they have similar idea)

  • Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when...

    Theoretical questions: Regression without intercept(40 pts) In this question, we consider a two-variable regression model when there is no intercept in the model: There is no intercept x0 in the model. Suppose we have n different samples. Then answer the following questions: (a) Write the design matrix X for our model, using the subscript notation we introduce in class.(10 pts)

  • Suppose we have the following values for a dependent variable, Y, and three independent variables, X1,...

    Suppose we have the following values for a dependent variable, Y, and three independent variables, X1, X2, and X3. The variable X3 is a dummy variable where 1 = male and 2 = female: X1       X2       X3       Y 0          40        1          30 0          50        0          10 2          20        0          40 2          50        1          50 4          90        0          60 4          60        0          70 4          70        1          80 4          40        1          90 6          40        0          70 6          50        1          90 8          80       ...

  • Let the random variable Y represent hourly wages and the random variable X represent education. Suppose we have the...

    Let the random variable Y represent hourly wages and the random variable X represent education. Suppose we have the following regression equation in mind to estimate the return to education: (a) Can we say that this regression would capture the causal effect of education on wages? Support your answer with reasoning. (b) Using the sample equivalent of the two equations E(u)-0 and E(uX)-0 derive the regression estimators for A, and β1-Write down each mathe tnatical step, what would be the...

  • Suppose this model turns out to be our best model when working with apartments: Least Squares...

    Suppose this model turns out to be our best model when working with apartments: Least Squares Linear Regression of Price Predictor Variables Constant Size X1sq Location Coefficient Std Error T P VIF 1123.55 1404.09 0.80 0.4277 0.0 -0.51803 2.45842 -0.21 0.8348 68.6 7.678E-04 1.082E-03 1.71 0.0416 68.8 -232.236 101.717 -2.28 0.0271 1.1 In determining this was the best model, was it necessary to test the size variable using a t-test? Maybe. It depends on what happens when we tested the...

  • Please use DataAnalysis A study was conducted to build a regression model to predict miles per gallon (MPG) of vehicles...

    Please use DataAnalysis A study was conducted to build a regression model to predict miles per gallon (MPG) of vehicles. To develop the model, you obtained MPG of 43 random vehicles. In addition, you collected the following information - Length: vehicle length (inches) - Width: vehicle width (inches) - Weight: vehicle weight (pounds) - Made in Japan: whether the car is manufactured in Japan or not a. Fit a multiple regression model using all four independent variables. For "made in...

  • 2.4 We have defined the simple linear regression model to be y =B1 + B2x+e. Suppose...

    2.4 We have defined the simple linear regression model to be y =B1 + B2x+e. Suppose however that we knew, for a fact, that ßı = 0. (a) What does the linear regression model look like, algebraically, if ßı = 0? (b) What does the linear regression model look like, graphically, if ßı = 0? (c) If Bi=0 the least squares "sum of squares" function becomes S(R2) = Gyi - B2x;)?. Using the data, x 1 2 3 4 5...

  • Need help with stats true or false questions Decide (with short explanations) whether the following statements are true or false a) We consider the model y-Ao +A(z) +E. Let (-0.01, 1.5) be a 95% con...

    Need help with stats true or false questions Decide (with short explanations) whether the following statements are true or false a) We consider the model y-Ao +A(z) +E. Let (-0.01, 1.5) be a 95% confidence interval for A In this case, a t-test with significance level 1% rejects the null hypothesis Ho : A-0 against a two sided alternative. b) Complicated models with a lot of parameters are better for prediction then simple models with just a few parameters c)...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT