Define and describe the nature of dummy variables. Include an explanation of how the number of dummy variables included in the model depends on the number of characteristics of the variable.
Describe a regression model that includes an intercept-differential variable.
Describe a regression model that includes a slope-differential variable.
Describe how can a dummy variable can be used in the second assignment’s regression model to determine if the difference in math scores differs significantly between men and women?
Second assignment Data:
Table 1. Grade Point Average (Y) and Family Income (X) in Thousands of Dollars
n |
Y |
X |
|
1 |
4.0 |
21.0 |
|
2 |
3.0 |
15.0 |
|
3 |
3.5 |
15.0 |
|
4 |
2.0 |
9.0 |
|
5 |
3.0 |
12.0 |
|
6 |
3.5 |
18.0 |
|
7 |
2.5 |
6.0 |
|
8 |
2.5 |
12.0 |
Multiple questions has been posted. As HOMEWORKLIB RULES's policy, answering the first question below:
It shall be noted that dummy variables are those variable that takes values 0 and 1 only and they are created for a categorical variable , one dummy variable per value the categorical variable takes.
The value of 1 taken by dummy variable shows the presence of the value of the categorical variable, whereas, the value 0 shows the absence of that value that categorical variable takes.
The dummy variables can act as an explanatory variable in regression analysis.
Categorical variable | D1 | D2 | D3 | D4 | D5 |
1 | 1 | 0 | 0 | 0 | 0 |
2 | 0 | 1 | 0 | 0 | 0 |
3 | 0 | 0 | 1 | 0 | 0 |
4 | 0 | 0 | 0 | 1 | 0 |
5 | 0 | 0 | 0 | 0 | 1 |
In regression model with intercept, Dummy Variables that could be included as explanatory variables is always 1 less than the number of dummy variables created per categorical variable to avoid the problem of dummy trap.
It would not make sense to pass the categorical variable as an explanatory variable in the regression model with intercept.
If there are 5 values (or characteristics) of a categorical variable and there are 5 dummy variables created, one each of the values taken by categorical variable, then in the regression model with intercept, instead of passing the categorical variable as independent explanatory variable, use any four dummy variables, while keeping one dummy variable as the constant.
For example:
Yi = b0 + b1*X1 + b2*X2 + b3*D1 + b4*D2 +b5*D3 + b6*D4
Thus, in the above regression model, b0 is the intercept, X1 and X2 are continuous explanatory variables and D1, D2, D3 and D4 are the dummy variables
The dummy variable D5 is observed as the base
It shall be noted that D1, D2, D3, D4 and D5 are the dummy variables corresponding to the given categorical variable.
Define and describe the nature of dummy variables. Include an explanation of how the number of...
SnowGeese File:
Trial Diet WtChange
DigEff ADFiber
1 Plants -6.0 0.0
28.5
2 Plants -5.0 2.5
27.5
3 Plants -4.5 5.0
27.5
4 Plants 0.0 0.0
32.5
5 Plants 2.0 0.0
32.0
6 Plants 3.5 1.0
30.0
7 Plants -2.0 2.5
34.0
8 Plants -2.5 10.0
36.5
9 Plants -3.5 20.0
28.5
10 Plants -2.5 12.5
29.0
11 Plants -3.0 28.0
28.0
12 Plants -8.5 30.0
28.0
13 Plants -3.5 18.0
30.0
14 Plants -3.0 15.0
31.0
15 Plants -2.5 ...
The table below gives the number of hours ten randomly selected students spent studying and their corresponding midterm exam grades. Using this data, consider the equation of the regression line, y = bo + b1x, for predicting the midterm exam grade that a student will earn based on the number of hours spent studying. Keep in mind, the correlation coefficient may or may not be statistically significant for the data given. Remember, in practice, it would not be appropriate to...
can you answer question 9 please
Problems 473 results from parts (a), (b), and (c). What model seems most plausible? How do the data limit your conclusions? tle the data from Freund (1979), presented in Problem 22 in Chapter 14. Taking be model discussed there as the maximum model, repeat parts (a) through (h) of Problem 6. In part (h), note the possible role of collinearity. A random sample of data was collected on residential sales in a large city....
1. For each of the following regression models, write down the X matrix and 3 vector. Assume in both cases that there are four observations (a) Y BoB1X1 + B2X1X2 (b) log Y Bo B1XiB2X2+ 2. For each of the following regression models, write down the X matrix and vector. Assume in both cases that there are five observations. (a) YB1XB2X2+BXE (b) VYBoB, X,a +2 log10 X2+E regression model never reduces R2, why 3. If adding predictor variables to a...
All Greens is a franchise store that sells house plants and lawn and garden supplies. Although All Greens is a franchise, each store is owned and managed by private individuals. Some friends have asked you to go into business with them to open a new All Greens store in the suburbs of San Diego. The national franchise headquarters sent you the following information at your request. These data are about 27 All Greens stores in California. Each of the 27...
All Greens is a franchise store that sells house plants and lawn and garden supplies. Although All Greens is a franchise, each store is owned and managed by private individuals. Some friends have asked you to go into business with them to open a new All Greens store in the suburbs of San Diego. The national franchise headquarters sent you the following information at your request. These data are about 27 All Greens stores in California. Each of the 27...