Question

Demonstrate regression diagnostics, variables selection and validation.

A hospital administration wished to study the relation between patient satisfaction (Y) and patient’s age (X1), severity of illness (X2) and anxiety level. The administration randomly selected 15 patients and the results of the study are shown below.

 

The regression equation is

y = 184 - 0.976 x1 - 1.14 x2 - 12.3 x3

 

 

Predictor     Coef  SE Coef      T

Constant    184.02    31.70   5.81  

x1         -0.9763   0.4500  -2.17  

x2         -1.1361   0.9786  -1.16  

x3          -12.34    14.30  -0.86  

 

 

S = 11.0529   R-Sq = 70.2%   R-Sq(adj) = 62.0%

 

 

Analysis of Variance

 

Source          DF      SS      MS     F

Regression       3  3161.5  1053.8  8.63

Residual Error  11  1343.8   122.2

Total           14  4505.3

 

 

Obs

Y

X1

X2

X3

residual

d

r

Hii

COOK

DFFIT

1

48

50

51

2.3

-0.88

-0.09

-0.09

0.20

0.00

-0.04

2

57

36

46

2.3

-11.23

-1.14

-1.15

0.20

0.08

-0.58

3

66

40

48

2.3

3.95

0.38

0.36

0.11

0.00

0.13

4

70

41

44

1.8

-1.79

-0.20

-0.19

0.37

0.01

-0.15

5

89

28

43

1.8

3.38

0.37

0.35

0.30

0.01

0.23

6

36

49

54

2.9

-3.05

-0.36

-0.35

0.43

0.02

-0.30

7

46

42

50

2.2

-13.06

-1.24

-1.27

0.09

0.04

-0.39

8

54

45

48

2.4

-1.94

-0.20

-0.19

0.22

0.00

-0.10

9

26

52

62

2.9

-1.03

-0.13

-0.12

0.49

0.00

-0.12

10

77

29

50

2.1

4.01

0.43

0.42

0.29

0.02

0.27

11

89

29

48

2.4

17.44

1.93

2.26

0.33

0.45

DFFIT11

12

67

43

53

2.4

14.79

1.40

1.47

0.09

0.05

0.46

13

47

38

55

2.2

-10.29

-1.15

-1.17

0.35

0.18

-0.86

14

51

34

51

2.3

-13.50

-1.33

-1.38

0.15

0.08

-0.58

15

57

53

54

2.2

13.22

1.53

1.64

0.39

0.37

1.30

 

 

 

 

 

Answer the following questions.

 

a) Name 3 methods of scaling residuals.      (3 marks)

 

b) Name 2 method to measures of influential.     (2 marks)

 

c) Obtain the full model for the above problem.     (1 mark)

 

d) Test the multiple regression model for significance at the 0.05 level. (5 marks)

 

e) From the standardized residual, identify any outlier.    (2 marks)

 

f) From the hat matrix, identify any leverage.     (2 marks)

 

g) Calculate the value of DFFITS.      (4 marks)

 

h) Comment on the value in (g) in term of influence.    (1 mark)

 

 


0 0
Add a comment Improve this question Transcribed image text
Answer #2

answered by: Book Solutions
Add a comment
Know the answer?
Add Answer to:
Demonstrate regression diagnostics, variables selection and validation.
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • 3. A hospital administrator wished to study the relation between patient satisfaction (Y) and pat...

    solvel only E F G H below is pat data 3. A hospital administrator wished to study the relation between patient satisfaction (Y) and patients age (Xi, in years), severity of illness (X2, an index) and anxiety level (X3, an index). The administrator randomly selected 23 patients and collected the data in pat, where larger values of Y, X2 and X3 are, respectively, associated with more satisfaction, increased severity of illness and more anxiety. The data is saved in Moodle2...

  • question #1 A-D. Please show work. Q1. The following Regression function has been developed to check...

    question #1 A-D. Please show work. Q1. The following Regression function has been developed to check the relationship between the dependent variable y and the independent variable x1. Consider the following Minitab output and answer the questions. Regression Equation 9 = 0.86 + 0.65 x1 a) (4pt). Please fill out the Coefficients table appropriately. Coefficients Term Coef SE Coef T-Value P-Value VIF Constant (1) 1.38 16.58 0.000 X1 (ii) 0.231 7.00 0.000 1.00 b) (4pt). Please fill out the ANOVA...

  • Consider a multiple regression model of the dependent variable y on independent variables x1, X2, X3, and x4: Using data with n 60 observations for each of the variables, a student obtains the follow...

    Consider a multiple regression model of the dependent variable y on independent variables x1, X2, X3, and x4: Using data with n 60 observations for each of the variables, a student obtains the following estimated regression equation for the model given: y0.35 0.58x1 + 0.45x2-0.25x3 - 0.10x4 He would like to conduct significance tests for a multiple regression relationship. He uses the F test to determine whether a significant relationship exists between the dependent variable and He uses the t...

  • Q1. The following Regression function has been developed to check the relationship between the dependent variable...

    Q1. The following Regression function has been developed to check the relationship between the dependent variable y and the independent variable xz. Consider the following Minitab output and answer the questions. Regression Equation 9 = 0.86 + 0.65 x a) (Apt). Please fill out the Coefficients table appropriately. Coefficients Term Coef SE Coef T-Value P-Value VIF Constant 0 1.38 16.58 0.000 X1 0.231 7.00 0.000 1.00 b) (4pt). Please fill out the ANOVA table appropriately. Analysis of Variance Source DF...

  • The accompanying computer excel output (please see attachment on Blackboard) provides details of data and analysis...

    The accompanying computer excel output (please see attachment on Blackboard) provides details of data and analysis on the topic of “patient satisfaction”. A hospital administrator wished to study the relation between patient satisfaction (Y) and patients’ age (X1, in years) , severity of illness (X2), and anxiety level (X3). The output includes a listing of all 46 observations; descriptive statistics for each variable; and, the results of a regression analysis that uses patient satisfaction as the dependent variable, and patient’s...

  • Please calculate the chemical shift for Ha using the "Aromatic Proton Shift Calculation" Table. Ha H3C02C...

    Please calculate the chemical shift for Ha using the "Aromatic Proton Shift Calculation" Table. Ha H3C02C CN MezN Hb Hc Answer: Fr Jump to... CHEM 308 Class 15.04.2024 CHEM 308 AROMATIC PROTONS CHEMICAL SHIFT CALCULATION SHEET H Zomo DAH = 7.36 + Zorme + Zmeta + Zpara Z mets Zpara Zi for R (ppm) Substituent R Zortho Zmeta Zpara Zmet Zpara H CH, 0.0 -0.18 0.02 0.02 -0.07 C(CH3) CHCI CH,OH 0.0 -0.11 -0.08 -0.01 -0.07 Zi for R (ppm)...

  • Use Table 8.1, a computer, or a calculator to answer the following. Suppose a candidate for...

    Use Table 8.1, a computer, or a calculator to answer the following. Suppose a candidate for public office is favored by only 47% of the voters. If a sample survey randomly selects 2,500 voters, the percentage in the sample who favor the candidate can be thought of as a measurement from a normal curve with a mean of 47% and a standard deviation of 1%. Based on this information, how often (as a %) would such a survey show that...

  • Assume that vou build a multiple linear regression model using three independent predictor variables, You initialy...

    Assume that vou build a multiple linear regression model using three independent predictor variables, You initialy determine that X1 should be included in the model. You then test the other model combinations that indlude X1. Based on the results below, Which model would you choose and why? Std. Adj R Error Model R2 X1 0.85 0.841 2969.57 X1+X2 0,852 0.834 |3031.69 0.888 2491.95 0.902 0.884 2539.88 X1+X3 0.9 Full X1+X2. It has the lowest Adj R and the highest Standard...

  • a. What is the Sample Regression Equation? b. Which of the independent variables are significant? Why?...

    a. What is the Sample Regression Equation? b. Which of the independent variables are significant? Why? Use α = 0.10 in ALL questions. Use only the p-value to explain your answers (No need to go to tables). c. Test the overall significance of the model by relying on F Statistic. d. What is the value of adjusted r-square? e. Comment on the Normality assumption for the residuals for this model. In other word, has the normality assumption been satisfied? Explain...

  • The following results were obtained from an undrained shear box test carried out on a set...

    The following results were obtained from an undrained shear box test carried out on a set of undisturbed soil samples. 0.2 0.8 Normal Load (N) Strain (%) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 0 21 46 70 89 107 121 131 136 138 138 137 136 0.4 Shearing force (N) 0 33 72 110 139 164 180 192 201 210 217 224 230 234 237 236 0 45...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT