The regression equation that we want to estimate is
where is the
intercept of the regression line
is the
slope
is a random disturbance
We calculate the following
n=3 is the sample size
sample means of X and Y are
The sum pf squares
14. The correlation coefficient is
ans: D. 0.5
15. The estimate of slope coefficient is
The estimate of intercept is
The estimated least square line is
ans: C. y=1+0.5x
16. We have already calculated the total sum of squares as
ans: D) 2
17. The regression sum of squares is
ans: A) 0.5
18. The coefficient of determination is
ans: A) 1/4
19. The sum of square Error is
The degrees of freedom of error is n-2=3-2=1
Hence the mean square error is
The estimate of error variance is
ans: C) 1.5
20. The hypotheses that needs to be tested are
We know the following
is the estimate of slope
is the standard error of slope coefficient
is the hypothesized value of the slope coefficient (from the null
hypothesis)
The test statistics is
The degrees of freedom for t statistics is n-2 =3-2=1
This is a 2 tailed test. From the t table for df=1, and level of significance alpha=0.05 (the area under both the tails) we get the critical value = 12.706. That means the acceptance region is -12.706 to +12.706.
We will reject the null hypothesis if the test statistics lies outside the acceptance region.
Here the test statistics is 0.577 and it lies within -12.706 to +12.706. Hence we do not reject the null hypothesis.
We conclude that there is no sufficient evidence to support the claim that there is significant linear relationship between X and Y.
ans: B) Do not reject null
21. Without calculating we know that the prediction interval for
the future Y value is wider than the confidence interval for
average Y value. Here we see that (-6.985,10.985) has a width of
(10.985-(-6.985)) = 17.97. The width of (-15.969,19.969) is
(19.969-(-15.969)) = 35.938
Hence we can say that the prediction interval is
(-15.969,19.969)
ans: B) (-15.969,19.969)
Calculations:
The predicted value of Y for X=2 is
95% confidence interval indicates a significance level of
. The degrees of freedom for t is n-2=3-2=1. Using the t table for
df=1 and area under right tail = 0.025 or area under both the
tails=0.05 we get the critical value as
The 95% confidence interval for the average Y value is
The 95% prediction interval for the future Y value is
The following information applies to Problems 14-21. Sup pose an experiment yielded Y values of 1,...
1. The following is data obtained from an experiment, Do this by hand and using MATLAE 2.5 a. Determine the equation ofthe least squares line of the form y-As + Ax b. Determine the significance of the fit What is the R2? c. Determine the confidence intervals for the βο and A. Also use bootstrap analysis to determine the confidence intervals of the regression coefficient. d. Test the hypothesis that B 5 e. What are the mean response corresponding tox...
The β 1 term indicates a. the Y value for a given value of X. b. the average change in Y for a unit change in X. c. the Y value when X equals zero. d. the change in observed X for a given change in Y. What does regression analysis attempt to establish? a. linearity in the relationship between independent variables b. a mathematical relationship between a dependent variable, for which future values will be forecast, and one or...
Please solve the problems (d),(e),(f),(g)
3. (20 pts) Do not use a computer to complete the answers for this problem. Show your work in all problems. Consider the following data: 2 -2 2 1 0 4 0 2 10 0 -2 8 Consider the multiple regression model where ci ~ iid N(0, σ*) for i 1, 2, 3, 4, 5. (a) Fill in the values for response vector, design matrix and coefficient vector: X- (b) Find the vector of least...
(Do this problem without using R) Consider the simple linear regression model y =β0 + β1x + ε, where the errors are independent and normally distributed, with mean zero and constant variance σ2. Suppose we observe 4 observations x = (1, 1, −1, −1) and y = (5, 3, 4, 0). (a) Fit the simple linear regression model to this data and report the fitted regression line. (b) Carry out a test of hypotheses using α = 0.05 to determine...
Use the LTI (Linear Time-Invariant) system equation shown on the left for Q#1 1-14. This is a regression system model representing the outputs as linear combination of previous samples of the output. The delay of one sample is represented by z'. YH z1 0.2 | | | If the system is causal, withy(0) = K, then y(-1) should be 12 The output value y(1) should be 13 The coefficient for the term containing the sample y[k-2]is (a) 1 (b)-0.25*K (c)...
0 2 10 0 2 8 Consider the multiple regression model where є¡ ~ iid Ņ(0, σ*) for i = i, 2, 3, 4, 5. (c) Fill in the values for the following ANOVA table: Source of Variation Sum of Squares df Mean Square F Regression on Xi, X2 Error Total (Corrected) (d) State the nul and alternative hypotheses associated with the F test from the ANOVA table in part (c) and do the F test (e) Compute R2 (f)...
The following output is a regression of hourly EARNINGS on HEIGHT. a. Create a 99% confidence interval for the slope coefficient B1 and interpret it. b. Is the slope coefficient on HEIGHT statistically significant at a level of significance of 1%? c. Years of schooling S also affects EARNINGS but has not been added to the regression. Where does the effect of Soccur in this regression output? SUMMARY OUTPUT Regression Statistics Multiple R 0.241124399 R Square 0.058140976 Adjusted R Square...
Part A Run a regression on the following dataset. What proportion of the variation in Y is explained by the regression? Also, what is the standard error of the estimate? X Y 262 14,041 319 16,953 361 18,984 381 19,870 405 20,953 439 22,538 472 23,985 508 25,641 547 27,365 592 29,967 a. 99.95%, 24,446.06 b. 99.95%; 0.5042 c. 99.91%; 195,568.50 d. 99.91%; 156.35 e. None of the above Part B Which of the following statements is correct? a. Based...
Q.30 Run a regression on the following dataset. What proportion of the variation in Y is explained by the regression? Also, what is the standard error of the estimate? X Y 262 14,041 319 16,953 361 18,984 381 19,870 405 20,953 439 22,538 472 23,985 508 25,641 547 27,365 592 29,967 a. 99.95%, 24,446.06 b. 99.95%; 0.5042 c. 99.91%; 195,568.50 d. 99.91%; 156.35 e. None of the above Which of the following statements is correct? a. Based on the F...
Part A A marketing manager wishes to determine whether a relationship exists between the number of cereal boxes (Y) sold at the company's retail store and the number of clerks on duty (X) per day. The sample data below will be used to examine this empirical problem. Calculate the slope of the regression line. X Y 3 4 2 4 2 2 1 2 a. 0 b. 1 c. 2 d. None of the above Part 2 Calculate the Y-intercept...