a) To find Residual Sum of Squares we need to identify the alpha and beta.
X | Y | X^2 | Y^2 | XY | |
3.4 | 5 | 11.56 | 25 | 17 | |
3.8 | 6 | 14.44 | 36 | 22.8 | |
2.8 | 4 | 7.84 | 16 | 11.2 | |
3.2 | 6 | 10.24 | 36 | 19.2 | |
Sum | 13.2 | 21 | 44.08 | 113 | 70.2 |
From this table, n=4, Sum X = 13.2, Sum Y=21, Sum X^2 = 44.08, Sum Y^2 = 113, Sum XY = 70.2
beta = n SumXY - (Sum X Sum Y) / n Sum X^2 - (Sum X)^2
beta = 1.7307
alpha = Ybar - Beta Xbar
Ybar = Sum Y / n = 21/4 = 5.25
Xbar = Sum X / n = 13.2 /4 = 3.3
Now, alpha = 5.25 - (1.7307 x 3.3) = -0.46131
Thus regression equation is, Y = -0.46131 + 1.7307 X
Now RSS can be calculated using the formula,
Unfortunately Latex equation is not working due to some technical problem, I will write the formula as,
RSS = Summation {from 1 to n} (yi - (alpha + beta xi))^2
= [5 - (-0.46131 + (1.7307 * 3.4))]^2 + [6 - (-0.46131 + (1.7307 * 3.8))]^2 + [4 - (-0.46131 + (1.7307 * 2.8))]^2 + [6 - (-0.46131 + (1.7307 * 3.2))]^2
= 1.1923
c) Plot the regression line:
d) Standard Error of Slope = Sqrt (Sum (yi - y(hat))^2/ n-2 ) / Sqrt (Sum (xi - x)^2)
Which is nothing but, beta divided by the t-statistic. = 1.7307/1.347 = 1.2847
Excel Output:
SUMMARY OUTPUT | ||||||||
Regression Statistics | ||||||||
Multiple R | 0.802955069 | |||||||
R Square | 0.644736842 | |||||||
Adjusted R Square | 0.289473684 | |||||||
Standard Error | 0.973328527 | |||||||
Observations | 3 | |||||||
ANOVA | ||||||||
df | SS | MS | F | Significance F | ||||
Regression | 1 | 1.719298 | 1.719298246 | 1.814814815 | 0.406519728 | |||
Residual | 1 | 0.947368 | 0.947368421 | |||||
Total | 2 | 2.666667 | ||||||
Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | Lower 95.0% | Upper 95.0% | |
Intercept | -0.684210526 | 4.502077 | -0.151976635 | 0.903983399 | -57.88852368 | 56.520103 | -57.888524 | 56.52010263 |
1.842105263 | 1.367409 | 1.347150628 | 0.406519728 | -15.53246751 | 19.216678 | -15.532468 | 19.21667803 | |
RESIDUAL OUTPUT | PROBABILITY OUTPUT | |||||||
Observation | Predicted 5 | Residuals | Standard Residuals | Percentile | 5 | |||
1 | 6.315789474 | -0.31579 | -0.458831468 | 16.66666667 | 4 | |||
2 | 4.473684211 | -0.47368 | -0.688247202 | 50 | 6 | |||
3 | 5.210526316 | 0.789474 | 1.147078669 | 83.33333333 | 6 |
Salary GPA Instructions: 3.4 In this problem you will characterize the predictive relationship between 3.8 2.8...
QUESTION 27 Q27. A manager at a local bank analyzed the relationship between monthly salary (y, in $) and length of service (x, measured in months) for 30 employees. She estimates the model: Salary = Bo + B1 Service + ε. The following ANOVA table below shows a portion of the regression results. df SS M S F Regression 555,420 555,420 7.64 Residual 27 1,962,873 72,699 Total 28 2 ,518,293 Coefficients Standard Error t-stat p-value Intercept 784.92 322.25 2.44 0.02...
5. Confidence intervals for prediction You are a starting pitcher in the major leagues. It's January 2008, and you are in the process of negotiating your salary for the 2008 season. You hire a statistician to help you with your negotiations. She specifies the following simple linear regression model: where y2008 salary (in millions of dollars) and x-performance during the regular 2007 season Then she selects a random sample of 50 major league starting pitchers with signed salary agreements for...
A manager at a local bank analyzed the relationship between monthly salary (y, in $) and length of service (x, measured in months) for 30 employees. She estimates the model: Salary = β0 + β1 Service + ε. The following ANOVA table summarizes a portion of the regression results. df SS MS F Regression 1 555,420 555,420 7.64 Residual 27 1,962,873 72,699 Total 28 2,518,293 Coefficients Standard Error t-stat p-value Intercept 784.92 322.25 2.44 0.02 Service 9.19 3.20 2.87 0.01...
You are a starting pitcher in the major leagues. It's January 2008, and you are in the process of negotiating your salary for the 2008 season. You hire a statistician to help you with your negotiations. She specifies the following simple linear regression model where y2008 salary (in millions of dollars) and x = performance during the regular 2007 season Then she selects a random sample of 50 major league starting pitchers with signed salary agreements for the 2008 season....
You are a starting pitcher in the major leagues. It's January 2008, and you are in the process of negotiating your salary for the 2008 season. You hire a statistician to help you with your negotiations. She specifies the following simple linear regression model: where y 2008 salary (in millions of dollars) and performance during the regular 2007 season x - Then she selects a random sample of 50 major league starting pitchers with signed salary agreements for the 2008...
In this exercise, you will investigate the relationship between a worker's age and earnings. (Generally, older workers have more job experience, leading to higher productivity and earnings.) The following table contains data for full-time, full-years workers, age 25-34, with a high school diploma or B.A./B.S. as their highest degree. Download the data from the table by clicking the download table icon 0 . A detailed description of the variables used in the data set is available here Use a statistical...
gretl: model 1 File Edit Tests Save Graphs Analysis LaTeX Question 5 In your first year microeconomics course you learned about differentiated products. As an econometrics student differentiated products are interesting because they are prime candidates for hedonic price modelling. As mentioned in class, a hedonic price model is a regression model that relates the price of a differentiated product (a residential house in this case) to its characteristics. For this assignment you will construct a simple hedonic model for...