(a) Generating the summary statistics: Using excel,
We get the output:
Substituting the mean and standard deviations to compute the Coefficient of Variation (in %):
s | CV(%) | ||
x1 | 79.04 | 12.08 | 12.08 / 79.04 = 15.54 |
x2 | 79.48 | 12.50 | 15.73 |
x3 | 81.48 | 11.77 | 14.44 |
x4 | 162.04 | 24.04 | 14.84 |
Comparing the spread relative to the mean would be the same as comparing the CVs. We find the CV for all the four variables are approximately equal i.e. they do not appear to be significantly different. We may say that:
Yes, the spread is about the same.Yes, the tests are about the same level of difficulty.
(b) Computing the correlation coefficient:
r | r2 | |
x1,x2 | 0.90 | 0.81 |
x1,x3 | 0.89 | 0.80 |
x1,x4 | 0.95 | 0.90 |
x2,x3 | 0.85 | 0.72 |
x2,x4 | 0.93 | 0.86 |
x3,x4 | 0.97 | 0.95 |
We find the highest correlation coefficient obtained is 0.97 observed between 3 and 4. Hence, we may say that:
3. Beacause it has the highest correlation with 4. Yes, the other two still have a lot of influence because of their high correlations with 4.
(c) Running a multiple regression by regressing x4 on the predictors x1,x2 and x3, we get the output:
(d) The fitted regression equation can be expressed as:
where the intercept is estimated to be -4.34 and the slope of x1 is estimated to be 0.36, other predictors in the model being constant; similarly for x2 and x3. Hence, we may say that:
If we hold all other explanatory variables as fixed constant, then we can look at one coefficient as slope.
Here, the slope of x3 can be interpreted as: the mean score of x4 increases by 1.17 units for a unit increase in x3. Hence, if marks in x3 increases by 14, x4 marks is expected to increase by (14)(1.17) = 16.38 points
(e) We test the significance of the slope coefficients by testing the hypothesis:
Vs
t | P-value | |
2.93 | 0.008 | |
5.38 | 0.000 | |
11.33 | 0.000 |
Since, the p-value of the t test of all three predictors are significant at 5% level (since, 0.008,0.000 < 0.05), we may say that we reject all null hypothesis; there is sufficient evidence that differ from zero.
If a coefficient is different from zero, then it contributes to the regression equation.
(f) The 90% CI for slope can be constructed using the formula:
where the critical value of t is obtained as:
= 1.714
Lower Limit | Upper Limit | |
0.36 - 1.714 x 0.12 = 0.15 | 0.36 + 1.714 x 0.12 = 0.56 | |
0.37 | 0.72 | |
0.99 | 1.34 |
For x1 = 68, x2 = 72,x3 = 75
y = -4.34 + 0.36(68) + 0.54(72) + 1.17(75)
= 146.49 = 146 (approx.)
Professor Gill has taught General Psychology for many years. During the semester, she gives three multiple-choice...
The systolic blood pressure of individuals is thought to be related to both age and weight. For a random sample of 11 men, the following data were obtained Weight (pounds) Systolic Blood pressue Age (years) 149 132 52 173 143 59 184 153 67 194 162 73 211 154 64 196 16B 74 220 137 54 188 61 188 159 65 207 128 46 167 166 72 217 (a) Generate summary statistics, including the mean and standard deviation of each...
Student stress at final exam time comes partly from the uncertainty of grades and the consequences of those grades. Can knowledge of a midterm grade be used to predict a final exam grade? A random sample of 200 BCOM students from recent years was taken and their percentage grades on assignments, midterm exam, and final exam were recorded. Let’s examine the ability of midterm and assignment grades to predict final exam grades. The data are shown here: Assignment Midterm FinalExam...
Question 1 Which of the following choices best describes the scatterplot shown below? Group of answer choices Linear, negative, strong No form, weak Linear, positive, strong Curved, weak Question 2 Which of the following choices is most likely to be the correlation of the data in the scatterplot shown below? Group of answer choices -0.14 1.04 0.86 -0.92 Question 3 Most roller coasters get their speed by dropping down a steep initial incline, so it makes sense that we can...
8) (3 pts) Schlitz has been told that every thousand dollars added to the median household income should increase the page cost by less than $750. She wishes to test whether the regression provides convincing evidence to reject this claim. Therefore, she should test a) H0: βINCOME = -0.750 versus Ha: βINCOME < -0.750 b) H0: βINCOME = 0.750 versus Ha: βINCOME < 0.750 c) H0: βINCOME = -0.750 versus Ha: βINCOME ≠ -0.750 d) H0: βINCOME = 0.750 versus...
Al Greens is a franchise store that sells house plants and lawn and garden supplies. Although Al Greens is a franchise, each store is owned and managed by private individuals. Some friends have asked you to go into business with them to open a new All Greens store in the suburbs of San Diego. The national franchise headquarters sent you the following information at your request. These data are about 27 Al Greens stores in California. Each of the 27...
Industrial-organizational psychologists are interested in all of the following except1. how to best diagnose clinical disorders and offer therapy to employees.2. how personality characteristics influence work behavior.3. how culture influences people's perceptions of their working environments.4. how people's work affects their home life.An organizational psychologist would be most likely concerned with1. studying the interaction between humans and technology.2. All of the these3. interviewing potential employees.4. helping people organize their schedules and daily planners.5. understanding the emotional and motivational side of...
All Greens is a franchise store that sells house plants and lawn and garden supplies. Although All Greens is a franchise, each store is owned and managed by private individuals. Some friends have asked you to go into business with them to open a new All Greens store in the suburbs of San Diego. The national franchise headquarters sent you the following information at your request. These data are about 27 All Greens stores in California. Each of the 27...
Use the following regression output from R studio and answer the following questions. 1. Comment on the relation between the two test marks. A. There appears to be no linear relation between the two test marks B. There appears to be strong negative linear relation between the two test marks. C. There appears to be weak positive linear relation between the two test marks. D. There appears to be strong positive linear relation between the two test marks. 2. If the linear...
1. Many companies use a incoming shipments of parts, raw materials, and so on. In the electronics industry, component parts are commonly shipped from suppliers in large lots. Inspection of a sample of n components can be viewed as the n trials of a binomial experimem. The outcome for each component tested (trialD will be that the component is classified as good or defective defective components in the lot do not exceed 1 %. Suppose a random sample of fiver...
The following ANOVA model is for a multiple regression model with two independent variables: Degrees of Sum of Mean Source Freedom Squares Squares F Regression 2 60 Error 18 120 Total 20 180 Determine the Regression Mean Square (MSR): Determine the Mean Square Error (MSE): Compute the overall Fstat test statistic. Is the Fstat significant at the 0.05 level? A linear regression was run on auto sales relative to consumer income. The Regression Sum of Squares (SSR) was 360 and...