8) (3 pts) Schlitz has been told that every thousand dollars added to the median household income should increase the page cost by less than $750. She wishes to test whether the regression provides convincing evidence to reject this claim. Therefore, she should test a) H0: βINCOME = -0.750 versus Ha: βINCOME < -0.750 b) H0: βINCOME = 0.750 versus Ha: βINCOME < 0.750 c) H0: βINCOME = -0.750 versus Ha: βINCOME ≠ -0.750 d) H0: βINCOME = 0.750 versus Ha: βINCOME ≠ 0.750 e) H0: βINCOME = -0.750 versus Ha: βINCOME > -0.750 f) H0: βINCOME = 0.750 versus Ha: βINCOME > 0.750 9) (3 pts) Schlitz is intrigued by the number 16.9149, which appears near the top of the regression output and is labeled as “Standard Error.” Which is the most accurate interpretation of this number? (choose one) a) 16.9149% of the variation in the PAGECOST variable can be explained by this regression. b) It measures the precision with which the regression has estimated the intercept coefficient. c) It is the standard deviation of the population parameter μ. d) It is the standard deviation of the residuals. 10) (2 pts) Schlitz is thinking of adding another explanatory variable to her regression (the actual numeric ratings from Media Journal). How will the standard error of the estimate change? (choose one): a) It will increase. b) It will decrease. c) It may increase or may decrease. d) It will definitely stay the same. 11) (3 pts) Suppose Schlitz ran a second regression of PAGECOST versus INCOME and RATING. Consider a plot of PAGECOST versus INCOME in which we represent the “above average rating” predictions with a red line and the “below average rating” predictions with a white line. Which is true? a) The intercepts of the red line and white line will be the same, but the slopes might be different. b) The slopes of the red line and the white line will be the same, but the intercepts might be different. c) The red and white lines will be identical. d) The red and white lines will have different intercepts and different slopes. 12) (1 pts) Suppose Schlitz runs a simple regression of PAGECOST versus DIE, where for DIE she enters the outcomes of 55 rolls of a 6-sided die. What is the probability that the resulting F-test indicates a significant regression at the 5% level? a) 0.00 b) 0.01 c) 0.05 d) 0.10 e) None of the above 13) (2 pts) Schlitz believes that using the “standard error of the estimate” to calculate an approximate prediction interval gives an under-estimate of the length of the true interval. This is a) True b) False c) Sometimes true and sometimes false. 14) (3 pts) If Schlitz’s regression suffers from multi-collinearity, it means that a) Residual errors are highly correlated with an explanatory variable b) An explanatory variable is highly correlated with the dependent variable c) Two or more explanatory variables are highly correlated with each other. d) Residual errors are highly correlated with the dependent variable e) There is a problem with covariation 15. (3 pts) A correlation analysis between sales and sales training scores results in correlation coefficient, R = +0.98. Which of the following best interprets the relationship between sales and sales training? a) 98% of the salespeople taking the test have higher sales b) the correlation between sales and sales training is very weak and insignificant c) the correlation between sales and sales training is strong and positive d) 98% of the variation in sales is explained by variations in sales training scores e) none of the above 16. (2 pts) Which of the following best describes the sampling distribution of the mean? a) it is the frequency distribution for all of the elements of the population b) it is a probability distribution of the means of varying sample sizes for relevant populations c) it is a probability distribution of the means of all possible samples of a given size drawn from a particular population d) none of the above e) (A), (B) and (C)
8) (3 pts) Schlitz has been told that every thousand dollars added to the median household income should increase the page cost by less than $750. She wishes to test whether the regression provides convincing evidence to reject this claim.
Therefore, she should test
We should note that there would be an increase in the page cost .Hence the null hypothesis should be a positive 0.750.
Next we should test whether the increase is 0.750 or lesser than 0.750 .Hence alternative hypothesis should be <0.750.So the correct choice is b).
b) H0: βINCOME = 0.750 versus Ha: βINCOME < 0.750
9) (3 pts) Schlitz is intrigued by the number 16.9149, which appears near the top of the regression output and is labeled as “Standard Error.” Which is the most accurate interpretation of this number?
As .,Standard error is a statistical term that measures the accuracy with which a sample represents a population.
. b) It measures the precision with which the regression has estimated the intercept coefficient.
This is the most accurate interpretation .
10) (2 pts) Schlitz is thinking of adding another explanatory variable to her regression (the actual numeric ratings from Media Journal). How will the standard error of the estimate change?
This question can be answered by defining R2.As R2 or the coefficient of determination i.e explained variation/total variation gives us how close the estimate fit is to the actual fit.As we keep on adding MUTUALLY INDEPENDENT variables the R2 increases and hence standard error estimate goes on decreasing as accuracy of the model increases.Note that if the variables are not independent collinearity may exist which may cause standard error to increase.
b) It will decrease
11) (3 pts) Suppose Schlitz ran a second regression of PAGECOST versus INCOME and RATING. Consider a plot of PAGECOST versus INCOME in which we represent the “above average rating” predictions with a red line and the “below average rating” predictions with a white line. Which is true?
The model here would be
PAGECOST = A*INCOME+B*RATING +C, where A and B are regression coefficients of income and rating respectively and C is the intercept.
For Y (Pagecost) vs. X1 (Income) we plot a graph and above average rating where red line represents above average rating and white line is below average rating
a) The intercepts of the red line and white line will be the same, but the slopes might be different.
The reason is that C i.e regressor of constant would be same in both cases but B would vary depending on the rating hence model Y would have same intercept but different regressor.
12) (1 pts) Suppose Schlitz runs a simple regression of PAGECOST versus DIE, where for DIE she enters the outcomes of 55 rolls of a 6-sided die. What is the probability that the resulting F-test indicates a significant regression at the 5% level?
As pagecost Y is indepednent of the regressor die the probability that F test would indicate a significant regressor is 0
a) 0.00
13) (2 pts) Schlitz believes that using the “standard error of the estimate” to calculate an approximate prediction interval gives an under-estimate of the length of the true interval.
This is false as any confidence interval can be predicted by
( (mean of estimate)-Zalpha*(S,E of estimate) ,(mean of estimate)+Zalpha*(S,E of estimate))
This is b)false
14) (3 pts) If Schlitz’s regression suffers from multi-collinearity, it means that
c) Two or more explanatory variables are highly correlated with each other.
15. (3 pts) A correlation analysis between sales and sales training scores results in correlation coefficient, R = +0.98. Which of the following best interprets the relationship between sales and sales training?
As R squared is the ratio of explained variation and total variation ,hence
d) 98% of the variation in sales is explained by variations in sales training scores
16. (2 pts) Which of the following best describes the sampling distribution of the mean?
a) it is the frequency distribution for all of the elements of the population
b) it is a probability distribution of the means of varying sample sizes for relevant populations
c) it is a probability distribution of the means of all possible samples of a given size drawn from a particular population
Answer is :
e) (A), (B) and (C) as all describe the sampling distribution.Note that frequency distribution is a population distribution when weights are the respective probabilities.
8) (3 pts) Schlitz has been told that every thousand dollars added to the median household...
17. A major airline company is concerned that its proportion of late arrivals has substantially increased in the past month. Historical data shows that on the average 18% of the company airplanes have arrived late. In a random sample of 1,250 airplanes, 250 airplanes have arrived late. If we are conducting a hypothesis test of a single proportion to determine if the proportion of late arrivals has increased: What is the correct statement of null and alternative hypothesis? A. H0:...
Hello, I need help understanding these problems. I already have the answers for them! I don't really understand #9. So researcher A is using the regression line to predict Math SAT from Critical reading score. Researcher B is using Average Math SAT score to predict Critical reading score? How would you calculate RMS error for researcher A and B? I need an explanation please! 10. For this question when you switch critical reading and Math SAT, the correlation will stay...
The data in the accompanying table give the prices (in dollars) for gold link chains at the Web site of a discount jeweler. The data include the length of the chain (in inches) and its width (in millimeters). All of the chains are 14-carat gold in a similar link style. Use the price as the response. For one explanatory variable, use the width of the chain. For the second, calculate the "volume" of the chain as π times its length...
The multiplication of two variables is used as a predictor if the two variables jointly affect the response. True O False Question 7 1 pts Even if the P-value of the F test in a multiple regression model is nearly zero, it is possible that the R of the model is much less than one. OT False Question 8 1 pts in selecting independent variables for a regression model, neither the forward selection method nor the backward elimination method guarantee...
using spss 2. The following table lists total sales of a specific merchandise in six stores during four seasons (Final Q2.sav): Sales (in thousand dollars) Winter Fall Summer Young Adults Young Adults Young AdultsYoung Adults 57 59 69 60 78 79 68 67 68 67 58 64 65 63 61 63 80 54 63 62 76 68 75 79 64 60 72 81 78 78 83 57 60 58 74 81 78 61 63 61 59 84 57 81 63...
10. A random sample of boarding school students was asked how many 8-ounce servings of soda they had consumed on a certain Sunday and how many hours of sleep they got that night. Their responses are displayed in the scatter plot below. Soda Sleep 0 6 0 8 1 6 1 7 2 7 3 5 3 8 4 6 5 5 6 3 6 6 7 4 7 6 8 3 10 2 a) Which variable has been used...
orrect Question 7 0/1 pts The amounts of 6 restaurant bill x (in dollars) and the corresponding amounts of the tips y (in dollars) are given below Bill 32.98 49.72 70.29 97.34 43.58 52.44 Tip 4.50 5.28 10.00 16.00 5.50 7.00 The regression equation is ý = 0.19.2 - 2.73, and the coefficient of determination r2 = 0.97. Based on the given r, which of the following conclusions may be made? Hint: Correlation is r, we are given p2. Thus...
TEST 1: ANSWERS INTS EACH). This section takes around 5 minutes. Name Spring 2019 8) A researcher wants to determine whether female teachers give higher or lower grades, on average, then male teachers. She picks a random sample by picking a random sample of schools, in the schools picked, picking a random sample of departments, and in the departments picked, picking a random sample of teachers. What kind of sampling was performed? d) voluntary response e) cluster b) stratified Random...