1.
Heteroscedastisity refers to the case when the variability of the random disturbance is different for different components of the vector i.e. a random disturbance term {\displaystyle \epsilon _{i}} that has mean zero and standard deviation of {\displaystyle \sigma _{i}}
Autocorrelation in regression refers to the non zero correlation between the different components of random distubances i.e. Cov( ei, ej) is not 0.
b.
Estimated regression equation shows the equation for y hat i.e. the predicted y on the basic of estimated regression coefficients. For instance, y hat= b0 + b1 * x.
On the other hand, specified regression model is the equation for actual values of y. It is based on population regression coefficients and error terms. For instance, y = βo + β1 * x + e
C.
A data type, specially in programming, is a classification that specifies which type of value a variable can take, for eg. Integers, floating point, character, strings, array etc.
Level of measurement or scale of measure is a classification that describes the nature of the data under consideration. There are 4 levels of measurement - nominal, ordinal, ratio, interval.
D.
Multiple Regression is the statistical model that we use to predict a continuous outcome on the basis of two or more continuous predictor variables using some mathematical equation, such as predicting the speed of a vehicle on the basis of fuel consumption.
ANOVA is the statistical technique that we use to analyze the differences among the population mean of a continuous variable that depends on one or more categorical variables, such as average yield of a crop depending on different fertilisers.
E.
An outlier is a data point that differs significantly from other observations, may be due to variability in the measurement or experimental error. They differ significantly from the overall pattern of the data.
Infuencers are the points in the sample that when removed can significantly change the slope of the regression line. The influencers can be seen as very extreme outliers.
Distinguish between the following: Heteroskedasticity and autocorrelation specified regression model vs estimated regression equation data type...
(a) Distinguish between autocorrelation and heteroscedasticity and explain their implications for the OLS estimators. (b) Briefly discuss the alternative tests, at least two in case, employed to detect the problems of autocorrelation and heteroscedasticity in the estimated regression model. (c) Using the data on consumer prices, broad money (M2) and Treasury bill rate, as given in question (1), test the quantity theory of money (QTM) as represented by: pt=β0+β1mt+β1yt+ut such that β0>;β1>0;β2<0;β1=1;β2=-1 Show the estimated regression model, together with all...
1.The Breusch-Pagan test for heteroskedasticity A. tests for a relationship between the estimated residuals and the independent variables B. tests for a relationship between the squared estimated residuals and the independent variables C. tests for a relationship between the estimated residuals and the dependent variable D. tests for a relationship between the squared estimated residuals and the dependent variable 2. In the presence of heteroskedasticity hypothesis testing is unreliable (T/F) 3. Plotting the residuals (predicted errors) against the independent variables...
In a multiple regression model involving 44 observations, the following estimated regression equation was obtained. ŷ= 29 + 18x1 + 43x2 + 87x3 For this model, SSR = 600 and SSE = 400. The F statistic for the above model is 10 60 400 600
Problem 2 (25 pts). The following multiple regression model is to study the tradeoff between time spent on sleeping and working and to look at other factors affecting sleep: sleep-a-bítoturk + bgeduc-b3age te where sleep and totwork (total work) are measured in minutes per week and educ and age are measured in years The estimated model is as follows. sleep3638.25 0.148 totwrk -11.13 educ + 2.2 age (5.88) (112.28) (0.017) n = 706, R-0.113 The standard error of each coefficient...
6. In a regression model involving 46 observations, the following estimated regression equation was obtained. û = 17+4X1 - 3X2 + 8X3 +5X4 + 8Xs For this model, SST = 3410 and SSE = 510. a. Compute the coefficient of determination. b. Perform an F test and determine whether or not the regression model is significant
In a regression model involving 30 observations, the following estimated regression equation was obtained: 17+ 4x4 - 3x2 + 8x3+8X For this model SSR = 700 and SSE - 100. The computed Fstatistic for testing the significance of the above model is 43.75 7.00 50.19 0.875
Exhibit 1 In a regression model involving 30 observations, the following estimated regression equation was obtained: Y=17+4X, - 3X,+8X,+8X For this model SSR = 700 and SSE = 100. 12. Refer to Exhibit 1. The conclusion is that the 2. model is not significant b. model is significant c. slope of X, is significant d. slope of X, is significant
Suppose an estimated regression equation is y^ ̂ = - 17.00 5 + 12 . 966 x . For this simple linear regression model , there are 6 observations, SSE = 1 04 3. 03 and SST = 10568 . Does the evidence indicate a significant relationship between x and y ? Hint: This question does not require MINITAB . Use the ANOVA table and the fact SST = SSR + SSE, consider an F - tes t .
For a multiple regression model, why is the estimated correlation between the coefficients beta 1 hat and beta 2 hat positive when the correlation between the regressors variables is negative?
In a regression analysis involving 30 observations, the following estimated regression equation was obtained. y=17.6+3.8x1-2.3x2+7.6x3+2.7x4 For this estimated regression equation SST= 1805 and SSR=1756. a. At a= 0.05 , test the significance of the relationship among the variables. SSE=_________________ (to 1 decimal, if necessary) MSR=________________ (to 1 decimal, if necessary) MSE=+_______________ (to 2 decimals, if necessary) What is the value of the F test statistic (to 1 decimal)? _______________ What is the p-value? - Select your answer -less than .01,between...