(i)
y = 33.5413 - 0.0877(x1) - 0.0553(x2) = 0.0759(x3) + 1.3299(x5) - 0.0002(x6)
(ii)
(iii)
d) In most cases we use Log Transformation to increase the model efficiency.
e)
Without Number 17 R2 is reduced from 0.82 to 0.816 but without number 2 R2 becomes 0.824.Though there is only slight variation reducing coefficient of determination becomes serious so we can say that the number 17 is more influential than the number 2.
. The data set below contains information about the gasoline mileage performance for 32 au- tomob...
The following data were collected on a simple random sample of 20 patients with hypertension: Y=mean arterial blood pressure (mmHg), X1=age(years), X2= weight (kg), X3=body surface area (sq m), X4=duration of hypertension, X5 =basal pulse (beats/min), X6=measure of stress. A researcher is interested in developing a regression model to predict mean arterial blood pressure and has produced the following output: > rcorr(as.matrix(hyper)) Y X1 X2 X3 X4 X5 X6 Y 1.00 0.66 0.95 0.87 0.29 0.72 0.16 X1 0.66 1.00...
The following data were collected on a simple random sample of 20 patients with hypertension: Y=mean arterial blood pressure (mmHg), X1=age(years), X2= weight (kg), X3=body surface area (sq m), X4=duration of hypertension, X5 =basal pulse (beats/min), X6=measure of stress. A researcher is interested in developing a regression model to predict mean arterial blood pressure and has produced the following output: > rcorr(as.matrix(hyper)) Y X1 X2 X3 X4 X5 X6 Y 1.00 0.66 0.95 0.87 0.29 0.72 0.16 X1 0.66 1.00 0.41 0.38 0.34 0.62 0.37 X2 0.95 0.41...
Two linear regression models are fitted using software and below is their R2 and adjusted R2 values. Which of the two models fits the data better? Why does it fit the model better? In order from Model, R specification, R2, Adjusted R2 Model Model 1 : Y ∼ X1 + X3, 0.91, 0.84 Model 2 : Y ∼ X1 + X2, 0.88, 0.86
4. The anscombe data set in the datasets R package (should automatically be loaded) contains 4 pairs of response-explanatory variables. The pairs are xl-yl, x2-y2, x3-y3, and x4-y4 where x is the explanatory variable and y is the response variable. (a) Run 4 simple linear regression analyses (one on each of the 4 pairs) to verify that the regression output is exactly the same (up to numerical accuracy) b) For each pair, describe what is wrong (if anything) and use...
3. Description of each X and data for 27 franchise stores are given below The data (X1, X2, X3, X4, X5, X6) are for each franchise store. X1 annual net sales/$1000 X2 number sq. ft/1000 X3 - inventory I$1000 X4- amount spent on advertising /$1000 X5 size of sales district/1000 families X6 number of competing stores in distric X1 X2 X3 X4 X5 X6 231 3 294 8.2 8.2 11 156 2.2 232 6.9 4.1 12 10 0.5 149 3...
2. Suppose Y ~ Exp(a), which has pdf f(y)-1 exp(-y/a). (a) Use the following R code to generate data from the model Yi ~ Exp(0.05/Xi), and provide the scatterplot of Y against X set.seed(123) n <- 500 <-rnorm (n, x 3, 1) Y <- rexp(n, X) (b) Fit the model Yi-Ao + Ax, + ε¡ using the lm function in R and provide a plot of the best fit line on the scatterplot of Y vs X, and the residual...
Please include the R code for each individual question. Save PDF to My Note The article "The Undrained Strength of Some Thawed Permafrost Soils" (Canadian Geotech. J., 1979: 420-427) contained the accompanying data on y shear strength of sandy soil (kPa), xl depth (m), and x2 water content (%) Obs Depth Content Strength 8.9 31.5 14.7 2 36.6 27.0 48.0 3 36.8 25.9 25.6 46.1 39.1 10.0 56.9 39.216.0 66.9 38.3 16.8 77.3 33.9 20.7 88.4 33.8 38.8 9 6.5...
Question 2: Suppose that we wish to fit a regression model for which the true regression line passes through the origin (0,0). The appropriate model is Y = Bx + €. Assume that we have n pairs of data (x1.yı) ... (Xn,yn). a) From first principle, derive the least square estimate of B. (write the loss function then take first derivative W.r.t coefficient etc) b) Assume that e is normally distributed what is the distribution of Y? Explain your answer...
Help is needed on question 1. The second picture is the data set “Showtime.xlsx” needed to answer the question . Stat 351 Homework #5 (Section 15.8-16.1) Make sure to show your work if you did any caleulation, and Minitab output if you used Minitab. I. Please download the dataset "Showtime.xlsx" from Canvas. The dataset "Showtime.xlsx" gives the data on weekly gross revenue (y), television advertising (x1), and newspaper advertising (32) for Showtime Movie Theaters. Use Minitab to help you answer...