Question

Question 1 (50 pts): Suppose that a client of yours measure the heights (in inches) of n - 30 wheats grown at locations of various elevations (measured as meters above sea levels). Af- ter some discussion, you decided to fit a linear regression of wheat heights (denoted as yi) on the elevations of the locations (denoted as zi) as follows where ei, E2, . . . , En are i.i.d. errors with Elei] 0 and var(G) σ2. You calculated some descriptive statistics on the data set before transferring it to your colleague. The summary statistics are as follows. 7 TL TI i-1 i-1 Your colleague fitted a simple linear regression using least squares estimation, but somehow lost the original set of data. All that is left is the contaminated R output below. Call: 1m(formula y x) Residuals: Median 0.06319 Mi n 3Q Max 1.62276 Coefficients: Estimate std. Error t value Pr ltl (Intercept) 0.475946 49.01 2e-16 10.46 3.56e-11 0.047799 signif. codes: 0 0.0010.01 0.05 0.11 Residual standard error: 1.045 Multiple R-squared: F-statistic: 109.3 on 1 and 28 DF, p-value: 3.56le- In addition, your colleague vaguely remembered that Σ_1e-30.6, where ei the residual. Your task is to recover as many results as possible before the meeting with your client.1. (10 pts) What is the point estimate of B? Interpret the meaning of Bi with this estimated value in words that your client with no statistical background can understand. 2. (10 pts) Can you recover the estimated value of Bo? If so, show how you calcu- late βο: if not, explain why. 3. (8 pts) The colleague remembered that Bo 10. Interpret the meaning of Bo with this estimated value in words that your client with no statistical background can understand. (If your estimate is different, you can use yours.)4. (12 pts) Describe how you would construct a 95% confidence interval for A. Justify your decision. 5. (10 pts) You remember that your client lives in Davis. In an effort to make the results more informative to your client, you decide to replace the covariate in the regression model, i.e., meters above sea levels x, with feet above the Davis level. It is known that 1 meter is about 3.28 feet, and that Davis is at 52 ft (16 meter) above the sea level. In other words, you want to fit a regression of y on 3.28 x( 16). Can you calculate the values of least squares estimators for the new regression model?Question 2 (50 pts): A client walks into your office and asks for help in analyzing the market of electric ve- hicles. The client has collected data from all 58 counties in California including the proportion of electric vehicles among all vehicles in a county, and the countys average household income (in dollars). The client wants to study the association between each countys average household income and the usage of electric vehicles (measured by the proportion electric vehicles among all vehicles). Seeing this simple problem, you decide to fit a simple linear regression on this data set with the usage of electric vehicle as the response. However, your client wants you to explain the analysis plan before handing over the data set. 1. (26 pts) Write down the simple linear regression model you plan to fit on this data set. Explain the notations clearly, and interpret the parameters in your model in words that your client with no statistical background can understand.2. (10 pts) Your client is not sure about the use of the linear model, because, based on your interpretation, the model might suggest that the proportion of electric vehicles among all vehicles is larger than one when the countys household income is sufficiently large. Despite that the interpretation of model parameters makes sense, the client thinks that this model cannot be used for prediction. Do you agree with your client? Explain your answer 3. (14 pts) Your client wants to know why you choose the least square estimation among all possible statistical methods. (Hint: Explain the property of the least squares estimator (8 pts). Prove your claims to get full credits (6 pts).)

0 0
Add a comment Improve this question Transcribed image text
Know the answer?
Add Answer to:
Question 1 (50 pts): Suppose that a client of yours measure the heights (in inches) of...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • A client of yours wants to find out the best microbial environment for C. elegans. In...

    A client of yours wants to find out the best microbial environment for C. elegans. In previous meetings, the client told you that C. elegans feed on bacteria but may also be killed by certain bacteria. Therefore, it is important to figure out what bacteria are beneficial to C. elegans. In particular, the client was interested in studying the association between the density of Gluconobacter and the density of C. elegans. The client had collected some pilot data for this...

  • Question text Suppose that you have a five-point sample data set; the observations of (x, y)...

    Question text Suppose that you have a five-point sample data set; the observations of (x, y) are given by (8, 3), (10, 3), (6, 2), (2, 0), and (2, 1). Fit a simple linear regression model to this data by first computing the least squares estimate of the slope parameter. Which of the following is the most accurate? Select one: a. 0.3438 b. 0.4728 d. 0.6712

  • 2. The data set prostate in the faraway package is from a study on 97 men...

    2. The data set prostate in the faraway package is from a study on 97 men with prostate cancer who were due to receive a radical prostatectomy. We are interest is in predicting lpsa (log prostate specific antigen) with lcavol (log cancer volume). (a) Draw a scatterplot - does a simple linear regression model seem reasonable? (b) Without using the R function Im(0, compute the values , Y,Sxx, Syy and Sxy. Com pute the ordinary least squares estimates of the...

  • Please solve the question Simulation: Assume the simple linear regression model i = 1,... , n Ул 3D Во + B1; + ei, N(0,...

    Please solve the question Simulation: Assume the simple linear regression model i = 1,... , n Ул 3D Во + B1; + ei, N(0, o2) for i = 1,...,n. where e Let's set Bo = 10, B1 = -2.5, and n = 30 (a) Set a = 100, and x; = i for i = 1,...,n. (b) Your simulation will have 10,000 iterations. Before you start your iterations, set a random seed using your birthday date (MMDD) and report the...

  • Question 2: Hypothesis testing (30 pts) Consider the following simple linear regression model with E[G-0 and...

    Question 2: Hypothesis testing (30 pts) Consider the following simple linear regression model with E[G-0 and var(G)-σ2. The output of linear where €1, €2, . . . ,en regression from R takes the form are i.i.d. errors Cal1: lm(formula y ~ x + 1) Residuals: Min 1Q Median 3Q Max 2.0606-0.3287-0.1148 0.5902 1.2809 Coefficients: Estimate Std. Error t value Prlt (Intercept) 0.507932 0.340896 1.49 0.147 0.049656 0.003455 14.37 1.89e-14 Signif. codes: 0.0010.010.05 .'0.1''1 Residual standard error: 0.7911 on 28 degrees...

  • If can't complete it all, I can post more questions just let me know!! There is an old saying in golf: "You drive for show and you putt for dough. "The point is that good putting is more...

    If can't complete it all, I can post more questions just let me know!! There is an old saying in golf: "You drive for show and you putt for dough. "The point is that good putting is more important than long driving for shooting low scores and hence winning money. To see if this is the case, data on the top 69 money winners on the PGA tour in 1993 are examined. The average number of putts per hole for...

  • 1. (55 points) The investigators are interested in asses the relationship between Systolic Blood Pressure (SBP)...

    1. (55 points) The investigators are interested in asses the relationship between Systolic Blood Pressure (SBP) in mm Hg and Age in years among Hypertensive Patients. Specif- ically, whether a patient's SBP can be predicted from his or her age. They selected n=122 patients at random from a medical record database in a hospital. Assume that the simple linear regression model is appropriate. The following table shows regression output of a simple linear regression model relating the SBP to the...

  • QUESTION 1 Consider the following OLS regression line (or sample regression function): wage =-2.10+ 0.50 educ...

    QUESTION 1 Consider the following OLS regression line (or sample regression function): wage =-2.10+ 0.50 educ (1), where wage is hourly wage, measured in dollars, and educ years of formal education. According to (1), a person with no education has a predicted hourly wage of [wagehat] dollars. (NOTE: Write your answer in number format, with 2 decimal places of precision level; do not write your answer as a fraction. Add a leading minus sign symbol, a leading zero and trailing...

  • Question 1: (50 marks – 500 words)    A colleague is planning to conduct a survey among...

    Question 1: (50 marks – 500 words)    A colleague is planning to conduct a survey among a sample of university students on a sensitive health topic. He is considering the following methods of data collection: Face-to-face interviews A self-completion survey using a smartphone app. Recommend the method of data collection that you believe is most appropriate, giving reasons for your choice. Discuss the advantages and disadvantages of each of these methods for this survey. Question 2: (50 marks – 500...

  • 5. (20 pts) Suppose that we have a dataset {(yi, x, Tt2, X;3), i,1,... ,n} together with some general belief on the dat...

    5. (20 pts) Suppose that we have a dataset {(yi, x, Tt2, X;3), i,1,... ,n} together with some general belief on the data that higher (lower) value of each covariate x; (j = 1,2,3) will tend to result in higher (lower) y. In this study, we are interested in predicting y; from the total set of the regressors x;i, X;2, xt3. So, we apply the multiple linear regression yi = Bo+B1x1 +B2x52 + B3x43 + t to the data and...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT