An agronomist wanted to investigate the factors that determine crop yield. Accordingly, she undertook an experiment in which thirty greenhouses of the same size were rented. In each, the amount of fertiliser (kg) and the amount of water (litres per week) were varied. At the end of the growing season, the amount of corn (kg) harvested was recorded. Using this data set, perform the following tasks to help the agronomist.
a) Specify a multiple population regression model to find out whether there is a direct link between crop yield and the amounts of fertilizer and water used. Assuming that neither fertilizer nor water is used excessively, do you expect the slope parameters to be negative or positive? Interpret the unadjusted and the adjusted coefficients of determination. What do they tell you about how well this multiple regression model fits the data? .Test the overall utility of this multiple regression model.
Corn | Fertilizer | Water |
223 | 100 | 1000 |
321 | 200 | 1000 |
158 | 300 | 1000 |
187 | 400 | 1000 |
331 | 500 | 1000 |
255 | 100 | 2000 |
362 | 200 | 2000 |
216 | 300 | 2000 |
301 | 400 | 2000 |
342 | 500 | 2000 |
353 | 100 | 3000 |
328 | 200 | 3000 |
252 | 300 | 3000 |
346 | 400 | 3000 |
343 | 500 | 3000 |
220 | 100 | 4000 |
385 | 200 | 4000 |
247 | 300 | 4000 |
390 | 400 | 4000 |
303 | 500 | 4000 |
345 | 100 | 5000 |
268 | 200 | 5000 |
380 | 300 | 5000 |
415 | 400 | 5000 |
353 | 500 | 5000 |
366 | 100 | 6000 |
376 | 200 | 6000 |
323 | 300 | 6000 |
421 | 400 | 6000 |
448 | 500 | 6000 |
Answer:
Multiple Linear Regression Model:
The given data is as follows:
Corn | Fertilizer | Water |
223 | 100 | 1000 |
321 | 200 | 1000 |
158 | 300 | 1000 |
187 | 400 | 1000 |
331 | 500 | 1000 |
255 | 100 | 2000 |
362 | 200 | 2000 |
216 | 300 | 2000 |
301 | 400 | 2000 |
342 | 500 | 2000 |
353 | 100 | 3000 |
328 | 200 | 3000 |
252 | 300 | 3000 |
346 | 400 | 3000 |
343 | 500 | 3000 |
220 | 100 | 4000 |
385 | 200 | 4000 |
247 | 300 | 4000 |
390 | 400 | 4000 |
303 | 500 | 4000 |
345 | 100 | 5000 |
268 | 200 | 5000 |
380 | 300 | 5000 |
415 | 400 | 5000 |
353 | 500 | 5000 |
366 | 100 | 6000 |
376 | 200 | 6000 |
323 | 300 | 6000 |
421 | 400 | 6000 |
448 | 500 | 6000 |
1.Enter the data into Excel sheet as shown above. |
2.If this is the first time you have used an Excel add-in, click the File tab, otherwise skip to step 7. |
3.Click Options from the list on the left. |
4.Select Add-ins in the Excel Options box. |
5.In the Add-in list box, select Analysis Toolbox-VBA from the Inactive Application Add-ins list. |
6.Click OK. |
7.Then select Data/ Data Analysis tab from the menu bar. |
8.The Data Analysis dialog box will appear on the screen. |
9.From the Data Analysis dialog box, select Regression and click OK. |
10.The Regression dialog box will appear on the screen. |
11.Place independent variable (Fertilizer and Water) in Input X Range and place dependent variable (Corn) in Input Y Range. |
12.Place appropriate confidence level in Confidence Level box. (If necessary) |
13. Give Output Range. |
14.Click OK. |
Then the MS-Excel gives the following output:
Regression Statistics | |
Multiple R | 0.644717 |
R Square | 0.41566 |
Adjusted R Square | 0.372376 |
Standard Error | 57.29273 |
Observations | 30 |
ANOVA | |||||
df | SS | MS | F | Significance F | |
Regression | 2 | 63042.85 | 31521.42 | 9.602996 | 0.000708 |
Residual | 27 | 88626.35 | 3282.457 | ||
Total | 29 | 151669.2 |
Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | |
Intercept | 194.84 | 32.57803 | 5.980718 | 2.23E-06 | 127.9954 | 261.6846 |
Fertilizer | 0.122667 | 0.073965 | 1.658451 | 0.108801 | -0.0291 | 0.274429 |
Water | 0.024846 | 0.006125 | 4.056542 | 0.000381 | 0.012279 | 0.037413 |
From the above Excel output, the multiple linear regression model is,
Unadjusted Coefficient of Determination:
From the above output, the unadjusted coefficient of determination, R2 = 0.4157
The coefficient of determination is a measure used in statistical analysis that assesses how well a model explains and predicts future outcomes. It is indicative of the level of explained variability in the data set. More commonly, it is used as a guideline to measure the accuracy of the model. Here, the coefficient of determination is 0.4157. Thus we can interpret that, approximately 41.57% of the observed variation can be explained by the model.
Adjusted Coefficient of Determination:
From the above output, the unadjusted coefficient of determination, Adjusted R2 = 0.3724
The adjusted R-squared is a modified version of R-squared for the number of predictors in a model. Adjusted R-squared compares the correlation of the investment to several measured models. It explains the percentage of variation of the independent variables that affect the dependent variables. If the adjusted coefficient of determination is closer to 1, it indicates that the estimated equation of regression fits the data. Here the adjusted coefficient of determination is 0.3724. Thus, we can interpret that 37.24% of the variation in model is explained by the independent variables (Fertilizers and Water).
The closer the value of R2 is to 1, the better the fit, or relationship, between the two factors.
Since, R2 = .4157, which indicates the fitting of the multiple linear regression model is not good.
The test for significance of regression in the case of multiple linear regression analysis is carried out using the analysis of variance. The test is used to check if a linear statistical relationship exists between the response variable and at least one of the predictor variables. The statements for the hypotheses are:
H0: β1 = β2 = 0
H1: βj ≠ 0, for at least one value of j; j =
1, 2.
From the above Excel output, the test statistic F = 9.60299
At 5% level of significance, the critical F value, F0.05, 2, 27 = 0.000708
Decision Rule: Reject the null hypothesis, if calculated F value > Critical F value.
Conclusion:
Here, Calculated F value > Critical F value (i.e., 9.603 > 0.0007). Therefore, we reject the null hypothesis at 5% level of significance. It is concluded that at least one coefficient out of β1 and β2 is significant. In other words, it is concluded that a regression model exists between Corn and either one or both of the independent variables (i.e. Fertilizers and Water).
An agronomist wanted to investigate the factors that determine crop yield. Accordingly, she undertook an experiment...
Question 2 The market for barley has the following demand and supply schedule. Price (RM) 100 200 300 400 500 600 Quantity demanded (tonnes) 7000 6000 5000 4000 3000 2000 (tonnes) 1000 2000 3000 4000 5000 6000 (a) State the price, quantity and revenue received at equilibrium. 18 marks] Study shows that barley is good for diabetics. The quantity demanded is increased by 2000 tonnes. Determine the new quantity demanded, price (i) and quantity. 6 marks] (b) Calculate the new...
Using the regression equation y=-0.16729x+1233.33, determine what the R(x) equation is? Chart Title 1200 X1 1943 3190 4570 6490 7330 p 1035 581 405 124 85 1000 y = -0.1673x + 1232.9 R= 0.932 800 600 400 200 0 1000 2000 3000 4000 5000 6000 7000 8000
1. P Q1 Q2 500 1000 100 1000 900 200 1500 800 300 2000 700 400 2500 600 500 3000 500 600 3500 400 700 4000 300 800 4500 200 900 5000 100 1000 A. For the above prices and quantities: Which are the demand quantities and which are the supply quantities? B. Graph demand and supply on one graph. (Plot the points) C. What is the equilibrium price and quantity? Approximately. D. What would a price ceiling set at...
Write a Java program with Counter-Controlled Loop to output the following: N 10*N 100*N 1000*N 1 10 100 1000 2 20 200 2000 3 30 300 3000 4 40 400 4000 5 50 500 5000 The output should look as shown above. The columns should be properly aligned.
6.6 See Figure 6.71. Determine the measurements shown, on the civil engineer's scales. oe V س لسليس مسلسل سلسلسسلسسلسلسليس مسلسل السلسبيست س لسلس ] lululululululululululululululululululululululululululululululululululului + 6.6: Civil engineer's scales Number Ratio 1"=1" 1"=1' 1: 1 0 .1" 0.1' 1"=10' 1' 1"-100' 1"=1000 10' 100' 1"=5" 1"=5' 1"=50' 1"=500' 1"=5000' 1”=2" 1”=2° 1"=20° 1"=200° 1"=2000" 1”=4” 1”=4' 1"=40' 1"=400' 1"-4000' 1: 3 1 "=3" 1"=3' 1"=30' 1"=300' 1"=3000' 1:6 1"=6" 1"=6' 1"=60' 1"=600' 1"=6000'
13. The Wall Street Journal CEO Compensation Study analyzed chief executive officer (CEO) pay from many U.S. companies with fiscal year 2008 revenue of at least $5 billion that filed their proxy statements between October 2008 and March 2009. The data are in the file P02_30.xlsx. a) Create a new variable Total 2008, the sum of Salary 2008 and Bonus 2008. (Actually, this is not “total” compensation because it omits the very lucrative compensation from stock options.) Also, recode Company...
Assume that SunSpecs Corporation makes three types of sunglasses, Razors, Slims, and Eagles, for major retailers such as Ray-Ban and Gucci. SunSpecs presently applies overhead using a predetermined rate based on direct labor hours. A consultant recommended that SunSpecs switch to activity-based costing. Management decided to give ABC a try and identified the following activities, cost drivers, and estimated costs for Year 2 for each activity center. Activity Recommended Cost Driver Annual Costs Estimated Annual Costs driver units...
1. (35 pts) Specific fuel consumption rate curves are given for three different engines: A, B and C. Power and Torque curves for Engine 'B' are also given at Page 3. Please answer the following questions for Engine 'B'. a) (5 pts) Find the power and torque when it runs at the engine rpm of minimum specific fuel consumption rate. b) (5 pts) Find the torque and specific fuel consumption rate when it runs at the engine rpm of maximum...
13. The Wall Street Journal CEO Compensation Study analyzed chief executive officer (CEO) pay from many U.S. companies with fiscal year 2008 revenue of at least $5 billion that filed their proxy statements between October 2008 and March 2009. The data are in the file P02_30.xlsx. a) Create a new variable Total 2008, the sum of Salary 2008 and Bonus 2008. (Actually, this is not “total” compensation because it omits the very lucrative compensation from stock options.) Also, recode Company...
Refer to the Lincolnville School District bus data. First, add a variable to change the type of engine (diesel or gasoline) to a qualitative variable. If the engine type is diesel, then set the qualitative variable to 0. If the engine type is gasoline, then set the qualitative variable to 1. Develop a regression equation using statistical software with maintenance cost as the dependent variable and age, odometer miles, miles since last maintenance, and engine type as the independent variables....