1.between free burned(y) and panel area (X1)
X Values
∑ = 69066
Mean = 4062.706
∑(X - Mx)2 = SSx =
147711085.529
Y Values
∑ = 18234.42
Mean = 1072.613
∑(Y - My)2 = SSy =
9061658.29
X and Y Combined
N = 17
∑(X - Mx)(Y - My) = 14711481.545
R Calculation
r = ∑((X - My)(Y - Mx)) /
√((SSx)(SSy))
r = 14711481.545 / √((147711085.529)(9061658.29)) = 0.4021
Meta Numerics (cross-check)
r = 0.4021
Key
X: X Values
Y: Y Values
Mx: Mean of X Values
My: Mean of Y Values
X - Mx & Y -
My: Deviation scores
(X - Mx)2 & (Y -
My)2: Deviation
Squared
(X - Mx)(Y -
My): Product of Deviation Scores
The value of R is 0.4021.
Although technically a positive correlation, the relationship between your variables is weak (nb. the nearer the value is to zero, the weaker the relationship).
between free burned (y) and floor area(X2)
X Values
∑ = 25491
Mean = 1499.471
∑(X - Mx)2 = SSx =
7017316.235
Y Values
∑ = 18234.42
Mean = 1072.613
∑(Y - My)2 = SSy =
9061658.29
X and Y Combined
N = 17
∑(X - Mx)(Y - My) = 6201663.956
R Calculation
r = ∑((X - My)(Y - Mx)) /
√((SSx)(SSy))
r = 6201663.956 / √((7017316.235)(9061658.29)) = 0.7777
Meta Numerics (cross-check)
r = 0.7777
Key
X: X Values
Y: Y Values
Mx: Mean of X Values
My: Mean of Y Values
X - Mx & Y -
My: Deviation scores
(X - Mx)2 & (Y -
My)2: Deviation
Squared
(X - Mx)(Y -
My): Product of Deviation Scores
The value of R is 0.7777.
This is a strong positive correlation, which means that high X variable scores go with high Y variable scores (and vice versa).
between free burned (y) and # panel(X3)
X Values
∑ = 199
Mean = 11.706
∑(X - Mx)2 = SSx = 341.529
Y Values
∑ = 18234.42
Mean = 1072.613
∑(Y - My)2 = SSy =
9061658.29
X and Y Combined
N = 17
∑(X - Mx)(Y - My) = 33631.065
R Calculation
r = ∑((X - My)(Y - Mx)) /
√((SSx)(SSy))
r = 33631.065 / √((341.529)(9061658.29)) = 0.6045
Meta Numerics (cross-check)
r = 0.6045
Key
X: X Values
Y: Y Values
Mx: Mean of X Values
My: Mean of Y Values
X - Mx & Y -
My: Deviation scores
(X - Mx)2 & (Y -
My)2: Deviation
Squared
(X - Mx)(Y -
My): Product of Deviation Scores
The value of R is 0.6045.
This is a moderate positive correlation, which means there is a tendency for high X variable scores go with high Y variable scores (and vice versa).
2
a) significnce test between panel area(X1) and free burned (y)
r = 0.4021
n = 17
t test = r*square root of [(n-2)/( 1- r^2)]
t = 1.70 so, right tail P-Value is .054883.
The result is not significant at p < .05. so there is
no strong correlation between panel area and free burned
b)significnce test between #panel(X3) and free burned (y)
r = 0.6045
n= 17
t =2.93 right tail P-Value is .005173.
The result is significant at p < .05 means there is a strong
correlation between # panel and free burned
c) r = 0.7777 we can say that there is a strong correlation between floor area and free burned at 5 % significance because of same sample size and high r value.
3) between panel area and free burned- linear equation
linear equation : y= 0.0996*x +667.98
between floor area and free burned- linear equation
linear equation : y = 0.8838*x - 252.57
between# panel and free burned- linear equation
linear equation : y = 98472*x -80.088
4) after removal the outlier
14459-panel area | 1138- free burned |
y = 0.4256*x -383.86
5) between panel area and free burned after removal outlier
Y = -383.8617 + 0.4256 X1
Source |
DF |
Sum of Squares | Mean Square | F Statistic | P-value |
---|---|---|---|---|---|
Regression (between ŷi andyi bar) |
1 |
5953159.9054 |
5953159.9054 |
26.8510 |
0.0001391 |
Residual (between yi and ŷi) |
14 |
3103955.7000 |
221711.1214 |
||
Total(between yi andyi bar) |
15 |
9057115.6054 |
603807.7070 |
Coeff |
SE | t-stat | lower t0.025(14) | upper t0.975(14) |
Stand Coeff |
p-value |
VIF |
|
---|---|---|---|---|---|---|---|---|
intercept | -383.8617 | 304.0027 | -1.2627 | -1035.8825 | 268.1592 | 0.000 | 0.2273 | |
slope | 0.4256 | 0.08212 | 5.1818 | 0.2494 | 0.6017 | 0.8107 | 0.0001391 | 1.0000 |
Y and X
relationship
R square (R2) equals 0.6573. It means
that the predictors (Xi) explain 65.7% of the variance
of Y.
Adjusted R square equals 0.6328.
The coefficient of correlation (R) equals 0.8107.
It means that there is a very strong direct relationship between
the predicted data (ŷ) and the observed data (y).
Goodness of
fit
Overall regression: right-tailed, F(1,14) = 26.8510,
p-value = 0.0001391. Since p-value < α (0.05), we reject the
H0.
The linear regression model, Y = b0+
b1X1provides a better fit
All the independent variables (Xi) are significant.
The Y-intercept : two-tailed, T = -1.2627, p-value = 0.2273. Hence
intercept is not significantly different from zero. It is still
most likely recommended not to force b to be zero.
between floor area and free burned - there is no outlier
Y = -252.5679 + 0.8838 X1
Source |
DF |
Sum of Squares | Mean Square | F Statistic | P-value |
---|---|---|---|---|---|
Regression (between ŷi andyi bar) |
1 |
5480818.3840 |
5480818.3840 |
22.9589 |
0.0002378 |
Residual (between yi and ŷi) |
15 |
3580839.9055 |
238722.6604 |
||
Total(between yi andyibar) |
16 |
9061658.2896 |
566353.6431 |
Coeff |
SE | t-stat | lower t0.025(15) | upper t0.975(15) |
Stand Coeff |
p-value |
VIF |
|
---|---|---|---|---|---|---|---|---|
intercept | -252.5679 | 300.8844 | -0.8394 | -893.8878 | 388.7521 | 0.000 | 0.4144 | |
slope | 0.8838 | 0.1844 | 4.7915 | 0.4906 | 1.2769 | 0.7777 | 0.0002378 | 1.0000 |
Y and X
relationship
R square (R2) equals 0.6048. It means
that the predictors (Xi) explain 60.5% of the variance
of Y.
Adjusted R square equals 0.5785.
The coefficient of correlation (R) equals 0.7777.
It means that there is a strong direct relationship between the
predicted data (ŷ) and the observed data (y).
Goodness of
fit
Overall regression: right-tailed, F(1,15) = 22.9589,
p-value = 0.0002378. Since p-value < α (0.05), we reject the
H0.
The linear regression model, Y = b0+
b1X1 provides a better fit t
All the independent variables (Xi) are significant.
The Y-intercept : two-tailed, T = -0.8394, p-value = 0.4144. Hence
intercept is not significantly different from zero. It is still
most likely recommended not to force b to be zero.
between # panel and free burned - there is no outlier
y = -80.0880 + 98.4719 X1
Source |
DF |
Sum of Squares | Mean Square | F Statistic | P-value |
---|---|---|---|---|---|
Regression (between ŷi andyibar) |
1 |
3311716.2806 |
3311716.2806 |
8.6393 |
0.01015 |
Residual (between yi and ŷi) |
15 |
5749942.0089 |
383329.4673 |
||
Total(between yi andyibar) |
16 |
9061658.2896 |
566353.6431 |
Coeff |
SE | t-stat | lower t0.025(15) | upper t0.975(15) |
Stand Coeff |
p-value |
VIF |
|
---|---|---|---|---|---|---|---|---|
intercept | -80.0880 | 419.9374 | -0.1907 | -975.1634 | 814.9873 | 0.000 | 0.8513 | |
slope | 98.4719 | 33.5021 | 2.9393 | 27.0639 | 169.8800 | 0.6045 | 0.01015 | 1.0000 |
Y and X
relationship
R square (R2) equals 0.3655. It means
that the predictors (Xi) explain 36.5% of the variance
of Y.
Adjusted R square equals 0.3232.
The coefficient of correlation (R) equals 0.6045.
It means that there is a strong direct relationship between the
predicted data (ŷ) and the observed data (y).
Goodness of
fit
Overall regression: right-tailed, F(1,15) = 8.6393,
p-value = 0.01015. Since p-value < α (0.05), we reject the
H0.
The linear regression model, Y = b0+
b1X1, provides a better fit.
All the independent variables (Xi) are significant.
The Y-intercept : two-tailed, T = -0.1907, p-value = 0.8513. Hence
intercept is not significantly different from zero. It is still
most likely recommended not to force b to be zero.
so, the best model is panel area nad free burned relationship.
6)
with three explanatory variable
y = -740.43 + 0.0454749*X1 + 0.659653*X2 + 54.6017X3
Variable | Parameter | S.D. | T-STAT H0: parameter = 0 |
2-tail p-value | 1-tail p-value |
(Intercept) | -740.4 | 336.6 | -2.2000e+00 | 0.04651 | 0.02326 |
X1 | +0.04548 | 0.03758 | +1.2100e+00 | 0.2478 | 0.1239 |
X2 | +0.6596 | 0.1887 | +3.4960e+00 | 0.003946 | 0.001973 |
X3 | +54.6 | 26.01 | +2.1000e+00 | 0.05586 | 0.02793 |
Multiple Linear Regression - Regression Statistics | |
Multiple R | 0.8532 |
R-squared | 0.7279 |
Adjusted R-squared | 0.6651 |
F-TEST (value) | 11.59 |
F-TEST (DF numerator) | 3 |
F-TEST (DF denominator) | 13 |
p-value | 0.0005619 |
Multiple Linear Regression - Residual Statistics | |
Residual Standard Deviation | 435.5 |
Sum Squared Residuals | 2.465e+06 |
WITH two explanatory variable - floor area and # panel
y = -650.516 + 0.722435X2 + 54.6614V3X3
Variable | Parameter | S.D. | T-STAT H0: parameter = 0 |
2-tail p-value | 1-tail p-value |
(Intercept) | -650.5 | 333.7 | -1.9500e+00 | 0.07155 | 0.03578 |
X2 | +0.7224 | 0.1844 | +3.9170e+00 | 0.001548 | 0.0007738 |
X3 | +54.66 | 26.43 | +2.0680e+00 | 0.05766 | 0.02883 |
Multiple Linear Regression - Regression Statistics | |
Multiple R | 0.835 |
R-squared | 0.6973 |
Adjusted R-squared | 0.654 |
F-TEST (value) | 16.12 |
F-TEST (DF numerator) | 2 |
F-TEST (DF denominator) | 14 |
p-value | 0.0002329 |
Multiple Linear Regression - Residual Statistics | |
Residual Standard Deviation | 442.6 |
Sum Squared Residuals | 2.743e+06 |
Based on F test and slope test of two models we can say that the second model is best ( two variable) because 69 % explained by this two variable if I include third variable help to explained extra 3 %.
7) y = -740.43 + 0.0454749*X1 + 0.659653*X2 + 54.6017X3
x1 =5000
x2= 2500
x3 =11
y = -740.43 + 0.0454749*5000 + 0.659653*2500 + 54.6017*11
y = 1736.6957
Eclipse Engineering provides services of structural engineering. They just opened a new branch in...
Eclipse Engineering provides services of structural engineering. They just opened a new branch in Portland, OR. One of their products is called Structural Insulated Panels, or SIPs. It is a type of foam insulation that replaces typical wall and roof framing for residential or commercial buildings. Clients are billed a fixed fee for each project based on square footage of the building, square footage of the panel area, and number of panels. Depending on how long a project takes to...
Styles The data in the accompanying table represent the population of a certain country every 10 years for the years 1900-2000. An ecologist is interested in finding an equation that describes the population of the country over time. Complete parts (a) through (3) below Year, x 1900 1910 1920 1930 1940 1950 Population, y Year, x Population, y 179,323 203,302 79,212 1960 95,228 1970 104,021 1980 123,202 1990 132,164 2000 151,325 226,542 248,709 281,421 (a)Determine the least-squares regression equation, treating...
Please show all work AND any calculator functions. The table below summarizes data of heights and weights of 8 randomly selected adults. Use this information to answer parts a) – h). Height (x) in inches 5.2 5.6 5.8 5.9 5.4 6.1 6.0 5.7 Weight (y) in pounds 119 136 155 185 135 202 194 165 a) Test the claim ρ ≠ 0. Use α = 0.01. • State the hypotheses and label the claim. • State the significance level (α)....
Y = listing priceX1 = interior floor space (square footage of the house)X2 = land size (square footage). This is usually represented by length of the front of the lot (in ft) X the depth of the lot (in ft). sizes range but a common lot size in Vancouver is 33 X 120. You will need to convert this to feet squared (area) before entering it into your spreadsheet (i.e., 3960)X3 = number of bedroomsX4 = number of bathroomsX5 =...
(1 point) College Graduation Rates. Data from the College Results Online website compared the 2011 graduation rate and school size for 92 similar sized public universities and colleges in the United States. Statistical software was used to create the linear regression model using size as the explanatory variable and graduation rate as the response variable. Summary output from the software and the scatter plot are shown below. Round all calculated results to four decimal places Coefficients Estimate Std. Errort value...
7. A study was conducted to investigate the effectiveness of a new drug for treating Stage 4 AIDS patients. A group of AIDS patients was randomly divided into two groups. One group received the new drug; the other group received a placebo. The difference in mean subsequent survival (those with drugs - those without drugs) was found to be 1.04 years and a 95% confidence interval was found to be 1.04 ± 2.37 years. Based upon this information: Select one...
Hi! Can someone help me to calculate the last question, the CI? Thanks! 1 point) College Graduation Rates. Data from the College Results Online website compared the 2011 graduation rate and school sze for 92 similar-sized public universities and colleges in the United States. Statistical sottware was used to places. create the linear regression model using size as the explanatory variable and graduation rate as the response variable. Summary output from the software and the scatter plot are shown below....
u LLS FRUICHUISJ and 6 use the following information. Prehistoric pottery vessels are usually found as sherds (broken pieces) and are care- fully reconstructed if enough sherds can be found. Information taken from Mimbres Mogollon Archaeology by A. I. Woosley and A.J. McIntyre (University of New Mexico Press) provides data relating x = body diameter in centimeters and y = height in centimeters of prehistoric vessels reconstructed from sherds found at a prehistoric site. The following Minitab printout provides an...
**************ALL OF THE PARTS ARE FROM ONE QUESTION.**************** **********ALL OF THE PARTS ARE FROM ONE QUESTION.*********** (a) The mathematical ability of a random sample of 306 students were as- sessed by means of a test. The test regards the mathematical ability of the student as either: low, medium or high. The handedness of each subject (right-handed of left-handed) was also recorded. The data is pre- sented in the table below. Low MediumHigh Total Left-handed19 393391 1 215 74 306 114...
The multiplication of two variables is used as a predictor if the two variables jointly affect the response. True O False Question 7 1 pts Even if the P-value of the F test in a multiple regression model is nearly zero, it is possible that the R of the model is much less than one. OT False Question 8 1 pts in selecting independent variables for a regression model, neither the forward selection method nor the backward elimination method guarantee...