Question

Height vs Weight - Erroneous Data: You will need to use software to answer these questions....

Height vs Weight - Erroneous Data: You will need to use software to answer these questions.

Below is the scatterplot, regression line, and corresponding data for the height and weight of 11 randomly selected adults. You should notice something odd about the last entry.

[Figure: scatterplot of weight (y) vs. height (x) with regression line]
Index  Height (x), inches  Weight (y), pounds
1      60                  120
2      72                  200
3      65                  130
4      72                  205
5      67                  180
6      69                  180
7      68                  193
8      69                  195
9      61                  115
10     62                  140
11     5.5                 160

You should be able to copy and paste the data by highlighting the entire table.

Answer the following questions regarding the relationship.

(a) Using all 11 data pairs for height and weight, calculate the correlation coefficient. Round your answer to 3 decimal places.
r =

(b) Is there a significant linear correlation between these 11 data pairs?

Yes
No


(c) Using only the first 10 data pairs for height and weight, calculate the correlation coefficient. Round your answer to 3 decimal places.
r =

(d) Is there a significant linear correlation between these 10 data pairs?

Yes
No


(e) Which statement explains this situation?

The height for the last data pair must be an error.

The erroneous value from the last data pair ruined a perfectly good correlation.    

Despite the low correlation coefficient from part (a), there is probably a significant correlation between height and weight.

All of these are valid statements.

Answer #1

We are given height and weight data for 11 randomly selected adults, one entry of which is erroneous, together with a scatterplot and regression line.

Index  Height (x), inches  Weight (y), pounds
1      60                  120
2      72                  200
3      65                  130
4      72                  205
5      67                  180
6      69                  180
7      68                  193
8      69                  195
9      61                  115
10     62                  140
11     5.5                 160

The scatter plot of the above data:

[Figure: scatterplot of weight (y) vs. height (x)]

The first 10 points fall roughly along a straight line, while the 11th point lies far away from them, so the 11th entry is clearly erroneous.

Before solving the problem, let us briefly review the correlation coefficient.

Correlation Coefficient

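The correlation coefficient measures the strength and direction of the linear relationship between two variables x and y. It is denoted by r and is calculated as

r=\frac{\mathrm{Covariance}(x,y)}{\sqrt{\mathrm{Variance}(x)}\sqrt{\mathrm{Variance}(y)}}=\frac{\sum_{i=1}^{n}(x_{i}-\overline{x})(y_{i}-\overline{y})}{\sqrt{\sum_{i=1}^{n}(x_{i}-\overline{x})^{2}}\sqrt{\sum_{i=1}^{n}(y_{i}-\overline{y})^{2}}}

-1≤r≤1

r=-1, perfect negative linear correlation between x and y

r=0, no linear correlation between x and y

r=1, perfect positive linear correlation between x and y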
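The definition above translates almost line by line into code. Here is a minimal Python sketch; the function name pearson_r and the toy data are illustrative choices, not part of the original problem.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient computed directly from its definition."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sums of squared deviations and of cross-products of deviations.
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return s_xy / math.sqrt(s_xx * s_yy)


# Toy data (made up for illustration): points lying exactly on a line.
print(pearson_r([1, 2, 3, 4], [2, 4, 6, 8]))  # increasing line, r = +1
print(pearson_r([1, 2, 3, 4], [8, 6, 4, 2]))  # decreasing line, r = -1
```

The common factor 1/(n-1) in the covariance and the variances cancels, which is why the function works with plain sums.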

Coming back to our problem

(a) Here we need to calculate the correlation coefficient using all 11 data pairs for height and weight.

The table of calculations is provided below:

Index  Height (x)  Weight (y)  (xi-x̄)^2   (yi-ȳ)^2    (xi-x̄)*(yi-ȳ)
       Inches      Pounds
1      60          120         0.9111     2049.6174   43.2128
2      72          200         122.0031   1205.9854   383.5804
3      65          130         16.3661    1244.1634   -142.6957
4      72          205         122.0031   1578.2584   438.8079
5      67          180         36.5481    216.8934    89.0339
6      69          180         64.7301    216.8934    118.4885
7      68          193         49.6391    768.8032    195.3527
8      69          195         64.7301    883.7124    239.171
9      61          115         0.0021     2527.3444   -2.2874
10     62          140         1.0931     638.7094    -26.4226
11     5.5         160         3075.2016  27.8014     292.3949
Total  670.5       1818        3553.2276  11358.1822  1628.6364

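\overline{x}=\frac{\sum_{i=1}^{n}x_{i}}{n}=\frac{670.5}{11}=60.9545

\overline{y}=\frac{\sum_{i=1}^{n}y_{i}}{n}=\frac{1818}{11}=165.2727

r=\frac{\sum_{i=1}^{n}(x_{i}-\overline{x})(y_{i}-\overline{y})}{\sqrt{\sum_{i=1}^{n}(x_{i}-\overline{x})^{2}}\sqrt{\sum_{i=1}^{n}(y_{i}-\overline{y})^{2}}}=\frac{1628.6364}{\sqrt{3553.2276}\sqrt{11358.1822}}

\Rightarrow r=0.256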

Hence the correlation coefficient using all 11 data pairs for height and weight is 0.256
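Since the problem asks for software, the column totals and r for all 11 pairs can be reproduced with a short Python sketch. The variable names are illustrative, and the sums may differ from the table in the last decimal place because the table rounds each row.

```python
import math

heights = [60, 72, 65, 72, 67, 69, 68, 69, 61, 62, 5.5]            # inches, as given
weights = [120, 200, 130, 205, 180, 180, 193, 195, 115, 140, 160]  # pounds

n = len(heights)
x_bar = sum(heights) / n
y_bar = sum(weights) / n

# Column totals of the calculation table.
s_xx = sum((x - x_bar) ** 2 for x in heights)
s_yy = sum((y - y_bar) ** 2 for y in weights)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(heights, weights))
r_all = s_xy / math.sqrt(s_xx * s_yy)

print(f"x_bar = {x_bar:.4f}, y_bar = {y_bar:.4f}")
print(f"S_xx = {s_xx:.4f}, S_yy = {s_yy:.4f}, S_xy = {s_xy:.4f}")
print(f"r = {r_all:.3f}")
```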

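(b) Here r=0.256 is close to 0. Assuming the customary 0.05 significance level (the problem does not state one), the critical value of r for n=11 is 0.602, and |r|=0.256 falls below it. Hence there is no significant linear correlation between these 11 data pairs.

So the answer is No.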

(c) Here we need to calculate the correlation coefficient using only the first 10 data pairs for height and weight.

The table of calculations is provided below:

Index  Height (x)  Weight (y)  (xi-x̄)^2  (yi-ȳ)^2  (xi-x̄)*(yi-ȳ)
       Inches      Pounds
1      60          120         42.25     2097.64   297.7
2      72          200         30.25     1169.64   188.1
3      65          130         2.25      1281.64   53.7
4      72          205         30.25     1536.64   215.6
5      67          180         0.25      201.64    7.1
6      69          180         6.25      201.64    35.5
7      68          193         2.25      739.84    40.8
8      69          195         6.25      852.64    73
9      61          115         30.25     2580.64   279.4
10     62          140         20.25     665.64    116.1
Total  665         1658        170.5     11327.6   1307

\overline{x}=\frac{\sum_{i=1}^{n}x_{i}}{n}=\frac{665}{10}=66.5

\overline{y}=\frac{\sum_{i=1}^{n}y_{i}}{n}=\frac{1658}{10}=165.8

r=\frac{\sum_{i=1}^{n}(x_{i}-\overline{x})(y_{i}-\overline{y})}{\sqrt{\sum_{i=1}^{n}(x_{i}-\overline{x})^{2}}\sqrt{\sum_{i=1}^{n}(y_{i}-\overline{y})^{2}}}=\frac{1307}{\sqrt{170.5}\sqrt{11327.6}}

\Rightarrow r=0.940

Hence the correlation coefficient using only the first 10 data pairs for height and weight is 0.940
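As a cross-check on the deviation table, the same value can be obtained from the algebraically equivalent "computational" form r = (nΣxy - ΣxΣy) / sqrt((nΣx² - (Σx)²)(nΣy² - (Σy)²)), which needs only raw sums. This is a sketch with illustrative variable names, not the method used in the answer above.

```python
import math

heights = [60, 72, 65, 72, 67, 69, 68, 69, 61, 62]            # first 10 heights, inches
weights = [120, 200, 130, 205, 180, 180, 193, 195, 115, 140]  # first 10 weights, pounds

n = len(heights)
sum_x = sum(heights)
sum_y = sum(weights)
sum_xx = sum(x * x for x in heights)
sum_yy = sum(y * y for y in weights)
sum_xy = sum(x * y for x, y in zip(heights, weights))

# Each bracket is n times the matching column total of the deviation table.
numerator = n * sum_xy - sum_x * sum_y
denominator = math.sqrt((n * sum_xx - sum_x ** 2) * (n * sum_yy - sum_y ** 2))
r_first10 = numerator / denominator

print(f"r = {r_first10:.3f}")
```

Here the numerator equals 10 × 1307 and the two brackets equal 10 × 170.5 and 10 × 11327.6, so the factors of n cancel and the result agrees with the table-based value.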

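(d) Here r=0.940 is close to 1 and exceeds 0.632, the critical value of r for n=10 at the customary 0.05 significance level (the problem does not state one). Hence there is a significant linear correlation between these first 10 data pairs.

So the answer is Yes.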
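One common way to make the Yes/No calls in parts (b) and (d) precise is the t test for a correlation coefficient, t = r·sqrt((n-2)/(1-r²)) with n-2 degrees of freedom. The sketch below assumes a two-tailed test at a 0.05 significance level, which the problem does not specify. The critical t values are hard-coded from a standard t table rather than computed, and the function name is illustrative.

```python
import math

# Two-tailed critical t values at alpha = 0.05, copied from a standard
# t table (assumption: the problem itself does not state alpha).
T_CRITICAL_05 = {8: 2.306, 9: 2.262}


def is_significant(r, n):
    """Return (t statistic, decision) for H0: no linear correlation."""
    df = n - 2
    t_stat = r * math.sqrt(df / (1 - r ** 2))
    return t_stat, abs(t_stat) > T_CRITICAL_05[df]


for label, r, n in [("all 11 pairs", 0.256, 11), ("first 10 pairs", 0.940, 10)]:
    t_stat, significant = is_significant(r, n)
    print(f"{label}: t = {t_stat:.2f}, significant = {significant}")
```

Under these assumptions the 11-pair correlation is not significant and the 10-pair correlation is, matching the answers to (b) and (d).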

(e)

  • The height for the last data pair must be an error: the other ten heights range from 60 to 72 inches, so an adult height of 5.5 inches is not plausible (the value was probably entered in feet rather than inches).
  • The erroneous value from the last data pair ruined a perfectly good correlation: with the last pair omitted, the remaining 10 pairs have r=0.940, compared with r=0.256 for all 11 pairs.
  • Despite the low correlation coefficient from part (a), which is caused by the erroneous pair, there is probably a significant correlation between height and weight, as parts (c) and (d) show.

Hence all of these are valid statements.
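The influence of the last pair can also be checked numerically by dropping each point in turn and recomputing r. The sketch below is a simple leave-one-out check with illustrative names; it is a diagnostic aid, not a formal outlier test.

```python
import math

heights = [60, 72, 65, 72, 67, 69, 68, 69, 61, 62, 5.5]            # inches
weights = [120, 200, 130, 205, 180, 180, 193, 195, 115, 140, 160]  # pounds


def pearson_r(xs, ys):
    """Pearson correlation coefficient from sums of deviations."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    return s_xy / math.sqrt(s_xx * s_yy)


r_all = pearson_r(heights, weights)

# Recompute r with each data pair left out; keys are the 1-based table indices.
loo = {}
for i in range(len(heights)):
    xs = heights[:i] + heights[i + 1:]
    ys = weights[:i] + weights[i + 1:]
    loo[i + 1] = pearson_r(xs, ys)
    print(f"drop index {i + 1:2d}: r = {loo[i + 1]:.3f}")

most_influential = max(loo, key=lambda idx: abs(loo[idx] - r_all))
print("largest change in r when dropping index", most_influential)
```

On this data the largest shift comes from dropping index 11, which takes r from 0.256 to 0.940. Dropping any other single pair leaves the erroneous height in place and r stays low.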

