The following data was recorded for the "Age" of the various participants in a sports activity at a community centre. Age: 20 32 36 37 29 20 27 30 25 37 22 20 20 36 38 32 35 25 24 32 20 27 23 26 28 Use MiniTab to obtain the answers to following questions and draw the required diagrams. a. Find values of Mean, Variance and Standard Deviation using MiniTab. Simply write down the actual formulas for these three quantities. b. Calculate the values (by showing your calculations) of First, Second and Third Quartile and the Inter Quartile Range (IQR). c. Draw the boxplot, histogram, stem and leaf diagrams. Based on the boxplot, what can you say about the data in an approximate way? d. Now calculate the value of the median by simply using the stem and leaf diagram drawn above. It was found that the 14th observation was recorded incorrectly. It should have been '56' instead of 36. In light of this, answer the following by using MiniTab. e. Find values of Mean, Median, Standard Deviation and draw the boxplot for this slightly altered data. Can you see something unusual in the boxplot and comment on the values of Mean, Standard Deviation and Median when you compare them with what you found in part 'a' above?
I am using EXCEL to perform the analysis as I don't have MINITAB.
Functions of Excel used for calculation
The data of Age of 25 participants is stored from B5:B29
mean | =AVERAGE(B5:B29) |
variance | =VAR(B5:B29) |
Standard deviation | =STDEV.S(B5:B29) |
Q1 | =QUARTILE(B5:B29,1) |
Q2 | =MEDIAN(B5:B29) |
Q3 | =QUARTILE(B5:B29,3) |
IQR | =B36-B34 |
The interquartile range is a measure of where the “middle fifty” is in a data set. The interquartile range formula is the first quartile subtracted from the third quartile: IQR = Q3 - Q1
Count | Age |
1 | 20 |
2 | 32 |
3 | 36 |
4 | 37 |
5 | 29 |
6 | 20 |
7 | 27 |
8 | 30 |
9 | 25 |
10 | 37 |
11 | 22 |
12 | 20 |
13 | 20 |
14 | 36 |
15 | 38 |
16 | 32 |
17 | 35 |
18 | 25 |
19 | 24 |
20 | 32 |
21 | 20 |
22 | 27 |
23 | 23 |
24 | 26 |
25 | 28 |
mean | 28.04 |
variance | 38.04 |
Standard deviation | 6.167658 |
Q1 | 23 |
Q2 | 27 |
Q3 | 32 |
IQR | 9 |
BOXPLOT: EXCEL> INSERT> boxplot
Boxplot shows that there is no outlier in the data. Data is roughly equally distributed about the mean/ median value. So we can say data is roughly normal and follows the bell curve.
Stem and Leaf Plots
A Stem and Leaf Plot is a special table where each data value is split into a "stem" (the first digit or digits) and a "leaf" (usually the last digit).
STEM | LEAF |
2 | 000002345567789 |
3 | 0222566778 |
Histogram
14th observation in changed from 36 to 56
Count | Age |
1 | 20 |
2 | 32 |
3 | 36 |
4 | 37 |
5 | 29 |
6 | 20 |
7 | 27 |
8 | 30 |
9 | 25 |
10 | 37 |
11 | 22 |
12 | 20 |
13 | 20 |
14 | 56 |
15 | 38 |
16 | 32 |
17 | 35 |
18 | 25 |
19 | 24 |
20 | 32 |
21 | 20 |
22 | 27 |
23 | 23 |
24 | 26 |
25 | 28 |
mean | 28.84 |
variance | 67.30667 |
Standard deviation | 8.204064 |
Q1 | 23 |
Q2(median) | 27 |
Q3 | 32 |
IQR | 9 |
Yes due to the introduction of 56 data point, we have an outlier(dot) in the boxplot indicating that this is outside the permissible range of values.
The outlier is calculated by following formula
Lower limit = Q1 - 1.5*IQR = 23 - 1.5*9 = 9.5
Upper Limit = Q3 + 1.5*IQR = 32 + 1.5*9 = 45.5
Since 56 is beyond 45.5, it is an outlier.
Also, standard deviation and variance have increased as due to change from 36 to 56. Mean is almost same as before as mean is not much affected by one data point because it is a measure of central tendency but standard deviation and variance indicate spread so they change considerably due to the inclusion of a larger or smaller value.
Median remain unaffected as it is also a measure of central tendency.
The following data was recorded for the "Age" of the various participants in a sports activity...
Caffeine: Following are the number of grams of carbohydrates in 12-ounce espresso beverages offered at a coffee shop. Construct a boxplot for these data. The first quartile is 26, median is 32, and third quartile is 35. 10 32 28 26 48 33 35 50 32 35 30 34 32 15 38 36 27 15 17 33 26 37 20 31 Send data to Excel Construct a boxplot for the given data (including outliers if any). 5101520253035404550
The following data are the ages (in years) at diagnosis for 20 patients under treatment for meningitis: 18 18 25 19 23 20 69 18 21 18 20 18 18 20 18 19 28 17 18 18 (a) Calculate and interpret the values of the sample mean, variance, and standard deviation. (b) Compute the sample median. Why might you recommend the sample median as a measure of center rather than the sample mean? (c) Compute the upper quartile, the lower...
5. The following data represent the diastolic blood pressure for 40 individuals. 84878482 085589 39229 1865786 54155 1993 Find the sample mean, sample variance, sample standard deviation, and median Construct a stem-and-leaf plot of the data. Construct a histogram (Use Microsoft Excel or Minitab).
A simple random sample of 20 days was selected and the amount of food that Parsnip ate each day was recorded. The data is listed below. 33 38 32 49 19 23 14 18 14 51 33 19 12 39 40 37 17 18 34 40 Calculate the mean, median, range, standard deviation, and IQR of this data. Please write the answers in the box below. Construct an appropriate stem-and-leaf plot to graphically display this data. To do so, type...
PLEASE ANSWER ALL PARTS AND PROVIDE COMPLETE WORK FOR ALL PARTS. 1. An environmentalist studied 15 fish and measured their liver enzyme levels. After 21 days, she measured the total liver enzyme of each fish and obtained the following values. 124 122 122 135 127 119 134 121 120 132 161 a) (5pts) Construct a Stem and Leaf Plot. b) (Spts) Find the mean, median, mode, maximum and minimum.(round to the nearest tenth 0.0) Means Median Mode= Maximum Minimum c)...
n du Consider the data set below: 22 12 21 22 S = 19 45 20 30 37 34 44 23 26 33 70 54 40 48 23 17 22 56 38 17 21 16 33 41 84 43 27 65 34 19 100 34 48 29 54 16 43 33 8934 43 31 37 77 43 43 Problem 1 (a) Make a stem and leaf display of S. (b) Calculate the mean I = 2 (c) Calculate the 10%-trimmed...
The data shown below represent the age (in weeks) at which babies first crawl, based on a survey of 12 mothers. Complete parts (a) through (c) below. 355 52 30 47 37 30 47 44 58 35 Click the icon to view the table of critical correlation coefficient values for normal probability plots (a) Draw a normal probability plot to determine this reasonable to conclude the data come from a population that is normally distributed. Choose the correct answer belon...
Q1 Please use this sample dataset to answer the following questions: 3 26 38 22 21 35 10 16 13 22 13 25 28 36 10 4 17 15 46 32 (1) Please calculate mean, median, and mode. (2) Please calculate sample variance and sample standard deviation. (3) Please draw the stem-n-leaf plot.
Q1 Please use this sample dataset to answer the following questions: 3 26 38 22 21 35 10 16 13 22 13 25 28 36 10 4 17 15 46 32 (1) Please calculate mean, median, and mode. (2) Please calculate sample variance and sample standard deviation. (3) Please draw the stem-n-leaf plot.
In a 2003 report based on the data from the National Health and Nutrition Examina- tion Survey (NHANES) conducted annually in the US, investigators included the following summary values (in mg) of daily iron intake for the 20-39 age group: Female - Mean = 13.7 Median = 11.7 Standard Deviation = 8.9 Male - Mean = 17.9 Median = 15.7 Standard Deviation = 10.9 (i) Based on these summary measures, does it make sense to assume that daily iron intake...