Consider the following measurements of blood hemoglobin concentrations (in g/dL) from three human populations at different geographic locations:
population1 = [ 14.7 , 15.22, 15.28, 16.58, 15.10 ]
population2 = [ 15.66, 15.91, 14.41, 14.73, 15.09]
population3 = [ 17.12, 16.42, 16.43, 17.33]
Perform ANOVA to check if any of these populations have different mean hemoglobin concentrations. (Assume that all the ANOVA requirements such as normality, equal variances and random samples are met.) After you perform ANOVA perform a Tukey-Kramer post-hoc test at a significance level of 0.05 to see which populations actually have different means. As usual, round all answers to two digits after the decimal point. (Make sure you round off to at least three digits any intermediate results in order to obtain the required precision of the final answers.) For any questions, which ask about differences in means or test statistics, which depend on differences in means provide absolute values. In other words if you get a negative value, multiply by -1 to make it positive.
QUESTION 5
For the three populations, what is the value of MSgroups in the ANOVA table?
QUESTION 6
For the three populations, what is the value of MSerror in the ANOVA table?
QUESTION 7
If you run ANOVA on the data for the three populations, what is the F-ratio?
QUESTION 8
If you run ANOVA on the data for the three populations, what is the range of the P-value?
Greater than 0.05 |
||
between 0.05 and 0.025 |
||
between 0.025 and 0.01 |
||
less than 0.01 |
Ho : There is no significant difference in the population's mean hemoglobin concentrations.
H1 : There is significant difference in the population's mean hemoglobin concentrations.
Level of Significance :- (l.o.s.) alpha = 0.05
Decision Criteria :- Reject Ho at 5% l.o.s if F > F crit
Calculations: Using the Single Factor Annova test from the data analysis package in the Excel, we get the following results :
Anova: Single Factor | ||||||
SUMMARY | ||||||
Groups | Count | Sum | Average | Variance | ||
Population 1 | 5 | 76.88 | 15.376 | 0.50408 | ||
Population 2 | 5 | 75.8 | 15.16 | 0.3912 | ||
Population 3 | 4 | 67.3 | 16.825 | 0.2207 | ||
ANOVA | ||||||
Source of Variation | SS | df | MS | F | P-value | F crit |
Between Groups | 7.043066 | 2 | 3.521533 | 9.129 | 0.004606 | 3.982 |
Within Groups | 4.24322 | 11 | 0.385747 | |||
Total | 11.28629 | 13 |
Conclusions : Since F > F crit, we reject Ho at 5% l.o.s. and thus conclude that there is significant difference in the population's mean hemoglobin concentrations.
Tukey-Kramer post-hoc test at a significance level of 0.05 to see which populations actually have different means is been carried out in the Excel as follows:
Difference | n1 | n2 | SE | q statistic | q table | ||
Population 1 | Population 2 | 0.216 | 5 | 5 | 0.278 | 0.778 | 3.82 |
Population 1 | Population 3 | 1.449 | 5 | 4 | 0.295 | 4.918 | 3.82 |
Population 2 | Population 3 | 1.665 | 5 | 4 | 0.295 | 5.652 | 3.82 |
Here, No. of cases , a =[ No. of groups*( No. of groups -1 ) ] / 2 = 3*2/2 = 3
Difference = Absolute difference of the means of the two selected populations
SE = sqrt [ 0.5 * MSE *(1/n1 + 1/n2) ] where MSE = 0.385747 from the first table.
q statistic = Difference / SE
Decision Criteria : If q statistic > q table at 5% l.o.s. , then we conclude that the respective population means are significantly different. Here, value from q table at 5% l.o.s is obtained for the degrees of freedom of (a , N - a ) where N is the total number of observations.
Thus, the q table for this problem corresponding to the degrees of freedom of ( 3 , 11 ) is 3.82
Conclusion : Since q statistic > q table at 5% l.o.s. for the second and the third case of the 2nd table, we conclude that the means of population 1 and 3 and means of population 2 and 3 differ significantly from each other.
Question 5 : MSgroups = 3.521533 = 3.522 [ from the first table ]
Question 6 : MSerrors = 0.385747 = 0.386 [ from the first table ]
Question 7 : F-ratio = 9.129 [ from the first table ]
Question 8 : p-value = 0.004606 [ from the first table ]
i.e. less than 0.01
Consider the following measurements of blood hemoglobin concentrations (in g/dL) from three human populations at different...
You have measured the blood hemoglobin concentrations in a random sample of 12 males aged 20-29 years and have obtained the following values in mg/dL: [ 14.7, 15.22, 15.28, 16.58, 15.1, 15.66, 15.91, 14.41, 14.73, 15.09, 15.62, 14.92] Calculate the following from the above sample: 1.95% confidence interval for the mean hemoglobin concentration in the population of 20-29 year old males. 2. 99% confidence interval for the mean hemoglobin concentration in the same population 3. 95% confidence interval for the...
A random sample of five observations from three normally distributed populations produced the following data: (You may find it useful to reference the F table.) Treatments 22 28 23 XA = 24.8 SA = 48.2 25 XR = 21.6 = 27.3 24 23 27 = 24.0 = 21.0 X d. Specify the competing hypotheses in order to determine whether some differences exist between the population means. OHO: MA - MB - MC; HA: Not all population means are equal. OHO:...
QUESTION 7 The following data are taken from three different populations known to be normally distributed, with equal population variances based on independent simple random samples. Sample 1 Sample 2 Sample 3 39 40 43 37 38 50 35 33 42 45 35 54 37 30 48 30 52 Given that one-way ANOVA was performed and we reject the null hypothesis, please use Tuley's test to determine which pairwise means differ using a familywise error rate of a -0.05. 22...
Consider the following competing hypotheses and accompanying sample data drawn independently from normally distributed populations. (You may find it useful to reference the appropriate table: z table or t table) Ho: H1-Hu2 0 HA: H1 Hz< e 251 252 s1 39 s=19 n1=7 n 7 a-1. Calculate the value of the test statistic under the assumption that the population variances are equal. (Negative values should be indicated by a minus sign. Round all intermediate calculations to at least 4 decimal...
Consider the following summary statistics, calculated from two independent random samples taken from normally distributed populations. Sample 1 F1 = 22.49 11 = 2.54 P1 = 15 Sample 2 F2 = 27.31 3 = 3.08 P2 = 18 Test the null hypothesis HO : H1 = 2 against the alternative hypothesis HA: MI <H2 a) To save you on calculations, I will tell you that the standard error of the difference in sample means (SE(X_1 bar - X_2 bar)) is...
Consider the following competing hypotheses and accompanying sample data drawn independently from normally distributed populations. (You may find it useful to reference the appropriate table: z table or t table) H0: μ1 − μ2 ≥ 0HA: μ1 − μ2 < 0 x¯1x¯1= 249x−2x−2= 262s1 = 35s2 = 23n1 = 10n2 = 10a-1. Calculate the value of the test statistic under the assumption that the population variances are equal. (Negative values should be indicated by a minus sign. Round all intermediate calculations to at least 4 decimal places and final answer to 3 decimal places.) a-2. Find the p-value. multiple choice 1p-value < 0.010.01 ≤ p-value...
The following sample data reflect shipments received by a large firm from three different vendors and the quality of those shipments (You may find it useful to reference the appropriate table: chi-square table or table) Vendor DefectiveAcceptable 20 25 24 116 67 255 a. Select the competing hypotheses to determine whether quality is associated with the source of the shipments H: Quality and source of shipment (vendor) are independent; Ha: Quality and source of shipment (vendor) are dependent. Ho: Quality...
The following sample data reflect shipments received by a large firm from three different vendors and the quality of those shipments. (You may find it useful to reference the appropriate table: chi-square table or F table) Vendor Defective Acceptable 1 18 117 2 26 66 3 33 248 a. Select the competing hypotheses to determine whether quality is associated with the source of the shipments. H0: Quality and source of shipment (vendor) are independent.; HA: Quality and source of shipment...
The following data represent the results from an independent-measures experiment comparing three treatment conditions. Use StatCrunch to conduct an analysis of variance with a = 0.05 to determine whether these data are sufficient to conclude that there are significant differences between the treatments. Treatment A Treatment B Treatment C 5 5 12 3 6 6 5 4 10 4 7 9 3 3 8 F-ratio = p-value = Conclusion: There is a significant difference between treatments These data do not...
1. Three different metal alloys were tested for tensile strength. The strength of some examples of each alloy measured (in hundreds of megapascals), as follows. was Alloy Tensile strength of some examples 19.8 12.4 1 15.2 14.8 2 8.9 11.6 10.0 11.9 3 10.5 13.8 12.1 Source: the data come from Berenson and Levine (1998), Business Statistics: A First Course, p. 449, Question 10.27, but shortened.) Taking the types of alloy one-way ANOVA. The following R commands were used: as...