1. The data set on sheet #1 gives data on GPA category and number of hours studied. Construct comparative box plots of the data first GPA category. Then conduct two-sample t-test on the data for whether GPA category influences the number of hours studied. Be prepared to explain the results of the test and the meaning of the boxplots and how they relate to each other. Then redo the analysis by replacing the ordinal GPA category with a numerical dummy variable with Low=0, High=1. Run a regression analysis on how study hours (x) influence GPA category (y). Include the scatterplot. Compare the results of the two tests. Be able to state and null and alternative hypotheses
Student | GPA | Hours per week |
1 | Low | 6 |
2 | Low | 18 |
3 | Low | 16 |
4 | Low | 14 |
5 | High | 0 |
6 | Low | 22 |
7 | Low | 15 |
8 | Low | 12 |
9 | High | 6 |
10 | Low | 7 |
11 | Low | 5 |
12 | High | 20 |
13 | High | 9 |
14 | High | 9 |
15 | Low | 22 |
16 | Low | 23 |
17 | High | 8 |
18 | Low | 7 |
19 | Low | 14 |
20 | Low | 12 |
21 | Low | 0 |
22 | High | 7 |
23 | High | 4 |
24 | Low | 9 |
25 | Low | 0 |
26 | Low | 0 |
27 | High | 6 |
28 | High | 14 |
29 | Low | 10 |
30 | Low | 9 |
31 | High | 5 |
32 | High | 7 |
33 | High | 4 |
34 | High | 16 |
35 | High | 0 |
36 | Low | 20 |
37 | Low | 13 |
38 | High | 0 |
39 | High | 4 |
40 | Low | 6 |
41 | Low | 17 |
42 | Low | 8 |
43 | High | 4 |
44 | Low | 0 |
45 | High | 16 |
46 | Low | 17 |
47 | Low | 4 |
48 | High | 11 |
49 | Low | 14 |
50 | Low | 16 |
51 | High | 11 |
52 | High | 7 |
53 | High | 4 |
54 | Low | 11 |
55 | Low | 8 |
56 | High | 2 |
57 | Low | 0 |
58 | Low | 0 |
59 | High | 13 |
60 | Low | 18 |
61 | Low | 28 |
62 | High | 1 |
63 | Low | 20 |
64 | Low | 13 |
65 | Low | 4 |
66 | Low | 7 |
67 | High | 11 |
68 | Low | 12 |
69 | High | 5 |
70 | Low | 7 |
71 | Low | 22 |
72 | High | 8 |
73 | Low | 19 |
74 | Low | 8 |
75 | High | 2 |
76 | High | 11 |
77 | Low | 18 |
78 | Low | 20 |
79 | High | 7 |
80 | High | 4 |
81 | High | 4 |
82 | High | 16 |
83 | High | 15 |
84 | Low | 9 |
85 | High | 8 |
86 | High | 10 |
87 | Low | 13 |
88 | High | 9 |
89 | Low | 2 |
90 | Low | 22 |
91 | Low | 12 |
92 | High | 6 |
93 | High | 9 |
94 | Low | 20 |
95 | Low | 14 |
96 | High | 7 |
97 | High | 15 |
98 | High | 9 |
99 | High | 2 |
100 | Low | 23 |
Box plot:
we can observe that, Mean number of hours studied by GPA-Low students is greater than Mean number of hours studied by GPA-high students. Now we need to test this statement using 2 sample t-test.
2 sample t-test:
Null hypothesis Ho: There is no difference in mean number of hours studied by GPA-Low students and mean number of hours studied by GPA-High students.
Alternative hypothesis H1: Mean number of hours studied by GPA-Low students is greater than Mean number of hours studied by GPA-high students.
(So this is a right tailed or one tailed test)
Test statistic:
where
By usual definition of mean and standard deviation we get,
Substituting the above values in test statistic equation we get,
t=3.511
and degrees of freedom
Now to draw the conclusion, we need to compare the t value (3.511) with t-distribution value at 5% level of significance () with degrees of freedom 98. (its called critical value)
i.e from t-distibution table we get
Since , we reject the null hypothesis at 5% level of significance.
Which means "Mean number of hours studied by GPA-Low students is greater than Mean number of hours studied by GPA-high students."
Or
"GPA category influence the number of hours studied"
Scatter Plot:
we can observe from above Scatter plot that, there is no linear relationship between, Number of hours studied and GPA category. Since dependent variable GPA category is binary (o or 1) we can try to fit a logistic regression.
Logistic regression Model:
Logistic regression model is given by,
and we get the model,
1. The data set on sheet #1 gives data on GPA category and number of hours studied. Construct com...
The data below represents the total number of credit hours taken during one semester by 30 randomly chosen students. 3, 15, 12, 9, 10, 10, 11, 8, 6, 9, 12, 15, 16, 14, 11, 5, 4, 8, 7, 20, 18, 12, 14, 15, 10, 8, 12, 13, 12,9 Find the frequencies of the data grouped using 6 classes. Credit hours Frequency eeeeeeeeeeeeeeeeeeeeeee 3-5 6-8 9-11 12-14 15-17 18-20 Question Help: Message instructor
Which of the following frequency tables shows a skewed data set? Select all that apply: Value Frequency 5 2 6 5 7 3 8 15 9 11 10 24 11 14 12 12 13 10 14 4 Value Frequency 13 2 14 5 15 14 16 13 17 23 18 26 19 15 20 2 Value Frequency 5 1 6 1 7 9 8 20 9 24 10 20 11 6 12 11 13 5 14 2 15 1 Value...
Use the given data to construct a frequency distribution. 11) The number of people treated in the emergency service of a hospital every day of November was 15 23 12 10 28 7 12 17 20 21 18 13 11 12 26 6 16 19 22 14 17 21 28 9 16 13 11 16 20
Calculate the mean, median, and standard deviation for the total number of candies (per bag). Construct a histogram of the total number of candies (per bag). Use the z-score method to identify any potential outliers and outliers. Assume the total number of candies is normally distributed, calculate the probability that a randomly sampled bag has at least 55 candies in a bag. If a random sample of 50 bags is selected, find the probability that the mean number of candies...
Suppose there are 100 identical firms in the market and the luggage industry is perfectly competitive. What does the market supply curve look like? 20 19 18 17 16 15 14 13 12 11 A 10 9 8 7 6 5 4 20 19 18 17 16 15 14 13 12 11 A 10 8 7 6 2 1 0 0 0 1 2 3 4 5 6 7 8 9 10 11 12 0 1 2 3 4 5...
Review the 6 karyotypes in Figure 10 and determine the chromosomal disorder. Record the chromosomal disorder in Data Table 3. Describe the genotype of each chromosomal disorder and record in Data Table 3. Describe the phenotype of each chromosomal disorder and record in Data Table 3. Data Table 3: Karyotype to Genotype to Phenotype # Chromosomal Disorder Genotype Phenotype 1 2 3 4 5 6 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7...
The data in the table below represent the number of nonconformities per 1000 meters in telephone cable. Sample Number Number of Nonconformities Sample Number Number of Nonconformities 1 1 12 6 2 1 13 9 3 3 14 11 4 7 15 15 5 8 16 8 6 10 17 3 7 5 18 6 8 13 19 7 9 0 20 4 10 19 21 9 11 24 22 20 Construct ‘c’ Chart for nonconformities for these data. Step...
Problem 6. The set (Z19 − {0}, ·19) is a group with the indicated operation; see the attached table. a.) Show that H = {1, 7, 8, 11, 12, 18} is a subgroup. b.) List all the right cosets of H. c.) Show that if Hy = Hx then xy−1 ∈ H. [Make sure to give a reason for each step.] d.) Show that φ : H → Hx defined by φ(h) = hx is one-to-one and onto. [Use the...
2. In the past, I have studied the relationship between student study time (number of hours spent studying) and course grade. The data are below, and they are fake data. Is there a relationship between these two variables? Assume . Report all answers to two decimal places if the answer involves decimal places. Hours Studying X Quiz Score Y 17 7 14 6 12 6 10 8 15 9 14 9 9 4 7 6 18 8 5 0 4...
Game Point_Differential Assists Rebounds Turnovers Personal_Fouls 1 15 15 38 11 9 2 36 20 43 8 13 3 16 21 29 7 13 4 45 22 46 11 11 5 12 11 40 7 22 6 -10 10 31 13 26 7 11 19 45 11 7 8 12 16 32 16 14 9 3 16 27 18 15 10 19 9 34 17 17 11 40 16 41 9 17 12 44 12 29 9 22 13 16 ...