The students' test scores are as follows: 29, 81, 68, 43, 65, 45, 69, 78, 81, 72, 88, 99, 95, 24, 77, 52. Partition them into four bins by (1) the equal-frequency (equi-depth) method, (2) the equal-width method, and (3) an even better method (such as clustering).
1) Equal-Frequency Method
The data set provided is: 29, 81, 68, 43, 65, 45, 69, 78, 81, 72, 88, 99, 95, 24, 77, 52
First, sort the elements in ascending order:
24, 29, 43, 45, 52, 65, 68, 69, 72, 77, 78, 81, 81, 88, 95, 99
Next, find the number of elements (frequency) in each partition:
Frequency = (Total number of elements) ÷ (Number of partitions)
In our case, Frequency = 16 ÷ 4 = 4
Now, working from the left of the sorted data set, place each successive group of 4 elements into a bin:
Bin 1: 24, 29, 43, 45
Bin 2: 52, 65, 68, 69
Bin 3: 72, 77, 78, 81
Bin 4: 81, 88, 95, 99
Note: 81 appears in both Bin 3 and Bin 4 because the data set contains two occurrences of 81, and the aim of this approach is to give every bin the same number of data elements, even if that splits equal values across bins.
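The equal-frequency steps above can be sketched in Python as follows. This is only an illustrative sketch: the function name `equal_frequency_bins` is hypothetical, and the rule for uneven splits (earlier bins take the extra elements) is an assumption that does not matter here because 16 divides evenly by 4.

```python
def equal_frequency_bins(values, n_bins):
    """Split the sorted values into n_bins bins of (nearly) equal size."""
    ordered = sorted(values)
    size, extra = divmod(len(ordered), n_bins)
    bins, start = [], 0
    for i in range(n_bins):
        # Assumption: when the split is uneven, the first `extra` bins
        # each take one additional element.
        end = start + size + (1 if i < extra else 0)
        bins.append(ordered[start:end])
        start = end
    return bins


scores = [29, 81, 68, 43, 65, 45, 69, 78, 81, 72, 88, 99, 95, 24, 77, 52]
bins = equal_frequency_bins(scores, 4)
for number, members in enumerate(bins, start=1):
    print(f"Bin {number}: {members}")
```

Because the split is purely positional, the two 81s land in different bins, as in the note above.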
2) Equal-Width Method
The data set provided is: 29, 81, 68, 43, 65, 45, 69, 78, 81, 72, 88, 99, 95, 24, 77, 52
Maximum element in the data set (Max) = 99
Minimum element in the data set (Min) = 24
Number of bins to be created (N) = 4 ... (given)
For the equal-width method, the width is given by the formula:
Width = (Max - Min) / N
Width = (99 - 24) / 4 = 18.75, which can be rounded to the nearest integer, 19
So the width in this case is 19.
Bin: elements, range
Bin 1: 24, 29 [24, 43)
Bin 2: 43, 45, 52 [43, 62)
Bin 3: 65, 68, 69, 72, 77, 78 [62, 81)
Bin 4: 81, 81, 88, 95, 99 [81, 100)
Every element of the data set that lies in a given range is placed in the corresponding bin.
In interval notation, [x, y) means from x to y, including x but excluding y.
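As a rough check of the equal-width assignment, here is a minimal Python sketch. The function name `equal_width_bins` and its optional `width` parameter are hypothetical; the sketch assumes half-open ranges that start at the minimum and places any value at or beyond the last boundary in the final bin.

```python
def equal_width_bins(values, n_bins, width=None):
    """Assign values to n_bins half-open ranges of equal width, starting at min(values)."""
    low = min(values)
    if width is None:
        # Unrounded width from the formula (max - min) / N.
        width = (max(values) - low) / n_bins
    bins = [[] for _ in range(n_bins)]
    for v in sorted(values):
        # Clamp the index so the maximum value falls in the last bin.
        index = min(int((v - low) // width), n_bins - 1)
        bins[index].append(v)
    return bins


scores = [29, 81, 68, 43, 65, 45, 69, 78, 81, 72, 88, 99, 95, 24, 77, 52]
width_bins = equal_width_bins(scores, 4, width=19)
for number, members in enumerate(width_bins, start=1):
    print(f"Bin {number}: {members}")
```

For this data set, the rounded width of 19 and the unrounded width of 18.75 happen to produce the same four bins.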
3) Clustering (k-Means)
K-means clustering is a simple unsupervised learning algorithm used to solve clustering problems. It partitions a given data set into a number of clusters, here denoted "N," which is fixed beforehand.
Here we have to create four clusters, so N = 4.
Step 1: Create 4 clusters and randomly assign the data elements to them.
The data set provided is: 29, 81, 68, 43, 65, 45, 69, 78, 81, 72, 88, 99, 95, 24, 77, 52
Random clusters, with the mean value of each cluster's elements:
Cluster 1: { 24, 43, 29, 68 } (24+43+29+68) ÷ 4 = 41
Cluster 2: { 81, 81, 77, 69 } (81+81+77+69) ÷ 4 = 77
Cluster 3: { 88, 95, 99, 65 } (88+95+99+65) ÷ 4 = 86.75
Cluster 4: { 72, 52, 78, 45 } (72+52+78+45) ÷ 4 = 61.75
Step 2: Form new clusters by moving a data element to another cluster whenever its distance from its own cluster's mean is greater than its distance from that other cluster's mean, so that each element ends up with the nearest mean.
(Example: the distance of 68, currently in Cluster 1, from the mean of Cluster 1 is |68 - 41| = 27, whereas its distance from the mean of Cluster 4 is |68 - 61.75| = 6.25. Since 68 is closer to the mean of Cluster 4, it moves into Cluster 4. The same rule is applied to every other element, giving the new clusters shown below.)
Clusters, with newly calculated means:
Cluster 1: { 24, 29, 43, 45 } (24+29+43+45) ÷ 4 = 35.25
Cluster 2: { 72, 77, 78, 81, 81 } (72+77+78+81+81) ÷ 5 = 77.8
Cluster 3: { 88, 95, 99 } (88+95+99) ÷ 3 = 94
Cluster 4: { 52, 65, 68, 69 } (52+65+68+69) ÷ 4 = 63.5
Step 3: Repeat the reassignment using the new means, again moving a data element to another cluster whenever it is closer to that cluster's mean than to its own.
Clusters, with newly calculated means:
Cluster 1: { 24, 29, 43, 45 } (24+29+43+45) ÷ 4 = 35.25
Cluster 2: { 72, 77, 78, 81, 81 } (72+77+78+81+81) ÷ 5 = 77.8
Cluster 3: { 88, 95, 99 } (88+95+99) ÷ 3 = 94
Cluster 4: { 52, 65, 68, 69 } (52+65+68+69) ÷ 4 = 63.5
Since no data element moved from one cluster to another, the algorithm has converged. The final clusters formed by k-means clustering are:
Cluster 1: { 24, 29, 43, 45 }
Cluster 2: { 72, 77, 78, 81, 81 }
Cluster 3: { 88, 95, 99 }
Cluster 4: { 52, 65, 68, 69 }
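The iteration above can be reproduced with a small Python sketch. It is not a general k-means implementation: the function name `kmeans_1d` is hypothetical, it starts from the same initial assignment used in this answer rather than a fresh random one, it works only on one-dimensional data, and it assumes no cluster ever becomes empty.

```python
def kmeans_1d(initial_clusters, max_iterations=100):
    """Reassign 1-D values to the nearest cluster mean until nothing moves."""
    clusters = [sorted(c) for c in initial_clusters]
    for _ in range(max_iterations):
        # Assumption: every cluster is non-empty, so each mean is defined.
        if any(len(c) == 0 for c in clusters):
            raise ValueError("a cluster became empty; this sketch does not handle that")
        means = [sum(c) / len(c) for c in clusters]
        reassigned = [[] for _ in clusters]
        for cluster in clusters:
            for v in cluster:
                # Move each value to the cluster whose mean is nearest.
                nearest = min(range(len(means)), key=lambda i: abs(v - means[i]))
                reassigned[nearest].append(v)
        reassigned = [sorted(c) for c in reassigned]
        converged = reassigned == clusters
        clusters = reassigned
        if converged:
            break
    return clusters


# The initial assignment from Step 1 of this answer.
initial = [
    [24, 43, 29, 68],
    [81, 81, 77, 69],
    [88, 95, 99, 65],
    [72, 52, 78, 45],
]
final_clusters = kmeans_1d(initial)
for number, members in enumerate(final_clusters, start=1):
    print(f"Cluster {number}: {members}")
```

A different initial assignment may converge to different clusters, since k-means only finds a local optimum.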