Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

Question

Question

Please write full justification for (a) and (b). Will uprate/vote!

4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are close to ea

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

a) True. We have k clusters. So each of the n points can belong to any one of the k clusters, thus having k options per item. So , total number of possible clusters for 2 items = k for the first item multiplied by k for the second item=k^2. Likewise for n items, we have k^n ways of assigning colors to each point.

b) Naively searching through all possible ways of coloring a set of n points will take k^n, in this case 5^100 units of time = 25^50 units of time = 25^48 sec. This is greater than 10^17 sec. The naive approach hence will fail in this case. We should remember that only 1 of the 5^100 arrangements is the correct answer.

K means works as follows.

Input : number of clusters we want (k) and number of data-points we have(n)

initially all points are randomly scattered and unassigned. w
we randomly assign m points with m distinct colors ( initial centers)
for all remaining points:
we calculate distance from the k centers and assign the point the color

of the center closest to it.

Then we update the value of the centers of every cluster with the average coordinate value and update the center to that point closest to the average center.
repeat entire process till no point updates its color from the previous setting

TIME COMPLEXITY: O(t*k*n*d) where t is number of iterations, k=number of clusters, n=number of points and d=number of dimensions each point has ( for 2D grid coordinates, its value is 2 ). Thus this approach is way better than the naive approach.

Add a comment

Answer 2

Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

Homework Answers

Add Answer to:
Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

Post as a guest

Earn Coins

K-means clustering K-means clustering is a very well-known method of clustering unlabeled data. The simplicity of...

K-means clustering Problem 1. (10 pts) Suppose that we have the gene expression values for 5...

Data clustering and the k means algorithm. However, I'm not able to list all of the...

Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

Homework Answers

Add Answer to: Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

Post as a guest

Earn Coins

K-means clustering K-means clustering is a very well-known method of clustering unlabeled data. The simplicity of...

K-means clustering Problem 1. (10 pts) Suppose that we have the gene expression values for 5...

Data clustering and the k means algorithm. However, I'm not able to list all of the...

Add Answer to:
Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...