Question

Question 4 1 pts Which of the following reasons is not the reason why the K-means algorithm will likely end up with sub-optim

0 0
Add a comment Improve this question Transcribed image text
Answer #1

Question 4:

correct options: 2,3

Choosing the correct value of k is essential for optimal clustering. There is no fixed value of k, we usually have to try out a range of values of k. A value of k that corresponds to natural number of clusters is a good place to start.

Fast convergence of K-means will not lead to sub-optimal clustering. It may be an indicator of K-means getting stuck in a local optima but it will not cause poor performance. If anything fast convergence can be computationally efficient.

Question 5:

correct options: 1,2,3

Setting initial cluster centers, Computing the distance between every point and cluster center and assigning that point to that cluster center are all steps in K-means implementation.

The cluster centers are modified on every iteration based on the cost function not by random initialization.

Question 6:

correct options: 1,2

The value of k needs to be selected before running K-means.

With every iteration, cost function must either stay the same or decrease or else our K-means algorithm is not working.

Data points can be assigned to any cluster center on any iteration based on the distance between the point and center.

Different initializations may lead to different clusters since kmeans algorithm may stuck in a local optimum and may not converge to global optimum. So it’s recommended to run the algorithm using different initializations of centroids and pick the one with better performance.

Add a comment
Know the answer?
Add Answer to:
Question 4 1 pts Which of the following reasons is not the reason why the K-means...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • 1. apply k-means clustering to a dataset Task Consider the following set of two-dimensional records: RID...

    1. apply k-means clustering to a dataset Task Consider the following set of two-dimensional records: RID Dimension 1 Dimension2 1 00 8 4 5 4 N 3 2 4 4 6 N 5 2. 00 6 00 8 6 Use the k-means algorithm to cluster the data in the dataset with K=3. You can assume that the records with RIDS 1, 3, and 5 are used for the initial cluster centroids (means). You must include the intermediate results in each...

  • Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" t...

    Please write full justification for (a) and (b). Will uprate/vote! 4. K-means The goal of K-means clustering is to divide a set of n points into k< n subgroups of points that are "close" to each other. Each subgroup (or cluster) is identified by the center of the cluster, the centroid (μι, μ2' ··· ,14k) In class, we have seen a brute force approach to solve this problem exactly. Each of the k clusters is represented by a color, e.g.,...

  • 1. Implement the K-means algorithm using these two as a reference. 2.Use Matlab’s implementation of kmeans...

    1. Implement the K-means algorithm using these two as a reference. 2.Use Matlab’s implementation of kmeans to check your results on the fisheriris dataset (https://www.mathworks.com/help/stats/kmeans.html) a. The fisheriris dataset is built into Matlab, and you can load it using ‘load fisheriris’. b. Please note the labels are available for the dataset, so you can check the performance of the kmeans algorithm on the dataset. 274 14 Unsupervised Lnn Fig. 14.1 A two-dimensional domain with clusters of examples weight bot initial...

  • Gi 1D: 11-10,22 = 8,13-6,24 clustering to obtain k 2 clusters by hand. Specifically, 4,15-3,16 2,...

    Gi 1D: 11-10,22 = 8,13-6,24 clustering to obtain k 2 clusters by hand. Specifically, 4,15-3,16 2, perform k-means ven the following six iteims in 1. Start from initial cluster centers c0,2 9. Show your steps for all iterations: (1 the cluster assignments i.... ys: (2) the updated cluster centers at the end of that iteration; (3) the energy at the end of that iteration 2. Repeat the above but start from initial cluster centers c 3. Which k-means solution is...

  • Data clustering and the k means algorithm. However, I'm not able to list all of the...

    Data clustering and the k means algorithm. However, I'm not able to list all of the data sets but they include: ecoli.txt, glass.txt, ionoshpere.txt, iris_bezdek.txt, landsat.txt, letter_recognition.txt, segmentation.txt vehicle.txt, wine.txt and yeast.txt. Input: Your program should be non-interactive (that is, the program should not interact with the user by asking him/her explicit questions) and take the following command-line arguments: <F<K><I><T> <R>, where F: name of the data file K: number of clusters (positive integer greater than one) I: maximum number...

  • K-means clustering Problem 1. (10 pts) Suppose that we have the gene expression values for 5...

    K-means clustering Problem 1. (10 pts) Suppose that we have the gene expression values for 5 genes (G1 to G5) under 4 time points (t1 to t4) as shown in the following table. Please use K-Means clustering to group 5 genes into 2 clusters based on Euclidean distance. Find out the final centroids and their affiliated genes. The initial centroids are c1=(1,2,3,4) and c2=c(9,8,7,6). Please write down your algorithm step by step. Result without steps won't get points. t1 t2...

  • Business Analytics, Assignment on Clustering As part of the quarterly reviews, the manager of a r...

    Business Analytics, Assignment on Clustering As part of the quarterly reviews, the manager of a retail store analyzes the quality of customer service based on the periodic customer satisfaction ratings (on a scale of 1 to 10 with 1 = Poor and 10 = Excellent). To understand the level of service quality, which includes the waiting times of the customers in the checkout section, he collected data on 100 customers who visited the store; see the attached Excel file: ServiceQuality....

  • Hierarchical clustering is sometimes used to generate K clusters, K > 1 by taking the clusters...

    Hierarchical clustering is sometimes used to generate K clusters, K > 1 by taking the clusters at the Kth level of the dendrogram. (Root is at level 1.) By looking at the clusters produced in this way, we can evaluate the behavior of hierarchical clustering on different types of data and clusters, and also compare hierarchical approaches to K-means. The following is a set of one-dimensional points: {6, 12, 18, 24, 30, 42, 48}. (a) For each of the following...

  • K-means clustering K-means clustering is a very well-known method of clustering unlabeled data. The simplicity of...

    K-means clustering K-means clustering is a very well-known method of clustering unlabeled data. The simplicity of the process made it popular to data analysts. The task is to form clusters of similar data objects (points, properties etc.). When the dataset given is unlabeled, we try to make some conclusion about the data by forming clusters. Now, the number of clusters can be pre-determined and number of points can have any range. The main idea behind the process is finding nearest...

  • PLEASE DO QUESTION #2, IGNORE QUESTION 1 PLEASE DO QUESTION #2, IGNORE QUESTION 1 Consider the...

    PLEASE DO QUESTION #2, IGNORE QUESTION 1 PLEASE DO QUESTION #2, IGNORE QUESTION 1 Consider the following information for Questions #1 and #2: Topco, Inc., a manufacturer of self-propelled lawn mowers, wants to learn more about differences between residential lawn mower buyers. The market research department of Topco believes that customers differ in terms of the importance that they attach to the following two lawn mower feotures (1) that the lawn mower allows for bagging (of grass clippings), and (2)...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT