(a) Which points would you consider outliers by visual inspection? Mark these points on a printou...
(a) Which points would you consider outliers by visual inspection? Mark these points on a printout of the figure. (b) Using the following table, state which points would be considered outliers using the distance to kth-nearest neighbor algorithm (assume k-10). Use a threshold of 0.5 on the distance to the 10th nearest neighbor to determine outliers. xy Distance to 10th nearest neighbor 1.02 5.04 4.00 2.50 4.02 5.39 3.31 3.73 1.11 5.09 0.94 4.48 4.07 5.38 4.17 5.85 4.63 3.92 0.15 2.01 0.42 1.13 0.56 0.38 0.72 0.90 (c) Using the following table, state which points would be considered outliers using the LOF algorithm. Use a threshold of 2.5 for the LOF score to determine outliers. x | У | Distance to 10th nearest neighbor 0.83 4.88 1.02 5.04 4.00 2.50 4.02 5.39 3.31 3.73 1.11 5.09 0.94 4.48 4.07 5.38 4.17 5.85 4.63 3.92 2.69 1.02 3.88 0.95 1.57 2.35 (d) Based on your answers to parts (a)-(c), what can you say about the relative performance of the distance to kth-nearest neighbor and LOF algorithms?
(a) Which points would you consider outliers by visual inspection? Mark these points on a printout of the figure. (b) Using the following table, state which points would be considered outliers using the distance to kth-nearest neighbor algorithm (assume k-10). Use a threshold of 0.5 on the distance to the 10th nearest neighbor to determine outliers. xy Distance to 10th nearest neighbor 1.02 5.04 4.00 2.50 4.02 5.39 3.31 3.73 1.11 5.09 0.94 4.48 4.07 5.38 4.17 5.85 4.63 3.92 0.15 2.01 0.42 1.13 0.56 0.38 0.72 0.90 (c) Using the following table, state which points would be considered outliers using the LOF algorithm. Use a threshold of 2.5 for the LOF score to determine outliers. x | У | Distance to 10th nearest neighbor 0.83 4.88 1.02 5.04 4.00 2.50 4.02 5.39 3.31 3.73 1.11 5.09 0.94 4.48 4.07 5.38 4.17 5.85 4.63 3.92 2.69 1.02 3.88 0.95 1.57 2.35 (d) Based on your answers to parts (a)-(c), what can you say about the relative performance of the distance to kth-nearest neighbor and LOF algorithms?