Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

Question

Question

Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

Classification in Python: Classification

In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm to solve a classification problem. The kNN is a simple and robust classifier, which is used in different applications. The goal is to train kNN algorithm to distinguish the species from one another. The dataset can be downloaded from UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/machine-learning-databases/iris/ (Links to an external site.)Links to an external site.. Download `iris.data` file from the Data Folder. The Data Set description with the definitions of all the columns can be found on the dataset page - https://archive.ics.uci.edu/ml/datasets/Iris (Links to an external site.)Links to an external site..

(1 point): Load the data from the file (`iris.data`) into the DataFrame. Set the names of columns according to the column definitions given in Data Description.

(2 points): Data inspection. Display the first 5 rows of the dataset and use any relevant functions that can help you to understand the data. Prepare 2 scatter plots - `sepal_width` vs `sepal_length` and `petal_width` vs `petal_length`. Scatter plots should show each class in different color (`seaborn.lmplot` is recommended for plotting).

(2 points): Prepare the data for classification. Using the pandas operators prepare the feature variables `X` and the response `Y` for the fit. Note that `sklean` expects data as arrays, so convert extracted columns into arrays.

(1 point): Split the data into `train` and `test` using `sklearn` `train_test_split` function.

(2 points): Run the fit using `KNeighborsClassifier` from `sklearn.neighbors`. First, instantiate the model, Then, run the classifier on the training set.

(3 points): Use learning model to predict the class from features, run prediction on `X` from test part. Show the accuracy score of the prediction by comparing predicted iris classes and the `Y` values from the test. Comparing these two arrays (predicted classes and test `Y`), count the numbers of correct predictions and predictions that were wrong. (HINTS: `NumPy` arrays can be compared using `==` operator. You can also use `NumPy`'s operator `count_nonzero` to count number of non-False values).

(4 points): In this task, we want to see how accuracy score and the number of correct predictions change with the number of neighbors `k`. We will use the following number of neighbors `k`: 1, 3, 5, 7, 10, 20, 30, 40, and 50: Generate 10 random train/test splits for each value of `k` Fit the model for each split and generate predictions Average the accuracy score for each `k` Calculate the average number of correct predictions for each `k` as well Plot the accuracy score for different values of `k`. What conclusion can you make based on the graph?

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

Add a comment

Answer 2

Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

Homework Answers

Add Answer to:
Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

Post as a guest

Earn Coins

I need to create the "nnclassifier using the euclidean distance formula to find nearest neighbor. I...

The key purpose of splitting the dataset into training and test sets is A) To speed...

I'm using Python 3.7 with Spyder I need the full code and the same output as...

This is my code: import numpy as np import pandas as pd import sys from keras.models...

r studio/ Python : In this assignment, we are working with manuscripts and their reviews from a famous CS conference, ICLR (International Conference on Learning Representations). This is a top conference in computer science on machine learning.(Python not

Performance Metrics: Which of the following are terms used for performance metrics a. Specificity & Precision...

In python, write the following program, high school question, please keep simple. When I submitted the...

ONLY DO NUMBER 3 For this project you will test claims and conjectures using hypothesis testing. ...

ONLY DO NUMBER 7 For this project you will test claims and conjectures using hypothesis testing. ...

Problem statement For this program, you are to implement a simple machine-learning algorithm that uses a...

Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

Homework Answers

Add Answer to: Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

Post as a guest

Earn Coins

Add Answer to:
Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...