Question

Implement the following machine learning tasks, utilizing Regression techniques e Prediction e Classification eFeature Reduct
0 0
Add a comment Improve this question Transcribed image text
Answer #1

The Machine Learning Task which can be performed using classification is Credit Card Fraud Detectionwhere we will classify whether a particular transaction by the customer is a fraud transaction or not on the basis of the previous transactions which are done by the Customer.

In this task the inputs will provided by a particular dataset which is available very easily online at various platforms in the form of csv files.(e.g. creditcard.csv). Once we have the data, we can specifically tell about the inputs or the labels but some which are present are Time of the transactions, Valid Transactions, Fraud Cases etc.

The output of this task will be first to analyze the dataset using our Machine Learning algorithm and thenpredicting whether a given transaction from the test set is valid or fraud based upon the training set and the training which has taken place.

The goal will be to design our algorithm in such a way that we get high prediction accuracy or the classification accuracy of test dataset as valid or fraud transactions i.e. if a transaction is a fraud transaction it should be classified as fraud and valid transaction if it is actually a valid one.

For data preparation, we can do the data cleaning which involves filling up the missing values, removing noisy data, smoothing out data types inconsistencies and removing outliers.We can also go for data reduction in which we can remove data which is insignificant with respect to our result obtaining process.Data transformation is another technique which can help in data preparation, through this we cantransform large values into smaller values or integers into float for obtaining precise and quick result.

For collecting the training data, what we can do is that dataset which we will use can be divided into two parts i.e. in the ratio of 70:30 where 70% of the dataset will be training set and 30% will be test set. The ratio can vary a bit and even we can try different proportions to obtain better accuracy. But it is recommended to use 70-80% of the dataset as training set otherwise we may suffer from the problem of underfitting or overfitting of the data.

The ground-truth label for all samples is obtained through the training which we will be providing to our machine learning model, in this our model will understand the various kinds of labels which are required in the output of the samples which are provided.

Add a comment
Know the answer?
Add Answer to:
Implement the following machine learning tasks, utilizing Regression techniques e Prediction e Classification eFeature Reduction Feature...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm...

    Classification in Python: Classification In this assignment, you will practice using the kNN (k-Nearest Neighbors) algorithm to solve a classification problem. The kNN is a simple and robust classifier, which is used in different applications. The goal is to train kNN algorithm to distinguish the species from one another. The dataset can be downloaded from UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/machine-learning-databases/iris/ (Links to an external site.)Links to an external site.. Download `iris.data` file from the Data Folder. The Data Set description...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT