Question

Consider the following setting. You are provided with n training examples: (x1, y1), (x2, y2), ·...

Consider the following setting. You are provided with n training examples: (x1, y1), (x2, y2), · · · , (xn, yn), where xi is the input example, and yi is the class label (+1 or -1). However, the training data is highly imbalanced (say 90% of the examples are negative and 10% of the examples are positive) and we care more about the accuracy of positive examples. How will you modify the perceptron algorithm to solve this learning problem? Please justify your answer.

0 0
Add a comment Improve this question Transcribed image text
Answer #1

ANSWER:

GIVEN THAT:

To consider the following setting. You are provided with n training examples: (x1, y1), (x2, y2), · · · , (xn, yn), where xi is the input.

There are many solution paths available for this problem. Some of them are :

i).Oversampling Minority Class:-
The minority class is used multiple times to train the perceptron. This way the minority input is amplified.

ii).Undersampling Majority Class:-
Remove some of the training tuples from majority class on random. This will reduce the excessive effect of majority class on the perceptron model.

iii).Modify Algorithm to be sensitive to Minority classes:-
Even a single misclassification in minority class has a huge cost because there are already very few cases of it.

Increase the misclassification costs

Add a comment
Know the answer?
Add Answer to:
Consider the following setting. You are provided with n training examples: (x1, y1), (x2, y2), ·...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT