Why do we randomly shuffle the training examples before using SGD optimization?

Question

Question

Why do we randomly shuffle the training examples before using SGD optimization?

engineering Computer-Science

Add a comment Improve this question Transcribed image text

Answer 1

Answer #1

Hey here is answer to your query.

Main reason to shuffle training data before using SGD is to get unbiased result for the real gradient. Like mostly data is in ordered manner so if we provide that data to SGD then there is high probability that it will bias the gradient which will eventually lead to poor convergence . which will not be good for our model as then overall accuracy of model will be effected in negative way.

In case of any doubt please comment. Happy Learning :)

Add a comment

Answer 2

Why do we randomly shuffle the training examples before using SGD optimization?

Homework Answers

Add Answer to:
Why do we randomly shuffle the training examples before using SGD optimization?

Post as a guest

Earn Coins

A data set has 600 examples. To properly test the performance of the final hypothesis, you set as...

in titration why do we need to degassed 7 up before titration?

Why do we adjust for stock dividend before its distribution, not after ... for the EPS calculation?

If you are a manager responsible for employee training, what might you now do before, during...

why do data-mining tools expensive and require training?

Explain why do we use a band-pass limiter before the frequency discriminator ? the ans should...

Question 1- Do you agree with Fred’s decision to conduct the training and use the third...

Explain the difference between conformity and obedience (give examples) and why we are all so prone...

PLEASE READ THE QUESTIONS AND DIRECTION CAREFULLY BEFORE ANSWERING. (MICROBIOLOGY LAB) THANK YOU! Why do we...

Why do we need to back up system before penetration testing: Question 13 options: Be able...

Why do we randomly shuffle the training examples before using SGD optimization?

Homework Answers

Add Answer to: Why do we randomly shuffle the training examples before using SGD optimization?

Post as a guest

Earn Coins

Add Answer to:
Why do we randomly shuffle the training examples before using SGD optimization?