Question

Briefly explain two ways to limit overfitting in constructing a decision tree. Briefly explain the advantages and the weaknesses of decision trees.

Answer #1

Overfitting is a significant practical difficulty for decision tree models and many other predictive models. It happens when the learning algorithm keeps developing hypotheses that reduce training set error at the cost of increased test set error. Two common approaches to avoiding overfitting when building decision trees are:
Pre-pruning, which stops growing the tree early, before it perfectly classifies the training set.
Post-pruning, which lets the tree grow until it perfectly classifies the training set and then prunes it back.
In practice, the second approach, post-pruning an overfit tree, is usually more successful because it is hard to estimate precisely when to stop growing the tree.
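As a rough illustration of pre-pruning, the sketch below shows the kind of stopping rule a tree builder might check before splitting a node. The function name should_stop and the thresholds max_depth, min_samples and min_purity are assumptions made for this example, not values taken from any particular library.

```python
from collections import Counter


def should_stop(labels, depth, max_depth=3, min_samples=5, min_purity=0.9):
    """Return True if a node should become a leaf instead of being split.

    All three thresholds are illustrative assumptions, not recommended values.
    """
    if depth >= max_depth:           # the tree is already deep enough
        return True
    if len(labels) < min_samples:    # too few examples to split reliably
        return True
    # Fraction of examples that belong to the majority class at this node.
    majority = Counter(labels).most_common(1)[0][1]
    return majority / len(labels) >= min_purity   # node is already almost pure


almost_pure = ["yes"] * 19 + ["no"]      # 95% of the labels agree
mixed = ["yes"] * 6 + ["no"] * 6         # evenly mixed labels

print(should_stop(almost_pure, depth=1))  # True: stop, purity threshold reached
print(should_stop(mixed, depth=1))        # False: keep splitting
```

The difficulty mentioned above shows up directly in this sketch: the result depends entirely on thresholds that have to be chosen before the tree is grown.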
The important step in tree pruning is to define a criterion for determining the correct final tree size, using one of the following methods:
Use a distinct dataset from the training set (called a validation set) to evaluate the effect of post-pruning nodes from the tree.
Build the tree using the training set, then apply a statistical test, such as error estimation or significance testing (e.g., the chi-square test), to estimate whether pruning or expanding a particular node is likely to produce an improvement beyond the training set.
Minimum Description Length principle: use an explicit measure of the complexity of encoding the training set and the decision tree, stopping growth of the tree when this encoding size, size(tree) + size(misclassifications(tree)), is minimized.
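The validation-set option in the list above is often implemented as reduced-error pruning. The sketch below is one minimal way to do it, assuming a hand-built binary tree with numeric thresholds; the Node class, the helper names and the toy data are all hypothetical and exist only for this illustration.

```python
class Node:
    """A binary decision tree node (hypothetical structure for this sketch)."""

    def __init__(self, prediction, feature=None, threshold=None, left=None, right=None):
        self.prediction = prediction   # majority training class at this node
        self.feature = feature         # index of the tested feature; None for a leaf
        self.threshold = threshold
        self.left = left
        self.right = right

    def is_leaf(self):
        return self.feature is None


def predict(node, x):
    # Walk down the tree until a leaf is reached.
    while not node.is_leaf():
        node = node.left if x[node.feature] <= node.threshold else node.right
    return node.prediction


def errors(node, rows):
    # Number of (x, y) rows that the subtree rooted at node misclassifies.
    return sum(predict(node, x) != y for x, y in rows)


def prune(node, rows):
    """Bottom-up reduced-error pruning (modifies the tree in place).

    A subtree is replaced by a leaf whenever the leaf makes no more errors
    than the subtree on the validation rows that reach it. A subtree that no
    validation row reaches is therefore also collapsed.
    """
    if node.is_leaf():
        return node
    left_rows = [(x, y) for x, y in rows if x[node.feature] <= node.threshold]
    right_rows = [(x, y) for x, y in rows if x[node.feature] > node.threshold]
    node.left = prune(node.left, left_rows)
    node.right = prune(node.right, right_rows)
    leaf = Node(node.prediction)
    return leaf if errors(leaf, rows) <= errors(node, rows) else node


# Toy overfit tree: the split on feature 1 memorised noise in the training set.
tree = Node("A", feature=0, threshold=5,
            left=Node("A"),
            right=Node("B", feature=1, threshold=2,
                       left=Node("B"), right=Node("A")))

validation = [((1, 0), "A"), ((2, 3), "A"), ((7, 1), "B"), ((8, 4), "B"), ((9, 3), "B")]

before = errors(tree, validation)
pruned = prune(tree, validation)
after = errors(pruned, validation)
print(before, after)              # 2 0
print(pruned.right.is_leaf())     # True: the noisy split was removed
```

Here the root split is kept because collapsing it would raise the validation error, while the lower split is removed because a single leaf does at least as well on the held-out rows.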
The first method is the most common approach. In this approach, the available data are separated into two sets of examples: a training set, which is used to build the decision tree, and a validation set, which is used to evaluate the impact of pruning the tree. The second method is also a common approach. Here, we explain the error estimation and chi-square test.
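As a hedged sketch of the significance-testing idea, the code below computes a chi-square statistic for a two-way split from the class counts in each child and compares it with 3.841, the 5% critical value for one degree of freedom. The function name chi_square_split and the example counts are assumptions made for this illustration.

```python
def chi_square_split(left_counts, right_counts):
    """Chi-square statistic for a binary split.

    left_counts and right_counts hold the class counts in each child node,
    for example [positives, negatives].
    """
    n_left, n_right = sum(left_counts), sum(right_counts)
    total = n_left + n_right
    stat = 0.0
    for k in range(len(left_counts)):
        class_total = left_counts[k] + right_counts[k]
        for observed, n_child in ((left_counts[k], n_left), (right_counts[k], n_right)):
            # Count expected in this child if the split were unrelated to the class.
            expected = class_total * n_child / total
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


CRITICAL_95_DF1 = 3.841   # two classes and two children give 1 degree of freedom

informative = chi_square_split([18, 2], [3, 17])    # children are nearly pure
noisy = chi_square_split([10, 10], [11, 9])         # children look like the parent

for name, stat in (("informative", informative), ("noisy", noisy)):
    decision = "keep split" if stat > CRITICAL_95_DF1 else "prune split"
    print(name, decision)
```

A split whose statistic falls below the critical value is not distinguishable from chance at that level, so it is a candidate for pruning (or, in pre-pruning, for not being made at all).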

The advantages of a decision tree are fairly obvious: it lays out a “path” through possibilities, with alternatives, leading toward a desirable outcome. The tree anticipates dead ends and disastrous missteps, but most importantly it clarifies the difference between controlled and uncontrolled events: which decisions are in the CEO’s power to make, and which must await the outcome of uncontrollable changes. For example, a tree showing ways to use excess capital will show which choices are available now and which must await stock market fluctuations. Another revelation from decision trees is the taxonomy of priorities: for example, is employee maintenance more or less important than stockholder dividends?

The major disadvantage of decision trees is loss of innovation: only past experience and corporate habit go into the “branching” of choices, so new ideas don’t get much consideration. There is a tendency with trees to consider only paths that have been successful in the past, which stultifies thought about changing situations. The trees are usually over-simple and not branched enough, with little consideration given to the “thickness” (value and probability) of each branch. Finally, as with all metaphors, there is a tendency to argue by analogy: phrases like “the roots of the business” or the “seasons of new growth” tend to obfuscate the real debate. So, while trees visualize the decisions to be made, they also condense a complex process into discrete steps (which may be a good or a bad thing).
