Question

Given the training set below: Age Income Student Credit_rating Buys_computer <=30 high no fair no <=30...

Given the training set below:

Age

Income

Student

Credit_rating

Buys_computer

<=30

high

no

fair

no

<=30

high

no

excellent

no

31…40

high

no

fair

no

>40

medium

no

fair

yes

<=30

low

no

fair

yes

>40

high

no

fair

no

>40

low

yes

fair

yes

>40

low

yes

excellent

no

31…40

low

yes

excellent

yes

<=30

medium

no

fair

no

<=30

low

yes

fair

yes

>40

medium

yes

fair

yes

<=30

medium

yes

excellent

no

31…40

medium

no

excellent

no

31…40

high

yes

fair

yes

>40

medium

no

excellent

yes

The information gain for attribute Credit_rating is:

Group of answer choices

1. 0.049

2. 0.244

3. 0.261

4. 0.488

5. 0.004

0 0
Add a comment Improve this question Transcribed image text
Answer #1

In column Credit_rating, we have 6 excellents and 10 fairs. So, Entropy of Credit_rating is
-\frac{6}{16}log_2(\frac{6}{16})-\frac{10}{16}log_2(\frac{10}{16})
= 0.954

Now, we split Credit_rating based on Buys_computer. When Buys_computer is yes, we have 6 fairs and 2 excellents. When Buys_computer is no, we have 4 fairs and 4 excellents.
Entropy of Buys_computer=yes is
-\frac{6}{8}log_2(\frac{6}{8})-\frac{2}{8}log_2(\frac{2}{8})

= 0.811

Entropy of Buys_computer=no is
-\frac{4}{8}log_2(\frac{4}{8})-\frac{4}{8}log_2(\frac{4}{8})

= 1

Entropy of Credit_rating based on Buys_computer is
0.5*0.811+0.5*1
= 0.9055

So, Information gain from Credit_rating is 0.954 - 0.9055 = 0.0485 ~ 0.049

So, the answer is 0.049

Add a comment
Know the answer?
Add Answer to:
Given the training set below: Age Income Student Credit_rating Buys_computer <=30 high no fair no <=30...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • Group of answer choices (Select all that apply) 1. Age: 37; Income: medium; Student: no; Credit_rating:...

    Group of answer choices (Select all that apply) 1. Age: 37; Income: medium; Student: no; Credit_rating: excellent; 2. Age: 45; Income: low; Student: yes; Credit_rating: excellent; 3. Age: 32; Income: high; Student: no; Credit_rating: fair; 4. Age: 30; Income: medium; Student: no; Credit_rating: excellent; 5. Age: 40; Income: low; Student: yes; Credit_rating: fair; 6. Age: 39; Income: medium; Student: yes; Credit_rating: fair; Given the decision tree in the image, which of the following are rules extracted from it? (Select all...

  • Group of answer choices 1. IF Age=31..40 AND Income=medium THEN Buys_computer=yes; 2. IF Age<=30 AND Income=low...

    Group of answer choices 1. IF Age=31..40 AND Income=medium THEN Buys_computer=yes; 2. IF Age<=30 AND Income=low THEN Buys_computer=no; 3. IF Age<=30 AND Student=yes THEN Buys_computer=no; 4. IF Age<=30 AND Credit_rating=excellent THEN Buys_computer=yes; 5. IF Age>40 AND Credit_rating=excellent THEN Buys_computer=no; 6. IF Age=31..40 THEN Buys_computer=yes; Given the decision tree in the image, which of the following are rules extracted from it? (Select all that apply) age? <=30 31..40 >40 student? income? credit rating? no yes excellent low medium, high fair no...

  • Question 3 Given the following data points, use the K-Nearest Neighbours (kNN) (k=5) to find the class for age<=30, income=medium, student=yes, credit-rating=fair. Show your calculations and the fi...

    Question 3 Given the following data points, use the K-Nearest Neighbours (kNN) (k=5) to find the class for age<=30, income=medium, student=yes, credit-rating=fair. Show your calculations and the final clusters. For similarity measure use a simple match of attribute values: Similarity(A,B)= equals bi and 0 otherwise. ai and b i are either age, income, student or credit_rating. Weights are all 1 except for income it is 2. 5%, *c(a,,b,)/4 ,h ) is 1 ifai where cla -1 studentcredit rating dass RID...

  • Given the training data in Question 1 below| (on buying RRSP8), predict the class of the...

    Given the training data in Question 1 below| (on buying RRSP8), predict the class of the following new example using k-nearest-neighbor classification fork = 5: sector = oil industry, income = medium, self-employed = yes, credit-rating fair. For distance measure, use the following similarity measure: similarity(tupleAtupleB)-4-.(w"S(ab/4), where S(ab) is 1 if parameter a equals parameter b and o otherwise The parameters atand biare either Sector, income, self-employed, or credit-rating. The weights wiare all 1, except for income, which is 2....

  • Given the training data in Question 1 below| (on buying RRSP8), predict the class of the following new example using k...

    Given the training data in Question 1 below| (on buying RRSP8), predict the class of the following new example using k-nearest-neighbor classification fork = 5: sector = oil industry, income = medium, self-employed = yes, credit-rating fair. For distance measure, use the following similarity measure: similarity(tupleAtupleB)-4-.(w"S(ab/4), where S(ab) is 1 if parameter a equals parameter b and o otherwise The parameters atand biare either Sector, income, self-employed, or credit-rating. The weights wiare all 1, except for income, which is 2....

  • Analyze a relationship between two variables Determine the specific function relating the two variables All of...

    Analyze a relationship between two variables Determine the specific function relating the two variables All of the above d. 7. Professor Stanton thought that verbal ability was a function of age. Stanton collected data on people aged 5 to 20. The analysis yielded the following equation: V-20+3A where V verbal ability and A age. Stanton was asked to predict the verbal ability of a 30 year old.Stanton should respond a. 110 b. 90 c. Extrapolation is untrustworthy d. Interpolation is...

  • ntroduce your scenario and data set. Provide a brief overview of the scenario you are given...

    ntroduce your scenario and data set. Provide a brief overview of the scenario you are given above and the data set that you will be analyzing. Classify the variables in your data set. Which variables are quantitative/qualitative? Which variables are discrete/continuous? Describe the level of measurement for each variable included in your data set. Discuss the importance of the Measures of Center and the Measures of Variation. What are the measures of center and why are they important? What are...

  • Read the scenario given below. With your group, examine the data set (on page 2) and...

    Read the scenario given below. With your group, examine the data set (on page 2) and decide what analysis should be done. Perform this analysis. Record your results and submit this file on Blackboard by the end of the day. Each student must submit a copy of their results. Scenario: Students are often late to class. This is distracting to both students and teachers, and is quite annoying. The school and instructors realize that some students have long commutes, but...

  • can you please do 30, 31, 37, 39b and show work? 30) A machine has a...

    can you please do 30, 31, 37, 39b and show work? 30) A machine has a probability 0.03 of producing a defective glf b When 3 golf balls are manufactured by this machine, find the probability that none of the golf balls is defective 31) When three people are selected randomly, find the probability that their birthdays are all different 32) The mean credit card debt for college students in the U.S. is $2100. Is 2100 a parameter or statistic?...

  • 2 (00 polnt) Reaoareh ProposalL You wore given a data set with the folowing varlables based...

    2 (00 polnt) Reaoareh ProposalL You wore given a data set with the folowing varlables based on a survey of 850 adults: VerlableLevel Level of Measurement Additional Datalls or Type of Variable Annual income Number of years of education beyond high school Categorical (ordina) 18-30, 31-45, 45-60 Age group Number of years of experience Numerical Gender (nominal)African American Race Asian American White (non-Hispanic) Two or more Other Marital status (nominal) Never married Categorical (nomina) Rent Rent or own Own Other...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT