Question

solve using gini index and give the decision treecid marital housingHas_loarApproveloan job age young married yes divorced no married single divorced no married married singl

0 0
Add a comment Improve this question Transcribed image text
Answer #1

THERE ARE TWO METHODS FOR DECISION TREE :-

1. INFORMATION GAIN

2. GINI INDEX

IN THIS QUESTION, WE HAVE TO FIND DECISION TREE BY GINI INDEX METHOD.

THE FORMULA USED IN GINI INDEX IS
GiniInde 1 -p

AND TO FIND GINI INDEX FOR ATTRIBUTES WE HAVE TO ADD WEIGHT AND SUM OF EACH GINI INDICES IN THE ATTRIBUTE.

THE GINI INDICES FOR DIFFERENT VARIABLES IS AS FOLLOWS :-

TOTAL CID = 27 attribute age young 12 yes = 6 no = 6 gini index for age = young is 0.5 probality of age probality of age prob

attribute job probality of job probality of job = manager and Approve_loan probality of job = manager and Approve_loan = no i

attribute marital probality of marital = married is 15/27 probality of marital = married and Approve_loan = yes is 3/15 proba

attribute housing probality of housing probality of housing = yes and Approve_loan = yes is 6/12 probality of housing yes and

attribute Has_loan yes is 4/27 yes and Approve_loan yes 4 yes probality of Has_loan probality of Has_loan probality of Has_lo

ATTRIBUTE GINI INDEX 0.42603 age job 0.44277 marital 0.4 housing 0.4 Has_loan 0.40579

THE DECISION TREE FROM GINI INDEX METHOD IS DRAWN BY TAKING THE MINIMUM GINI INDEX VALUE FIRST AND SO ON.

THE DECISION TREE FOR THE GIVEN QUESTION IS AS FOLLOWS :-

HOUSING? .YES NO. MARITAL? MARITAL? SINGLE DIVORCED MARRIED DIVORCED MARRIED SINGLE HAS LOAN? HAS_LOAN? HAS LOAN? HAS LOAN? H

Add a comment
Know the answer?
Add Answer to:
solve using gini index and give the decision tree cid marital housingHas_loarApproveloan job age young married...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • II. Using the spreadsheet provided, please answer each question. You must show work for the problems...

    II. Using the spreadsheet provided, please answer each question. You must show work for the problems where this is indicated. Round all answers to the nearest hundredth where necessary. 5. Name two categories that have qualitative data. 6. Name two categories that have quantitative data that is discrete, 7. Name two categories that have quantitative data that is continuous. 8. Build a frequency chart for the category “Major". 9. Build a relative frequency chart for the category “Number of Siblings"....

  • Using the database created in W1 Assignment, convert each subject's age and height into a z-score....

    Using the database created in W1 Assignment, convert each subject's age and height into a z-score. Using the z-score of ±1.645 for the 5 percent cutoff and the z-score of ±1.96 for the 2.5 percent in the tail, identify the subject identification (ID) number for subjects who are closest to the cutoff for the upper 2.5 percent and 5 percent of the scores and the lower 2.5 percent and 5 percent of the scores. Do this by comparing each participant’s...

  • What is the median for BthWeight rounded to 2 decimal places? Ethnic Smoking BreastFeed Age PreWeight...

    What is the median for BthWeight rounded to 2 decimal places? Ethnic Smoking BreastFeed Age PreWeight DelWeight BthWeight BthLength TimeNut White NonSmoker No 29 115 140 3310 45 99 Black NonSmoker No 33 112 126 2650 48 64 Black NonSmoker No 19 125 145 2900 49 60 White LightSmoker Yes 26 108 146 3500 51.5 102 White NonSmoker Yes 35 112 133 2600 51 77 Black NonSmoker No 20 115 137 3770 52 110 Black NonSmoker Yes 22 99 135...

  • What is the standard deviation for BthWeight rounded to 2 decimal places? Ethnic Smoking BreastFeed Age...

    What is the standard deviation for BthWeight rounded to 2 decimal places? Ethnic Smoking BreastFeed Age PreWeight DelWeight BthWeight BthLength TimeNut White NonSmoker No 29 115 140 3310 45 99 Black NonSmoker No 33 112 126 2650 48 64 Black NonSmoker No 19 125 145 2900 49 60 White LightSmoker Yes 26 108 146 3500 51.5 102 White NonSmoker Yes 35 112 133 2600 51 77 Black NonSmoker No 20 115 137 3770 52 110 Black NonSmoker Yes 22 99...

  • Observation Education (No. of years) Length of tenure in current employment (No. of years) Age (No....

    Observation Education (No. of years) Length of tenure in current employment (No. of years) Age (No. of years) Annual income ($) 1 17 8 40 124,000 2 12 12 41 30,000 3 20 9 44 193,000 4 14 4 42 88,000 5 12 1 22 27,000 6 14 9 28 43,000 7 12 8 43 96,000 8 18 10 37 110,000 9 16 12 36 88,000 10 11 7 39 36,000 11 16 14 42 81,000 12 12 4 23...

  • X Programming Exercise 8.6 | Instructions breezypythongui.py taxformwithgui.py + Q Desktop + ve Add radio button...

    X Programming Exercise 8.6 | Instructions breezypythongui.py taxformwithgui.py + Q Desktop + ve Add radio button options for filing status to the tax calculator program of Project 1. The user selects one of these options to determine the tax rate. The Single option's rate is 20%. The Married option is 15%. The Divorced option is 10%. The default option is Single. 1 2 File: taxformwithgui.py 3 Project 8.6 4 A GUI-based tax calculator. 5 6 Computes and prints the total...

  • II Suppose for our purposes that I wanted information about all students currently taking statistics at...

    II Suppose for our purposes that I wanted information about all students currently taking statistics at GPC. To do this grouped the students by the section they were in then I randomly chose two of the sections. Using this scenario, please answer each of the following: 1. Complete the following chart using the data set provided: Frequency Major Accounting Biology Business Administration Education Nursing Health Science Social Work/Sociology Other Using the chart above, answer the following: a. What is the...

  • please peovide coding in R: with a data of 13 variables and 200 observations Using the...

    please peovide coding in R: with a data of 13 variables and 200 observations Using the variable CLASS, test at 5% significance level to test the claim that the proportions of Freshmen, Sophomores, Juniors, and Seniors are the same. 2. Repeat part (a) for the variable COLLEGE. RLM ENGLISH MATH COMP 21 16 25 839SSESSOR 8 F 17 F N H ABCD 1 SEX HSP GPA AGE CREDITS CLASS COLLEGE MAJOR RESIDENCY TYPE 2 Transfer Blochm Resident F 75 2.39...

  • Question 2 (Using PHStat) A young researcher wants to know what factors affect the Number of...

    Question 2 (Using PHStat) A young researcher wants to know what factors affect the Number of weekly riders. You are asked to help the young researcher to make statistical analysis. Develop two (2) hypotheses Using simple linear regression analysis with a significance level of 5% as the basis of your analysis, please conduct the analysis and interpret the results for both of the hypotheses that you developed. City Number of weekly riders Price per week Population of city Monthly income...

  • ________________ ________________ ______________ 11. Using Excel - Scatter diagrams, estimated regression equations, and trendlines Suppose a...

    ________________ ________________ ______________ 11. Using Excel - Scatter diagrams, estimated regression equations, and trendlines Suppose a company records data on sales calls, induding the length of each call and whether a sale was made. The manager is interested in determining whether there is a relationship between the average time spent per call and the number of sales made by each employee, so she obtains the average call length and the total number of sales over a 2-week period for a...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT