Question

1) Assuming that data mining approaches are to be used in the following cases, identify whether the task required is supervised or unsupervised learning:

a. A service that has a large number of people repeatedly label photos as to whether or not they contain an object of interest, then wants to use that data to label new photos.

b. A photo-sorting app that takes all your photos of people and automatically divides them into different groups, each group ideally containing photos that contain a single person.

c. An IT team that wants to identify a network data packet as dangerous (e.g., virus or hacker attack) based on comparison to other packets whose threat status is known.

d. Predicting whether a start-up company will succeed based on comparing its financial data to those of similar past start-ups that both succeeded and did not.

e. A real-estate app that seeks to predict the sales price of a home in a certain neighborhood, given information on recent sales in the area.

f. A grocery store that wants to use sales data regarding items bought together by its customer base to offer coupons to customers on items they might like to try.

g. When biologists started studying living creatures and were interested in establishing a taxonomy that would group living things according to various natural similarities and differences.

h. A news filtering service that seeks to find news articles that contain similar topics.

i. Identifying children with similar learning styles in a classroom so that targeted learning materials can be used for the different groups.

j. Political candidates attempting to identify groups of voters with similar demographic characteristics to prepare platform statements to address each group’s unique concerns.

2) In the game Scrabble, a player has a tray of 7 letter tiles. Consider the 7-letter word SCIENCE and answer the following questions.

a. Calculate the probability of randomly selecting each unique letter from this word.

b. Using those probabilities, calculate the entropy of this set of letters.

c. What would be the reduction in entropy (i.e., the information gain), if you split these letters into two sets, one containing the vowels and one containing the consonants?

d. What is the maximum possible entropy in a set of 7 Scrabble tiles?

e. In general, which is preferable when you are playing Scrabble: a set of letters with high entropy, or a set of letters with low entropy?

Answer #1

1)

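In supervised learning, each training example comes with both input variables and a known output variable (a label or target value).

In unsupervised learning, only input variables are given; the structure in the data has to be discovered without labels.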
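The difference shows up directly in what each kind of method takes as input. The following is a minimal sketch in plain Python, not a reference implementation: the function names, the toy data, and the choice of nearest-neighbor and k-means are all illustrative assumptions. The supervised function needs the labels; the clustering function sees only the points.

```python
from math import dist

def nearest_neighbor_predict(train_x, train_y, query):
    """Supervised: label a new point using examples whose labels are known."""
    best = min(range(len(train_x)), key=lambda i: dist(train_x[i], query))
    return train_y[best]

def k_means(points, initial_centers, iterations=10):
    """Unsupervised: group unlabeled points around k centers (plain k-means).

    Initial centers are passed in explicitly so the result is deterministic.
    """
    centers = list(initial_centers)
    k = len(centers)
    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assign every point to its closest center.
        assignment = [min(range(k), key=lambda c: dist(p, centers[c]))
                      for p in points]
        # Move each center to the mean of the points assigned to it.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:
                centers[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return assignment

# Hypothetical toy data: two well-separated groups of 2-D points.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = ["safe", "safe", "safe", "dangerous", "dangerous", "dangerous"]

predicted = nearest_neighbor_predict(points, labels, (5.1, 5.0))  # uses labels
clusters = k_means(points, [points[0], points[-1]])               # no labels used
```

Here `predicted` is a label taken from the training data, while `clusters` is only a list of group indices; naming the groups is left to whoever inspects them.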

a. A service that has a large number of people repeatedly label photos as to whether or not they contain an object of interest, then wants to use that data to label new photos.

output variable - whether or not the photo contains the object of interest.

It is supervised learning, as the output variable is defined by the human-provided labels.

b. A photo-sorting app that takes all your photos of people and automatically divides them into different groups, each group ideally containing photos that contain a single person.

No output variable. The photos are not labeled with who appears in them; the app has to discover the groups itself from the similarity between faces.

It is unsupervised learning (clustering), as no output variable is defined.

c. An IT team that wants to identify a network data packet as dangerous (e.g., virus or hacker attack) based on comparison to other packets whose threat status is known.

output variable - whether or not the network data packet is dangerous.

It is supervised learning, as the output variable is defined (the threat status of the earlier packets is known).

d. Predicting whether a start-up company will succeed based on comparing its financial data to those of similar past start-ups that both succeeded and did not.

output variable - whether or not a start-up company will succeed.

It is supervised learning, as the output variable is defined (the outcomes of the past start-ups are known).

e. A real-estate app that seeks to predict the sales price of a home in a certain neighborhood, given information on recent sales in the area.

output variable - the sales price of a home (a numeric value, so this is a regression task).

It is supervised learning, as the output variable is defined (the prices of recent sales are known).

f. A grocery store that wants to use sales data regarding items bought together by its customer base to offer coupons to customers on items they might like to try.

No output variable. We are not given labels saying which coupons to offer; the associations between items have to be discovered from the purchase data.

It is unsupervised learning, as no output variable is defined.

g. When biologists started studying living creatures and were interested in establishing a taxonomy that would group living things according to various natural similarities and differences.

No output variable. We are not told in advance which categories exist or which similarities and differences should define them; the groups have to be discovered.

It is unsupervised learning, as no output variable is defined.

h. A news filtering service that seeks to find news articles that contain similar topics.

No output variable. We are not given topic labels for the articles; similar articles have to be found from their content alone.

It is unsupervised learning, as no output variable is defined.

i. Identifying children with similar learning styles in a classroom so that targeted learning materials can be used for the different groups.

No output variable. We are not given a learning-style label for each child; the groups have to be discovered from the data.

It is unsupervised learning, as no output variable is defined.

j. Political candidates attempting to identify groups of voters with similar demographic characteristics to prepare platform statements to address each group’s unique concerns.

No output variable. We are not given predefined voter groups; they have to be discovered from the demographic data.

It is unsupervised learning, as no output variable is defined.
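
2)

Parts a through d can be checked numerically. The sketch below assumes Shannon entropy in bits (log base 2) and treats A, E, I, O, U as the vowels; the function names are illustrative, not part of the question.

```python
from collections import Counter
from math import log2

def probabilities(letters):
    """Probability of drawing each distinct letter at random."""
    counts = Counter(letters)
    total = len(letters)
    return {ch: n / total for ch, n in counts.items()}

def entropy(letters):
    """Shannon entropy in bits: -sum(p * log2(p)) over distinct letters."""
    return -sum(p * log2(p) for p in probabilities(letters).values())

def information_gain(letters, subsets):
    """Entropy of the full set minus the size-weighted entropy of the subsets."""
    total = len(letters)
    weighted = sum(len(s) / total * entropy(s) for s in subsets)
    return entropy(letters) - weighted

word = "SCIENCE"
vowels = [ch for ch in word if ch in "AEIOU"]          # I, E, E
consonants = [ch for ch in word if ch not in "AEIOU"]  # S, C, N, C

probs = probabilities(word)                           # part a
h_word = entropy(word)                                # part b
gain = information_gain(word, [vowels, consonants])   # part c
h_max = entropy("ABCDEFG")                            # part d: 7 distinct tiles
```

With these definitions, C and E each have probability 2/7 and S, I, N each have probability 1/7. The entropy of SCIENCE is about 2.236 bits, the information gain from the vowel/consonant split is about 0.985 bits, and the maximum for 7 tiles is log2(7), about 2.807 bits, reached when all seven letters are different.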
