Question

Different data types may need to use different similarity (distance) measures. What is the expected similarity...

Different data types may need to use different similarity (distance) measures. What is the expected similarity measure in each of the following applications:

(a) clustering stars in the universe,

(b) clustering text documents,

(c) clustering clinical test data, and

(d) clustering houses to find delivery centers in a city with rivers and bridges?

0 0
Add a comment Improve this question Transcribed image text
Answer #1

(a) Euclidean distance (spatial and interval based) - straight-line distance between two points in Euclidean space

(b) cosine (vector data) similarity - similarity between two non-zero vectors of an inner product space that measures cosine of angle between them.

(c) asymmetric data (example- Jaccard coefficient) - comparing the similarity and diversity of sample sets

(d) reachable distance (not use direct Euclidean distance), consider the obstacles- situated within easy reach

Add a comment
Know the answer?
Add Answer to:
Different data types may need to use different similarity (distance) measures. What is the expected similarity...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT