Question

11. One way to design a spam filter is to look at the words in an email. In particular, some words are more frequent in spam emails. Suppose that we have the following information:

  • 50% of emails are spam
  • 1% of spam emails contain the word refinance
  • 001% of non-spam emails contain the word refinance

Suppose that an email is checked and found to contain the word refinance. What is the probability that the email is spam?

Answer #1

Bayes' Theorem: P(A | B) = P(A and B) / P(B) = P(B | A) x P(A) / P(B)

P(spam) = 50% = 0.5, so P(not spam) = 1 - 0.5 = 0.5

P(word refinance is found | spam) = 1% = 0.01

P(word refinance is found | not spam) = 0.1% = 0.001

(In the question this is typed as 001%; it is taken here as 0.1%, assuming a typing error.)

P(spam | word refinance is found) = P(spam and word refinance is found) / P(word refinance is found)

= P(spam and word refinance is found) / (P(spam and word refinance is found) + P(not spam and word refinance is found))

= (0.5 x 0.01) / (0.5 x 0.01 + 0.5 x 0.001)

= 0.005 / 0.0055

= 0.9091
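
The arithmetic above can be checked with a short Python sketch. The function name posterior_spam and its parameter names are made up for illustration; they are not part of the original question. The value 0.001 relies on the same assumption as the answer, namely that "001%" was meant to be 0.1%. The loop at the end shows how the result would change under other possible readings of that figure.

```python
def posterior_spam(p_spam, p_word_given_spam, p_word_given_ham):
    """Return P(spam | word) using Bayes' theorem.

    The denominator P(word) is expanded with the law of total
    probability over the two cases: spam and not spam.
    """
    p_ham = 1.0 - p_spam                      # P(not spam)
    joint_spam = p_spam * p_word_given_spam   # P(spam and word)
    joint_ham = p_ham * p_word_given_ham      # P(not spam and word)
    return joint_spam / (joint_spam + joint_ham)


# Values used in the answer; 0.001 ASSUMES "001%" means 0.1%.
answer = posterior_spam(0.5, 0.01, 0.001)
print(round(answer, 4))

# Hypothetical alternative readings of the garbled "001%" figure.
for label, p in [("0.1%", 0.001), ("0.01%", 0.0001), ("0.001%", 0.00001)]:
    print(label, round(posterior_spam(0.5, 0.01, p), 4))
```

With the 0.1% reading the sketch agrees with the 0.9091 result above. The smaller the non-spam rate is taken to be, the closer the probability gets to 1, so the final number depends on which reading of "001%" was intended.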
