Let S denote the event that an email is spam, and S^c the event that it is not spam.
Let D denote the event that an email is detected as spam, and D^c the event that it is not detected as spam.
Given:
P(S) = 0.1
P(S^c) = 1 - 0.1 = 0.9
P(D|S) = 0.98
P(D^c|S) = 1 - 0.98 = 0.02
P(D^c|S^c) = 0.9
P(D|S^c) = 1 - 0.9 = 0.1
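The problem statement is truncated, so the exact quantity being asked for is not known. A natural next step with these values is the probability that an email is actually spam given that it was detected as spam, P(S|D), which Bayes' theorem gives via the law of total probability. The sketch below assumes that is the target; the function name is illustrative.

```python
# Values taken from the setup above.
p_s = 0.1             # P(S): prior probability that an email is spam
p_d_given_s = 0.98    # P(D|S): spam email is detected as spam
p_d_given_ham = 0.1   # P(D | not spam): non-spam email is detected as spam

def posterior_spam_given_detected(p_s, p_d_given_s, p_d_given_ham):
    """Return (P(D), P(S|D)) using total probability and Bayes' theorem."""
    # P(D) = P(D|S) P(S) + P(D|not S) P(not S)
    p_d = p_d_given_s * p_s + p_d_given_ham * (1.0 - p_s)
    # P(S|D) = P(D|S) P(S) / P(D)
    return p_d, p_d_given_s * p_s / p_d

p_d, p_s_given_d = posterior_spam_given_detected(p_s, p_d_given_s, p_d_given_ham)
print(f"P(D)   = {p_d:.4f}")
print(f"P(S|D) = {p_s_given_d:.4f}")
```

Under these numbers P(D) = 0.098 + 0.090 = 0.188, so P(S|D) = 0.098 / 0.188, or roughly 0.52. About half of the flagged emails are not spam because spam is rare to begin with.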
Problem 1 (Bayes' theorem and spam filters) Suppose you have developed a new algorithm to detect...
The five most common words appearing in spam emails are shipping!, today!, here!, available, and fingertips!. Many spam filters separate spam from ham (email not considered to be spam) through application of Bayes' theorem. Suppose that for one email account, 1 in every 10 messages is spam and the proportions of spam messages that have the five most common words in spam email are given below.
Problem 5. (12 points) Bayes' Theorem is a popular tool for spam filtering. You are asked to design a spam filtering algorithm based on whether certain words appear in an email. You were given 1,000 randomly selected emails that entered an email server. You examined these emails and manually labeled each one either as spam or ham (i.e., non-spam). You found that 400 emails are spam and 600 are ham. In these 400 spam emails, you found 200 of them...
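The first step in a problem like this is usually to turn the labeled counts into prior probabilities. The sketch below does only that, using the counts stated above. What the "200 of them" refers to is cut off in the problem text, so it is deliberately left out.

```python
# Counts stated in the problem.
n_total = 1000
n_spam = 400
n_ham = 600

# Empirical priors: the fraction of labeled emails in each class.
p_spam = n_spam / n_total
p_ham = n_ham / n_total

print(f"P(spam) = {p_spam:.2f}")
print(f"P(ham)  = {p_ham:.2f}")
```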
The five most common words appearing in spam emails are shipping!, today!, here!, available, and fingertips!. Many spam filters separate spam from ham (email not considered to be spam) through application of Bayes' theorem. Suppose that for one email account, 1 in every 10 messages is spam and the proportions of spam messages that have the five most common words in spam email are given below.

Word          Proportion of spam messages
shipping!     0.049
today!        0.044
here!         0.034
available     0.013
fingertips!   0.013

Also suppose that the...
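A filter of this kind would typically compute the probability that a message is spam given that it contains one of these words. The sketch below shows that calculation. The prior and the spam proportions come from the problem, but the proportion of ham messages containing the word is cut off in the source, so the value used here (`HYPOTHETICAL_P_WORD_GIVEN_HAM`) is a made-up placeholder, not data from the problem.

```python
P_SPAM = 0.1  # from the problem: 1 in every 10 messages is spam

# Proportion of spam messages containing each word (from the problem).
P_WORD_GIVEN_SPAM = {
    "shipping!": 0.049,
    "today!": 0.044,
    "here!": 0.034,
    "available": 0.013,
    "fingertips!": 0.013,
}

def posterior_spam_given_word(word, p_word_given_ham, p_spam=P_SPAM):
    """P(spam | word) via Bayes' theorem for a single word."""
    numerator = P_WORD_GIVEN_SPAM[word] * p_spam
    denominator = numerator + p_word_given_ham * (1.0 - p_spam)
    return numerator / denominator

# ASSUMPTION: placeholder only; the real ham proportion is not in the source.
HYPOTHETICAL_P_WORD_GIVEN_HAM = 0.001

result = posterior_spam_given_word("shipping!", HYPOTHETICAL_P_WORD_GIVEN_HAM)
print(f"P(spam | 'shipping!') = {result:.4f}  (with a placeholder ham rate)")
```

Substituting the ham proportions from the full problem statement for the placeholder gives the actual answer.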
Suppose 80% of the incoming email messages for a college’s computer system are spam. 1. Use the CLT to approximate the probability that in a random sample of 200 incoming email messages at this college, the sample proportion of these messages that are spam would exceed .75. 2. Display your answer to question 1 as a shaded area in a well-labeled sketch. 3. After implementing a new spam blocker, if it turns out that a random sample of 200 messages...
Suppose 80% of the incoming email messages for a college’s computer system are spam. 1. Use the CLT to approximate the probability that in a random sample of 160 incoming email messages at this college, the sample proportion of these messages that are spam would exceed .70. 2. Display your answer to question 1 as a shaded area in a well-labeled sketch. 3. After implementing a new spam blocker, if it turns out that a random sample of 160...
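For part 1 of both versions of this question, the CLT says the sample proportion is approximately normal with mean p and standard error sqrt(p(1 - p)/n). The sketch below applies that approximation without a continuity correction; the function names are illustrative.

```python
import math

def normal_cdf(z):
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_sample_proportion_exceeds(p, n, threshold):
    """CLT approximation of P(sample proportion > threshold)."""
    standard_error = math.sqrt(p * (1.0 - p) / n)
    z = (threshold - p) / standard_error
    return 1.0 - normal_cdf(z)

# The two variants stated above: (sample size, threshold).
for n, threshold in [(200, 0.75), (160, 0.70)]:
    prob = prob_sample_proportion_exceeds(0.8, n, threshold)
    print(f"n={n}, threshold={threshold}: P = {prob:.4f}")
```

For n = 200 the z-score is about -1.77, giving a probability of roughly 0.96. For n = 160 the z-score is about -3.16, giving a probability of roughly 0.999.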
Spam filters try to sort your e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points to the sender, the subject, key words in the message, and so on. The higher the point total the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that cutoff passes through your...
Spam Filters* Spam filters try to sort your incoming e-mails, deciding which are real messages and which are unwanted. One method used is a point system. The filter reads each incoming e-mail and assigns points according to the sender, the subject, key words in the message, and so on. The higher the point total the more likely it is that the message is unwanted. The filter has a cutoff value for the point total; any message rated lower than that...
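The point system described above can be sketched as follows. Every rule, point value, address, and the cutoff itself are hypothetical and chosen only for illustration; the text does not specify any of them.

```python
from dataclasses import dataclass

@dataclass
class Email:
    sender: str
    subject: str
    body: str

# ASSUMPTION: all of the rules and values below are made up for illustration.
SUSPICIOUS_SENDERS = {"promo@example.com"}
SENDER_POINTS = 3
SUBJECT_POINTS = {"free": 2, "winner": 3}
KEYWORD_POINTS = {"shipping!": 2, "today!": 1, "here!": 1}
CUTOFF = 5

def score(email):
    """Total the points assigned to the sender, subject, and key words."""
    points = SENDER_POINTS if email.sender in SUSPICIOUS_SENDERS else 0
    subject = email.subject.lower()
    points += sum(p for word, p in SUBJECT_POINTS.items() if word in subject)
    body = email.body.lower()
    points += sum(p for word, p in KEYWORD_POINTS.items() if word in body)
    return points

def passes_filter(email, cutoff=CUTOFF):
    """A message rated lower than the cutoff passes through."""
    return score(email) < cutoff

spam = Email("promo@example.com", "You are a WINNER", "Free shipping! Order today!")
ham = Email("alice@example.org", "Lunch tomorrow", "Are you free at noon?")
print(score(spam), passes_filter(spam))
print(score(ham), passes_filter(ham))
```

Raising the cutoff lets more real messages through but also more spam; lowering it does the opposite.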