2. Consider a 'random' DNA database of length n 106. Consider a search with query AACGACG- TACGAT (length-13), with the following parameters: 1 for a match, -1 for mis-match, and -00 for gaps...
2. Consider a 'random' DNA database of length n 106. Consider a search with query AACGACG- TACGAT (length-13), with the following parameters: 1 for a match, -1 for mis-match, and -00 for gaps (no gaps allowed). The database yielded a score of 11. (a) Give an E-value assuming that Pr[A] = Pr[C] = Pr[G] = Pr[T] = 0.25 (20pts) (b) Would you consider the hit statistically interesting? Explain. (5pts) (Hint: Recall that the E-value is the expected number of hits you would find with a score of 11 or higher.) (25 pts.)
2. Consider a 'random' DNA database of length n 106. Consider a search with query AACGACG- TACGAT (length-13), with the following parameters: 1 for a match, -1 for mis-match, and -00 for gaps (no gaps allowed). The database yielded a score of 11. (a) Give an E-value assuming that Pr[A] = Pr[C] = Pr[G] = Pr[T] = 0.25 (20pts) (b) Would you consider the hit statistically interesting? Explain. (5pts) (Hint: Recall that the E-value is the expected number of hits you would find with a score of 11 or higher.) (25 pts.)