Problem

Optimal binary search trees. Suppose we know the frequency with which keywords occur in pr...

Optimal binary search trees. Suppose we know the frequency with which keywords occur in programs of a certain language, for instance:

begin	5 %
do	40%
else	8%
end	4%
if	10%
then	10 %
while	23 %

We want to organize them in a binary search tree, so that the keyword in the root is alphabetically bigger than all the keywords in the left subtree and smaller than all the keywords in the right subtree (and this holds for all nodes).

Figure 6.12 has a nicely-balanced example on the left. In this case, when a keyword is being looked up, the number of comparisons needed is at most three: for instance, in finding “while”, only the three nodes “end”, “then”, and “while” get examined. But since we know the frequency with which keywords are accessed, we can use an even more fine-tuned cost function, the average number of comparisons to look up a word. For the search tree on the left, it is

cost = 1(0.04) + 2(0.40 + 0.10) + 3(0.05 + 0.08 + 0.10 + 0.23) = 2.42.

By this measure, the best search tree is the one on the right, which has a cost of 2.18.

Give an efficient algorithm for the following task.

Input: n words (in sorted order); frequencies of these words:

P1, p2,..., pn.

Output: The binary search tree of lowest cost (defined above as the expected number of comparisons in looking up a word).

Step-by-Step Solution

Request Professional Solution

Request Solution!

We need at least 10 more requests to produce the solution.

0 / 10 have requested this problem solution

The more requests, the faster the answer.

Request! (Login Required)

All students who have requested the solution will be notified once they are available.

Add your Solution

Textbook Solutions and Answers Search

Solutions For Problems in Chapter 6

Free Homework Help App

Download From Google Play

Scan Your Homework
to Get Instant Free Answers

Need Online Homework Help?

Ask a Question

Get Answers For Free
Most questions answered within 3 hours.

Recent Solutions