A large supermarket tracks sales data by stock- keeping unit (SKU) for each item, such as “pasta”, “rice”, “egg”, “soup” is identified by a numerical SKU. The supermarket has a database of transactions where each transaction is a set of SKUs that were bought together. The database of transactions consist of following itemsets:
Transaction ID |
Itemsets |
T1 |
egg, soup |
T2 |
rice, egg, soup |
T3 |
rice, soup |
T4 |
pasta, rice, egg, soup |
T5 |
rice, egg |
T6 |
pasta, rice, soup |
T7 |
pasta, rice |
Solve the above problem using Apriori Algorithm (Association Rule Mining) with minimum support of 3 and write each and every step.
Answer : Given that,
Minimum support count = 3
All the items have support count more than 3.
Hence, the frequent itemsets are egg-soup, egg-rice, soup-rice, rice-pasta.
Next , we generate association rules for each frequent itemset by calculating its confidence percent using the given formulae :
A large supermarket tracks sales data by stock- keeping unit (SKU) for each item, such as...
A large supermarket tracks sales data by stock- keeping unit (SKU) for each item, such as “pasta”, “rice”, “egg”, “soup” is identified by a numerical SKU. The supermarket has a database of transactions where each transaction is a set of SKUs that were bought together. The database of transactions consist of following itemsets: Transaction ID Itemsets T1 egg, soup T2 rice, egg, soup T3 rice, soup T4 pasta, rice, egg, soup T5 rice, egg T6 pasta, rice, soup T7 pasta,...
Question 3 (10 marks) A large supermarket tracks sales data by stock- keeping unit (SKU) for each item, such as “pasta”, “rice”, “egg”, “soup” is identified by a numerical SKU. The supermarket has a database of transactions where each transaction is a set of SKUs that were bought together. The database of transactions consist of following itemsets: Transaction ID Itemsets T1 egg, soup T2 rice, egg, soup T3 rice, soup T4 pasta, rice, egg, soup T5 rice, egg T6 pasta,...
Question 3 (10 marks) A large supermarket tracks sales data by stock-keeping unit (SKU) for each item, such as "pasta", "rice", "egg", "soup" is identified by a numerical SKU. The supermarket has a database of transactions where each transaction is a set of SKUs that were bought together. The database of transactions consist of following itemsets: Transaction ID Itemsets T1 egg, soup T2 rice, egg, soup T3 rice, soup T4 pasta, rice, egg, soup Copyright © 2015-2018 VIT, All Rights...
I
need help with a data mining problem
Consider the following transaction dataset. T1: a, d, e T2: a, b, c, e T2: a, b, d, e T4: a, c, d, e T5: b, c, e T6: b, d, e T7:c, d T8: a, b, d a) Compute the support for itemsets {e}, {b, d}, and {b, d, e}. b) Compute the confidence for the association rules {b, d} rightarrow {e} and {e} rightarrow {b, d). c) Is confidence a...
We are given four items, namely A, B, C, and D. Their corresponding unit profits are pA, pB, pC, and pD. The following shows five transactions with these items. Each row corresponds to a transaction where a non-negative integer shown in the row corresponds to the total number of occurrences of the correspondence item present in the transaction. T A B C D t1 0 0 3 2 t2 3 4 0 0 t3 0 0 1 3 t4 1...