1 / 15

Constraint-based Association Analysis in Data Mining

Explore constraints in frequent itemset mining algorithms, understand monotone and antimonotone constraints, and their impact on pruning search space and constraint checking. Learn about Apriori and FP-growth algorithms with various constraints.

Download Presentation

Constraint-based Association Analysis in Data Mining

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 732A02 Data Mining -Clustering and Association Analysis • Constrained frequent itemset mining ………………… Jose M. Peña jose.m.pena@liu.se

  2. Constraints • A constraint C(.) is • Monotone • If C(A) then C(B) for all A, B st A  B. • E.g. A’  A. • Antimonotone • If C(A) then C(B) for all A, B st B  A. • Or, if not C(B) then not C(A) for all A, B st B  A. • E.g. support ≥ min_support. • The apriori property applies to any antimonotone constraint.

  3. Constraints • sum(S.Price)  v is monotone (positive prices). • min(S.Price)  v is monotone. • range(S.Price) 15 is monotone. • Itemset ab satisfies C • So does every superset of ab

  4. Constraints • sum(S.Price)  v is antimonotone (positive prices). • sum(S.Price)  v is not antimonotone. • range(S.Price) 15 is antimonotone. • Itemset ab violates C • So does every superset of ab

  5. Constraints

  6. Apriori algorithm + any constraint Database D L1 C1 Scan D C2 C2 L2 Scan D L3 C3 Scan D Constraint: Sum{S.price} < 5, where item price equals item id

  7. Apriori algorithm + antimonotone constraint Prune search space Database D L1 C1 Scan D C2 C2 L2 Scan D L3 C3 Scan D Constraint: Sum{S.price} < 5, where item price equals item id

  8. Apriori algorithm + monotone constraint Does not prune search space but avoids constraint checking Database D L1 C1 Scan D C2 C2 L2 Scan D ☺ Not in the output, since they don’t satisfy the constraint ☺ ☺ L3 C3 Scan D ☺ Constraint: Sum{S.price} ≥ 5, where item price equals item id

  9. FP grow algorithm + antimonotone constraint • Remove items that do not satisfy the constraint. • If the conditioning itemset α does not satisfy the constraint, then do not generate α nor its conditional database. • Let β denote the frequent items in the conditional database of α. If α U β satisfies the constraint, then do not check the constraint in the conditional database of α. Similar to Apriori (prune search space) Specific of FP grow (avoids constraint check)

  10. FP grow algorithm + monotone constraint • If the conditioning itemset α satisfies the constraint, then do not check the constraint in its conditional database.

  11. Constraints • avg(S.Price)  v and avg(S.Price) ≥v are neither monotone nor antimonotone. • Convertible monotone • If there exists an item order R such that • If C(A) then C(B) for all A and B respecting R such that A is a suffix of B. • E.g. avg(S.Price) ≥v wrt decreasing price order. • Convertible antimonotone • If there exists an item order R such that • If C(A) then C(B) for all A and B respecting R such that B is a suffix of A. • Or, if not C(B) then not C(A) for all A and B respecting R such that B is a suffix of A. • E.g. avg(S.Price) ≥v wrt to increasing price order.

  12. Constraints • avg(X)  25 is convertible monotone wrt descending item price order R: < a, f, g,d, b, h, c, e> • If an itemset d satisfies a constraint C, so do itemsets fd and afd, which have d as a suffix. • avg(X)  25 is convertible antimonotone wrt ascending item price item order R-1: < e, c, h, b, d, g, f, a > • If an itemset dfa satisfies a constraint C, so do itemsets fa and a, which are suffixes of dfa. • Thus, avg(X)  25 is strongly convertible. • Check that avg(X)  25 is also strongly convertible.

  13. Constraints

  14. Constraints Monotone Antimonotone Strongly convertible Convertible antimonotone Convertible monotone Inconvertible avg(S)-median(S)=0

More Related