1 / 21

Interactive Interaction Analysis

Interactive Interaction Analysis. Aleks Jakulin & Gregor Leban Faculty of Computer and Information Science University of Ljubljana Slovenia. Overview. Interactions : Correlation can be generalized to more than 2 attributes, to capture interactions - higher-order regularities.

taline
Download Presentation

Interactive Interaction Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Interactive Interaction Analysis Aleks Jakulin & Gregor Leban Faculty of Computer and Information Science University of Ljubljana Slovenia

  2. Overview • Interactions: • Correlation can be generalized to more than 2 attributes, to capture interactions - higher-order regularities. • Information theory: • A non-parametric approach for measuring ‘association’ and ‘uncertainty’. • Applications: • Visualizations of the domain uncover previously unseen structure. • Software for interactive investigation of data assists the user in identifying interesting patterns. • Importance: • Understanding possible problems and assumptions in machine learning algorithms.

  3. label C importance of attribute A importance of attribute B attribute attribute A B attribute correlation 3-Way Interaction: What is common to A, B and C together; and cannot be inferred from any subset of attributes. 2-Way Interactions Attribute Dependencies

  4. Entropy given C’s empirical probability distribution (p = [0.2, 0.8]). H(C|A) = H(C)-I(A;C) Conditional entropy --- Remaining uncertainty in C after knowing A. H(A) Information which came with knowledge of A H(AB) Joint entropy I(A;C)=H(A)+H(C)-H(AC) Mutual information or information gain --- How much have A and C in common? Shannon’s Entropy A C

  5. Interaction Information I(A;B;C) := I(AB;C) - I(A;C) - I(B;C) = I(A;B|C) - I(A;B) • Interaction information can be: • POSITIVE – synergy between attributes • NEGATIVE – redundancy among attributes • SMALL – nothing special about the 3-way relationship

  6. The only type of odor that does not unambiguously predict the class of the mushroom (edible, inedible). Examples: A Useful Attribute Mutual information or information gain between the attribute and the label.

  7. Another Useful Attribute

  8. A Negative Interaction The proportion of information provided by either of the two attributes. This is the “overlap” between both mutual informations.

  9. That’s the gain of s-p-c if we already know the odor. A Negative Interaction The only column where spore-print-color succeeded in providing some information in excess of what we already knew from odor.

  10. One Somewhat Useful Attribute

  11. A (Seemingly) Useless Attribute Stalk-shapeis totally uninformative, as the class distribution is similar at all attribute values. That’s why we cannot distinguish between classes using this attribute.

  12. Surprise: A Positive Interaction! Information gained by holistic treatment of both attributes! Again, this is “new” mutual information arising from both attributes.

  13. Why a Positive Interaction? Specific attribute value combinations that yield perfect label predictions, but only in combination of both attributes

  14. Whole Domain: Interaction Matrix

  15. Interaction Graph

  16. a weakly negative interaction an unimportant interaction a positive interaction a cluster of negatively interacting attributes An Interaction Dendrogram

  17. synergy redundancy Information Diagram A dissected Venn diagram helps investigate higher-order interactions.

  18. Multi-Dimensional Scaling

  19. Interactive Interaction Analysis Attributes of interest A sorted list of interactions, ordered by the interaction magnitude. An interaction graph

  20. Summary • There are relationships exclusive to groups of n attributes. • Interaction information is a heuristic for quantification of relationships with entropy. • Visualization methods attempt to: • summarize the interactions in the domain (interaction graph, interaction dendrogram), • assist the user in exploring the domain and constructing classification models (interactive interaction analysis).

  21. Work in Progress • Overfitting: the interaction information computations do not account for the increase in complexity. • Support for numerical and ordered attributes. • Inductive learning algorithms which use these heuristics automatically. • Models that are based on the real relationships in the data, not on our assumptions about them.

More Related