
Stat 324 – Day 35

Learn how to create classification trees and regression trees to predict outcomes based on explanatory variables. Use various metrics to assess model quality and explore the concept of interaction in tree models.

stoops


Presentation Transcript


  1. Stat 324 – Day 35 Classification and Regression Trees (15.5)

  2. Recap – Classification Trees • With a categorical response, we can perform successive binary splits of the data set according to various explanatory variables • Goal is to create groups where the probability of success is close to zero or one, to minimize prediction errors • Use R², AICc, RMSE to judge quality of model • Continue splitting until these level off? • Following the branches enables you to predict the outcome of new observations
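A single binary split, as recapped above, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data are made up, the function names `gini` and `best_split` are hypothetical, and Gini impurity is used as the purity measure for simplicity; the slides do not say which splitting criterion the course software uses.

```python
def gini(labels):
    """Gini impurity 2p(1-p) for a list of 0/1 labels (0 means a pure group)."""
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n  # proportion of successes in the group
    return 2 * p * (1 - p)


def best_split(x, y):
    """Try every midpoint between adjacent distinct x values and return the
    (threshold, weighted impurity) of the split x < t versus x >= t that
    leaves the two groups as pure as possible."""
    pairs = list(zip(x, y))
    values = sorted(set(x))
    best = (None, gini(y))  # start from the unsplit data
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2
        left = [yi for xi, yi in pairs if xi < t]
        right = [yi for xi, yi in pairs if xi >= t]
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
        if score < best[1]:
            best = (t, score)
    return best


# Hypothetical data: number of purchases and whether the customer upgraded (1) or not (0).
purchases = [10, 20, 30, 40, 50, 60]
upgraded = [0, 0, 0, 1, 1, 1]

threshold, impurity = best_split(purchases, upgraded)
print(threshold, impurity)
```

A full tree would repeat this search inside each resulting group, across all explanatory variables, until the quality measures stop improving.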

  3. Recap – Classification Trees

  4. Practice problem • Yes, there is evidence of an interaction because people with extra credit cards were split at 49.73 purchases, and people without extra credit cards were split at 35.93 purchases. This suggests that people who make more purchases are more likely to upgrade their credit card if they already have extra cards. Meanwhile, people who do not have extra cards are more likely to upgrade their card if they've made fewer purchases than the people with extra cards.
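One way to see why different cutoffs signal an interaction is to write the two branches as code. The sketch below uses only the two cutoffs quoted in the answer (49.73 and 35.93); the function name `leaf` is hypothetical, and the choice of "below" versus "at or above" for the two sides of each split is an assumption, since the tree itself is not reproduced in the transcript.

```python
def leaf(extra_cards: bool, purchases: float) -> str:
    """Return the terminal group for a customer. The purchases cutoff depends on
    whether the customer has extra cards, which is what makes this an interaction."""
    cutoff = 49.73 if extra_cards else 35.93  # cutoffs quoted in the practice problem
    side = "at or above" if purchases >= cutoff else "below"  # assumed split direction
    group = "extra cards" if extra_cards else "no extra cards"
    return f"{group}, purchases {side} {cutoff}"


# The same number of purchases lands on different sides of the split
# depending on the other variable.
print(leaf(True, 40))
print(leaf(False, 40))
```

If there were no interaction, both branches would split purchases at the same cutoff, and the effect of purchases would not depend on whether the customer has extra cards.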

  5. Practice problem

  6. Practice problem (figure panels: No interaction; With interaction)

  7. Interaction?

  8. Interaction?

  9. Interaction?

  10. CART

  11. Trees vs. Models
      Models • Prediction equation • Rate of change • Sensitive to outliers • Stepwise regression to assess variable importance • Interactions, quadratic • Quantitative or categorical response • Validate theory
      Trees • Still making predictions • More flexible • Robust with outliers • Splitting history to assess variable importance • Interactions, quadratic • Quantitative or categorical response

  12. Project 2 Comments • Scoring; Underlining • Watch terminology: positive association, model vs. data, beta’s are significant, standardizing • Software package, intercept and slope interpretations • Seemed to work best when • Narrowed down to a few good variables, then looked at saturated model, then narrowed in on final model • Much improved integrating output into discussion • Project 3 • Aim for more efficient discussion • Video in lieu of class presentation

  13. Announcements • No official class meeting Wed/Thur • Nick available here for project/video questions • Email me project questions! • Can also do Zoom chats in evening • Course evaluations • Two! • Please!

  14. Where to go from here • Design of Experiments: Stat 323 • Survey data: Stat 421 • Using R: Stat 331 (after CPE 101) • Using SAS: Stat 330 • Time series, correlated data: Stat 418 • Statistical learning: Stat 4xx, Data 301 • Categorical data analysis: Stat 418 • Clustering, PCA: Stat 419 • Probability: Stat 305 (Bayesian Stat 4xx)
