1 / 17

Contingency Tables – Part II – Getting Past Chi-Square?

Contingency Tables – Part II – Getting Past Chi-Square?. Measures of Association – A Review. What is the difference between a significance test statistic and a measure of association? How are they related? The basic questions about associations between variables?

roslyns
Download Presentation

Contingency Tables – Part II – Getting Past Chi-Square?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Contingency Tables – Part II – Getting Past Chi-Square?

  2. Measures of Association – A Review What is the difference between a significance test statistic and a measure of association? How are they related? The basic questions about associations between variables? Does an association exist (vs. independence)? What is form (& direction) of the relation? What is the magnitude (strength) of the relation? Association Effect size

  3. “Strength of Association What does “association” mean? Covariance Agreement Predictability (reduction in errors/ignorance) Characteristics of association measures? Coefficient should range between 0 (= no association) and 1 (+1) Coefficient should not be directly affected by N Coefficient should be independent of a variable’s scale of measurement (its “metric”) Coefficient values should be interpretable (intuitively or mathematically)

  4. “Strength of Association (cont.) A number of different measures of association (coefficients) are available: Based on different levels of measurement Based on different interpretive models How to choose among them? Identify levels-of-measurement of both variables Identify if you have a clear independent variable  may use a directional or a nondirectional coefficient Identify which coefficients are most commonly used or most interpretable

  5. Measurement Level Situations: Association between 2 numerical variables? Coefficient = Pearson’s r r2 = proportion of variance “in common” May use Spearman’s rif data are ranks Association between 1 categoric and 1 numeric variable? (as in ANOVA) Coefficient of Association = eta (ή) eta-squared = proportion of variance “between groups” In SPSS, use Descriptives Cross-tabs or Compare Means  Meansprocedures

  6. Association between 2 categoric variables Different approaches to nonparametric measures of association Chi-square-based  Correct for degrees of freedom and sample size Uncertainty/Errors of Prediction (PRE/PRU)  Improved Predictability of Y given knowledge of X Concordance/agreement  Proportion of shared or correspondent values Note: coefficients for Ordinal and Nominal variables are slightly different  Coeff. limited by the lowest level variable

  7. Strength of Association (continued) Association between 2 Nominal variables (or 1 nominal + 1 ordinal variable) Chi-square-derived: Contingency coefficient, C (forget it!) Cramer’s V coefficient use this for 3x3 or larger tables Phi coefficient, Φ use this for 2x2 tables (or 2x3 tables) PRE-derived : Lambda (asymmetric) (λyx <> λxy)

  8. Phi-Coefficient formula: Cramer’s V formula:

  9. Strength of Association (continued) Association between 2 Ordinal variables Concordance-based (PRE) statistics: Gamma, γ most commonly used(note: in cases of 2x2 tables, gamma = Yules Q) Others? Kendall’s tau; Somer’sd (less used) Rank-order statistics: Spearman’s Rho ,  Use if many categories & few ties Must convert scores to ranks Can also use Chi-square-based measures Will generally yield lower values

  10. Nonparametric Measures of Association: Summary Recap Nominal variables Phi, Φ for 2x2 tables (or 2x3) Kramer’s Vfor 3x3 tables or larger Ordinal variables Gamma, γ most commonly used Yules Q same statistic in a 2x2 table Spearman’s r if many values & few ties Can also use Phi and Kramer’s V

  11. Nonparametric Measures of Association: Summary (continued) Different kinds of coefficients will not yield the same values on the same crosstabulation Gamma (& Yules Q) will almost always compute higher values than Kramer’s V (& Phi) on the same tables Note that 2x2 tables (with binary variables) are somewhat of a special case – Why?

  12. Non-Parametric measures of association How to Compute them? By Hand: see formulas in the textbook Chi-square-based = easiest to compute Gamma = more laborious by hand Note: X & Y variables in crosstab must be formatted in the same direction for ordinal statistics (e.g., Gamma) In SPSS: Click Statistics box in Crosstabs pop-up menu, then select appropriate coefficients (Note: do not select them all)

  13. II. Multivariate analysis of associations Going beyond bivariate analysis to multivariate analyses We often wish to consider more than two variables at a time because other variables may be involved in more complex patterns Termed “Partialling” or “Elaborating”  statistically consider: confounding effects of additional variables  “spurious relationships” Complicating effects of additional variables  “contingent relationships”

  14. Multivariate Analysis (continued) In cross-tabulations, crosstabs are “nested within levels of other variables Compute separate sub-crosstabs within each category or level of the 3rd variable See the example on the handout Partialing is only useful when the extra variable is associated with both X and Y Then we wish to remove the extra covariation Otherwise, it’s a waste of time

More Related