A Bayesian, Meta Cost-Benefit Model

A Bayesian, Meta Cost-Benefit Model John Roman, Ph.D. P. Mitchell Downey District of Columbia Crime Policy Institute Partnership for Greater Washington Research The Urban Institute The Brookings Institution The Brookings Institution June 30, 2010

Presentation Overview • Background • Existing Models • DCPI • Drug Courts • Meta-Analysis • Proposed Model • Analytic Strategy • Results • Next Steps

General Approach Estimate: • Costs of drug court in DC; • Benefits of reduced crime (general); • Expected benefits for a DC drug court population; • Effect of drug court on criminal behavior; • Translate effects into benefits, difference the costs.

Presentation Overview Background Existing Models DCPI Drug Courts Meta-Analysis Proposed Model Analytic Strategy Results Next Steps

The Origins of DCPI Funded by the Justice Grants Administration (JGA) in the Washington, D.C. Mayor’s office using Recovery Act funds. Key goal was to create an entity similar to the Washington State Institute of Public Policy (WSIPP). WSIPP is a non-partisan, non-profit that provides evidence to support the Legislature; Created in the late 1980s, WSIPP (under the leadership of Steve Aos and Roxanne Lieb) conducted research funded by the legislature to inform decision-making.

The Origins of DCPI In 1998, WSIPP researchers create a meta, cost-benefit model (often called the Aos model). Performs meta analysis across a range of policies and programs; Links to Washington state specific costs of operations; Monetizes outcomes; Prioritizes policy choices and makes recommendations to the legislature. In 2008, recommended funding several cost-effective initiatives. Savings from those programs allowed plans to build two new prisons to be shelved. Passed by the Legislature.

How Can Existing Models be Improved? The WSIPP model embodies a quantum leap is evidence-based policymaking. However, there are two (intractable) problems with the model: First, general problem in meta analysis that if the underlying studies are not identified, no aggregate causal relationship can be established; Second, the model does not account for uncertainty in several steps of the estimation process, nor does it account for uncertainty in combining multiple estimates;

Identification Issues in the Existing Model Consider the question of whether DC should implement an Adult Drug Court in Washington. In order to believe that there is a causal relationship between drug court participation and successful outcomes (Y) must believe: YT > YC and, that assignment is exogenous, such that in the absence of drug court YT = YC If assignment is not exogenous and YT ≠ YC then no causal attribution is possible. Problem gets worse with aggregation. We have not solved this identification problem.

Integrating Uncertainty into the Model If model is not identified, and causal attribution is not possible, must include all sources of uncertainty into the model. Two types of uncertainty: Estimation. Each step in the process should yield both a point estimate and a distribution. Aggregation. Final results should integrate uncertainty from each estimate. WSIPP model produces a single point estimate with no confidence interval. Cannot determine probabilistically whether expected outcomes will occur. Problem akin to comparison of fingerprints and DNA

Integrating Uncertainty into the Model, con’t Why isn’t uncertainty included in the WSIPP model? The problem was intractable using classical techniques at the time the model was conceived due to limitations in computing power. Also, difficult to revisit estimates in a policy environment. Solution: Bayesian inference allows distributions rather than ‘moments’ to be aggregated, maximizing how much uncertainty is included in the model.

Donald Rumsfeld on the Importance of Incorporating Uncertainty "Reports that say that something hasn't happened are always interesting to me, because as we know, there are known knowns; there are things we know we know. We also know there are known unknowns; that is to say we know there are some things we do not know. But there are also unknown unknowns -- the ones we don't know we don't know.“

Other Benefits of a Bayesian, Meta, Cost-Benefit Analysis The model is designed to be replicable; Does not requires abundant reputational capital with stakeholders, because; It allows for stakeholder input; Can be adjusted in real-time to test different assumptions; All assumptions are transparent. Reduces the number of stages of estimation.

What is the District of Columbia Crime Policy Institute (DCPI)? A nonpartisan, public policy research organization focused on crime and justice policy in Washington, DC. DCPI’s mission is to support improvements in the administration of justice and public safety policies, through evidence-based research. Collaboration of UI and Brookings researchers to inform long-term strategic and short-term operations research on juvenile and criminal justice mattres. Goal is timely, practitioner-oriented research to develop and implement evidence-based crime policy in DC.

What will DCPI Do? Year 1 DCPI activities include: Develop a cost-benefit model to identify cost-effective DC based interventions; Develop a research library (DCCrimePolicy.org); Conduct independent research: An Evaluation of the Mayor’s Focused Improvement Area Initiative; A Study of Promising Practices at the Metropolitan Police Department; A Study to Understand the Impact of Pre-Trial Detention on Defendants and its Implications for Evidence-Based Practice

What are Drug Courts? Drug courts use court-based treatment as an alternative to incarceration. Begun in Miami in 1989, drug courts are specialized dockets that process drug-involved offenders. Client behavior closely monitored by a judge. Key premises: Drug involved offenders would commit fewer crimes if they desisted from drug use. Relapse is part of recovery. Treatment participation can be encouraged (coerced?).

Do Drug Courts Work? Yes? Can’t randomly assign courts to have a drug court or not. So next best studies find client outcomes are moderately better using; RCTs in a single site; Meta-analyses of many studies with varying designs; Simulation studies using population data. Generally, recidivism is reduced 10-15 percent. Potential for large net social benefits.

What is Meta Analysis? Policymakers seem most convinced by meta-analysis as it uses evidence from many settings. Steps in meta analysis are to: Calculate an effect size (standardized mean effect size comparing T and C); Sum across studies (weighting by the inverse of the variance of each effect size); Account for heterogeneity; Apply ad hoc weights.

What is Meta, Cost Analysis? Translate effect size into net benefits to account for: Savings to law enforcement - reducing new crime; Savings to courts – not prosecuting new offenders Savings to corrections - not locking those people up; Savings to those who are not victimized; Add in costs of new policies and programs.

What is Meta, Cost Analysis (Again)? What are the criticisms? Garbage in, garbage out (identification); No uncertainty in these estimates. Can’t solve identification issues at this time; Can introduce additional uncertainty. How? Use Bayesian inference rather than classical (inferential, repeated sample) statistics. Analogous to measuring using the metric system. Represents the same fact, but standardized Base 10 can be more efficient than Base Whatever.

Why go Bayesian (Statistically) Accounts for uncertainty in the presence of common problems: Multi-stage model. Output of one stage is next stage’s input. Bayesian inference takes a weighted average of all output (weighted by the probability) versus just using the mode (frequentist). Non-symmetric distributions. Default is ‘mode’ in frequentist inference. Presence of Missing Data. Again, uses distribution rather than MI or listwise deletion.

Why Should DCPI Go Bayesian? Policy Buy-In. Presents data more intuitively (e.g. weather forecaster does not predict an expected mean precipitation of 0.5 inches with a standard deviation of .25). Allow Policymaker input. (e.g. can provide information about prior beliefs that can improve model performance (effect of policy changes on various problems, changes in crime patterns in DC, demographic changes, etc.). Flexibility. Can adapt to all manner of common problems in inference (missing data, etc.); Can’t combine multiple estimates with uncertainty without going Bayes. Replaces sensitivity analyses.

Analytic Strategy: General Estimate: • Costs of drug court in DC; • Benefits of reduced crime (general); • Expected benefits for a DC drug court population; • Effect of drug court on criminal behavior; • Translate effects into benefits, difference the costs.

Analytic Strategy: Costs • Taken from a prior study of a DC drug court (Harrell, et al. 1998) • Inflation adjusted to get present day per participant costs of drug court operations • Estimates $11,500 per participant (for entire process) • Limitations: • Based on FY 1995 costs – could have changed • Assumes same size drug court (returns to scale) • Advantages: • DC specific

Analytic Strategy: Valuing Benefits of Reduced Crime The cost of crime is the product of the price of crime (P) and the quantity of crime (Q), so: Costi= Pi*Qi Where i’s are the categories of crime costs. • Law enforcement costs of investigating crime; • Court costs of processing crime; • Corrections costs of supervision; • Victims costs of being victimized. Main source of uncertainty: price of crime to victims. Extant estimates produce point estimates, Roman (2009) includes distribution of prices.

Analytic Strategy: Estimates of the Price of Crime to Victims

Analytic Strategy: Calculating Benefits The goal of this analysis is to create an estimate of the distribution of net harms averted. Calculate reduced crime (impact) • From large scale, multi-site evaluation of drug courts find: • the rate of re-arrest among comparison group (not participating in drug court); • the distribution of crimes committed by those comparisons who were re-arrested; • the proportion of arrests that led to incarceration.

Analytic Strategy: Benefits (cont.) In the next step, we will estimate how many crimes are prevented by drug court. • For each arrest prevented, estimate what the crime would have been by sampling from the distribution of crimes committed by controls; • Estimate the price of that crime by sampling from Roman (2009); • Find the probability that would have led to incarceration (calculate costs of incarceration); • These last two prices are the benefits of preventing one crime.

Analytic Strategy: Impact Estimates • Meta-analysis • Data: • From Shaffer (2006) • Coded typical meta-data (effect size, research design, length of follow-up, sample size, etc.) • Called drug courts which had been evaluated and interviewed about policies and practices • Our approach: • Bayesian linear regression of estimated effect size (correlation between re-arrest and drug court participation) • Condition effect size on court and study characteristics

Analytic Strategy: Impact Estimates • Variable selection (court traits): • Preliminary regressions of effect size on theoretically meaningful characteristics (treatment type, program design, response to graduation/failure, eligibility, etc.) • Selected those which were significant: violent offenders eligible and motivation required for enrollment • Variable selection (study traits): • Length of follow-up; • Methodology (random trial, matching, etc.); • Similarity between treatment and control groups (race, age, gender, prior criminal history); • Composition of control group (dropouts, ineligible, etc.).

Results: Effect Size Estimates • The final results are many simulated possible effect sizes • From these simulations, we can estimate which effect sizes are more probable than others • The following slide presents the probability (on the y-axis) that the effect size will be less than (larger reduction) various values (on the x-axis) • It is calculated separately for courts that require motivation, don’t require motivation, and allow violent offenders • The vertical blue line indicates the probability that the effect size will be below zero (drug court reduces crime)

Results: Effect Sizes, Tests Eligibility Rules Does not Require Motivation Requires Motivation Allows Violent Offenders Good Effects Bad Effects

Results: Effect Size Estimates • The results before clearly indicate that courts that do not accept violent offenders are nearly guaranteed to reduce offending • However, often, it is not enough to know that there will likely be some reduction, but the size of that reduction is critical • A key advantage of Bayesian methods is that they allow one to estimate the probability that the reduction will be greater than any particular value • If we arbitrarily define a large effect to be an effect size less than -0.08, then we can calculate the probabilities of a large effect under each of the three scenarios

Results: Effect Sizes, Probability of a Large Effect Does not Require Motivation Requires Motivation Allows Violent Offenders Good Effects Bad Effects

Results: Effect Size Estimates • The previous slide indicates that there is a large difference in the probability that a court will achieve a large reduction, depending on whether or not motivation is required • This result is important because both courts had near equal likelihood of leading to a reduction in reoffending • Motivation eligibility requirements only start to matter when there is a large target effect

The Effect Size Over Time • The following plot traces the effect size over time, based on the relationship between measured effect and length of follow-up • The findings indicate that studies with longer follow-up periods tended to estimate larger effects, suggesting that the effect of drug court grows over time • For simplicity, only the mean effect size at each time point is displayed

Results: Effect Size Over Time Allows Violent Offenders Does not Require Motivation Requires Motivation

The Problem with Effect Sizes • The past three plots indicate that: • Drug courts which do not accept violent offenders are almost certain to have a true re-arrest reducing effect • The effects are larger in courts that require motivation for enrollment • The effects appear to grow over time • However, all of the above results use the effect size • Effect sizes are basically measures of the mean effects • There is no certainty that any jurisdiction’s experience will match the mean effect size

The Problem with Effect Sizes • The problem with presenting effect sizes can best be thought of by considering the sample size problem ubiquitous in social science statistics • Some effect is measured, but with a small number of units, we cannot be sure whether the observed effect reflects the true effect (hence tests for statistical significance) • This problem is reversed here • We estimate the true effects of a drug court, but if only a small number of offenders are enrolled, we cannot be sure that the realized, observed effect will match the underlying true effect

The Problem with Effect Sizes • In other words, when presenting effect sizes, we are presenting estimates of what the true effect is, not what effect could be expected from establishing a new drug court • This problem is greatest when considering programs with a relatively small number of participants, which typifies most criminal justice programs • The following plot displays the densities of expected number of arrests prevented, given 230 participants (the mean number from our sample)

Densities of Change in Rearrest Reduction in Rearrest | Increase in Rearrest

The Problem with Effect Sizes • These densities indicate that there is substantial probability that drug courts will not reduce re-arrests, even though there is nearly a 100% chance that the mean effect is a reduction • Telling policymakers that there is a 99% chance of a mean reduction would have left out the critical detail that there is only a 90% chance that they will experience a reduction in arrests • The blue lines indicate the general findings from past meta-analyses, which suggest that the mean drug court effect is a 10-15 percent reduction in re-arrests • While the distributions match this mean estimate very well, this small interval masks considerable variation in estimated effects

The Problem with Effect Sizes • The following slides illustrate this problem • The blue lines indicate the distribution of the effect size based on the simulations (solid line is mean estimate, dashed lines are the 95% confidence interval of the estimate) • These lines are horizontal because the effect size estimate is not based on sample size (the x-axis)

The Problem with Effect Sizes • For each population size displayed on the x-axis, based on the distribution of the effect size, we simulated the number of arrests prevented 10,000 times • Rather than the effect size, these simulations represent the reduction that courts of different sizes would likely experience • This information is displayed by the black lines • Again, the mean is the solid line and the dashed lines are the 95% confidence interval

A Bayesian, Meta Cost-Benefit Model

A Bayesian, Meta Cost-Benefit Model

Presentation Transcript

Cost Benefit Analysis

Cost Benefit Analysis

Cost-Benefit

A Bayesian, Meta Cost-Benefit Model

Cost-Benefit Analysis

CEDBR Fiscal Benefit – Cost Model

Turning Bayesian Model Averaging Into Bayesian Model Combination

UML META-MODEL

Bayesian SPARROW Model

A Benefit-Cost Analysis Primer

Cost-Benefit Analysis

Meta Model I

COST BENEFIT ANALYSIS

Bayesian Model Comparison

A Freighter Airplane Cargo Fire Risk, Benefit and Cost Model

Meta Model I

AT Benefit Cost Analysis Model

The COSTEX model: a cost-benefit model relating gene expression and selection

Bayesian Model Comparison