Propensity Score Distribution
Interpretation Aid
Technical Interpretation
What you're seeing: The histograms show the distribution of estimated propensity scores for treated (blue) and control (orange) groups. Each propensity score represents the predicted probability that a unit received treatment, based on its covariate values.
Good overlap: When both distributions cover similar ranges (e.g., both spanning 0.2ā0.8), matching can find comparable units. The key assumption of PSMāthat we can find similar treated and control unitsāis satisfied.
Warning signs: If treated units cluster near 1.0 and controls near 0.0 with little overlap, the propensity model is "perfectly predicting" treatment. This means groups are fundamentally different on observed characteristics, and matching will fail or produce few matches.
Practical Interpretation
Marketing example: In a loyalty program analysis, if customers who enrolled all have very high propensity scores while non-enrollees have very low scores, it means enrolled customers were systematically different (e.g., higher prior spend, more visits). Finding a "fair comparison" will be difficult.
What to do if overlap is poor: (1) Use fewer or different covariates, (2) collect more data, or (3) consider that the treatment effect may not be estimable with this dataāthe groups are too different to compare.