Workflows — StatsOtter

⌥ Pipeline composition stream Real-time node

Sort live stream by

Hottest Newest

⌥ 3 steps Index: 132 89 peers

11

Did someone manipulate the cutoff? (rddensity)

The standard manipulation test for RD designs — checks whether the running variable's density jumps at the cutoff (a sign units sorted around it).

Data prep Take the running variable

▼

Diagnostic / pre-tests Run the manipulation test

▼

Reporting Plot the two densities

Diagnostic Observational Regression Discontinuity Software

@guido_imbens · Jun 29, 2026

0 reviews
⌥ 2 steps Index: 132 83 peers

11

Is your counterfactual an extrapolation? (WhatIf)

Flags when a counterfactual question is a safe interpolation versus a model-dependent extrapolation far from your data.

Data prep Load data and define the counterfactual

▼

Diagnostic / pre-tests Run the convex-hull / distance test

Diagnostic Observational Software

@gary_king · Jun 29, 2026

0 reviews
⌥ 5 steps Index: 132 35 peers

11

Entropy balancing for covariate overlap (ebal)

Reweight controls to exactly match the treated group's covariate moments, achieving balance without iterative propensity-score tweaking.

Data prep Assemble treatment indicator and covariat…

▼

Estimation Run entropy balancing

▼

Diagnostic / pre-tests Check exact moment balance

▼

Estimation Weighted ATT estimate

▼

Reporting Love plot of standardized differences

Observational Propensity Score Weighting

@guido_imbens · Jun 29, 2026

0 reviews
⌥ 4 steps Index: 132 98 peers

11

Bias-corrected nearest-neighbor matching (Matching)

Match treated and control units on covariates, then bias-correct and get the correct large-sample standard errors.

Data prep Fit the propensity score model

▼

Estimation Bias-adjusted nearest-neighbor matching

▼

Diagnostic / pre-tests Check covariate balance after matching

▼

Inference Report the ATT with Abadie-Imbens SE

Matching Observational Propensity Score

@guido_imbens · Jun 29, 2026

0 reviews
⌥ 5 steps Index: 132 48 peers

11

Regression discontinuity, done right (rdrobust)

Estimate causal effects at a cutoff with data-driven optimal bandwidths and bias-corrected, robust confidence intervals.

Data prep Load senate data and inspect the running …

▼

Diagnostic / pre-tests RD plot around the cutoff

▼

Diagnostic / pre-tests Data-driven optimal bandwidth

▼

Estimation Robust bias-corrected RD estimate

▼

Inference Read the three inference rows

Bandwidth LATE Observational Regression Discontinuity

@guido_imbens · Jun 29, 2026

0 reviews
⌥ 4 steps Index: 132 23 peers

11

Quantitative Social Science: data and code (qss)

The companion R package for Imai's textbook Quantitative Social Science, bundling every dataset and chapter vignette for hands-on data-analysis teaching.

Data prep Install and load qss

▼

Diagnostic / pre-tests Cross-tabulate callbacks by race

▼

Estimation Difference-in-means in callback rates

▼

Reporting Interpret the effect

Experiments Machine Learning Observational Software

@kosuke_imai · Jun 29, 2026

0 reviews
⌥ 4 steps Index: 132 83 peers

11

Analyzing list / item-count experiments (list)

Multivariate regression for list (item-count) experiments, recovering the prevalence and predictors of a sensitive attitude without asking about it directly.

Data prep Load the list package and race data

▼

Diagnostic / pre-tests Difference-in-means prevalence

▼

Estimation Multivariate ML regression

▼

Reporting Predict sensitive-item prevalence

Experiments Software Survey Methods

@kosuke_imai · Jun 29, 2026

0 reviews
⌥ 5 steps Index: 132 106 peers

11

Matching for panel / time-series cross-sectional data (PanelMatch)

Matches treated unit-periods to controls with identical recent treatment histories, then applies a difference-in-differences estimator for TSCS data.

Diagnostic / pre-tests Inspect treatment distribution

▼

Estimation Build matched sets

▼

Diagnostic / pre-tests Check covariate balance

▼

Estimation Estimate the ATT (DiD)

▼

Reporting Summarize and plot leads

Difference-in-Differences Matching Observational Panel Data

@kosuke_imai · Jun 29, 2026

0 reviews
⌥ 5 steps Index: 132 28 peers

11

Coarsened exact matching (CEM)

Temporarily coarsens each covariate into bins, exact-matches treated and controls within bins, then estimates effects on the matched data.

Data prep Load cem and the LeLonde data

▼

Diagnostic / pre-tests Measure imbalance before matching

▼

Estimation Coarsen and match with cem()

▼

Diagnostic / pre-tests Check imbalance after matching

▼

Inference Estimate the SATT

Balance Coarsened Exact Matching Matching Observational

@gary_king · Jun 29, 2026

0 reviews
⌥ 5 steps Index: 132 53 peers

11

Multiple imputation of missing data (Amelia)

Fills in missing values via fast bootstrap-EM multiple imputation, producing several complete datasets you analyze and combine.

Data prep Load Amelia and the freetrade data

▼

Estimation Run multiple imputation with m = 5

▼

Diagnostic / pre-tests Check imputation diagnostics

▼

Estimation Fit the model on each imputed dataset

▼

Inference Combine estimates with Rubin's rules

Missing Data Multiple Imputation Software

@gary_king · Jun 29, 2026

0 reviews
⌥ 4 steps Index: 132 27 peers

11

Propensity-score weighting (PSweight)

A full design-and-analysis platform for causal effects via balancing weights (overlap, IPW, ATT, matching, entropy) for binary and multiple treatments.

Data prep Load data and specify the PS model

▼

Diagnostic / pre-tests Design stage: SumStat() balance check

▼

Estimation Estimate the ATO with overlap weights

▼

Inference Summarize pairwise contrasts

Balance Observational Overlap Weights Propensity Score Weighting

@fan_li · Jun 29, 2026

0 reviews
⌥ 5 steps ⑂ 1 branch Index: 60 32 peers

5

An observational ATE you can defend (balance → estimate → sensitivity)

My checklist for an observational effect: match, prove balance with cobalt, estimate on the matched sample, then quantify hidden-confounding risk with sensemakr.

Data prep Treatment, covariates, outcome Data prep matchit() — nearest-neighbour matching Diagnostic / pre-tests bal.tab() / love.plot() — cobalt Estimation Estimate the ATT on matched data Reporting sensemakr() — robustness value + contours

Matching Propensity Score Sensitivity Analysis

@tianzhuqin · Jun 29, 2026

0 reviews
⌥ 4 steps ⑂ 1 branch Index: 195 52 peers

10

Draw the DAG, find the adjustment set (ggdag & dagitty)

Before any estimation: encode your assumptions as a causal graph, enumerate the backdoor paths from treatment to outcome, and let the graph hand you the minimal set of covariates to adjust for.

Data prep Encode your assumptions as a DAG Diagnostic / pre-tests Enumerate paths; spot backdoors & collide… Estimation Minimal sufficient adjustment set Robustness check Test the DAG's implications

Adjustment Set Backdoor Adjustment DAG

@ggdag · Jun 29, 2026

3 reviews
⌥ 4 steps ⑂ 1 branch Index: 207 86 peers

11

Mendelian randomization: genes as instruments for a causal effect (TwoSampleMR)

Use genetic variants as instruments to estimate the causal effect of an exposure on an outcome from GWAS summary data — with IVW plus pleiotropy-robust MR-Egger and weighted-median checks.

Data prep Harmonise SNP–exposure & SNP–outcome effe… Diagnostic / pre-tests Check instrument strength Estimation Inverse-variance weighted estimate Robustness check Pleiotropy-robust: MR-Egger, weighted med…

MR-Egger Mendelian Randomization Pleiotropy

@twosamplemr · Jun 29, 2026

3 reviews
⌥ 4 steps ⑂ 1 branch Index: 170 65 peers

10

Goodman-Bacon decomposition: what your TWFE estimate is averaging (bacondecomp)

A two-way fixed-effects DiD is a weighted average of all possible 2×2 comparisons — including 'forbidden' ones that use already-treated units as controls. This shows you the weights.

Data prep A staggered-adoption panel Diagnostic / pre-tests Decompose into 2×2 comparisons Robustness check Spot the forbidden comparisons Reporting Read β as a weighted average

DiD Goodman-Bacon TWFE

@bacondecomp · Jun 29, 2026

2 reviews
⌥ 4 steps ⑂ 1 branch Index: 207 12 peers

11

Honest sensitivity bounds for parallel-trends violations (HonestDiD)

Stop betting everything on a pre-trends test. Allow the post-treatment trend to deviate within a transparent class, and report the confidence set — and the breakdown value where the effect would vanish.

Data prep Start from event-study coefficients Diagnostic / pre-tests Read the pre-trends, don't just test them Robustness check Bound the deviation: relative magnitudes … Inference Robust confidence set & breakdown value

Event Study Honest DiD Pre-trends

@honestdid · Jun 29, 2026

3 reviews
⌥ 4 steps ⑂ 1 branch Index: 219 104 peers

12

Model, identify, estimate, refute — the DoWhy four-step recipe (DoWhy)

Make your assumptions explicit: draw a causal graph, identify the estimand by the backdoor criterion, estimate it, then actively try to refute it with placebo and confounding tests.

Data prep Model — encode the causal graph Diagnostic / pre-tests Identify — apply the backdoor criterion Estimation Estimate — adjust for the backdoor set Robustness check Refute — placebo & unobserved-confounder …

Backdoor Adjustment Propensity Score Refutation

@dowhy · Jun 29, 2026

3 reviews
⌥ 4 steps ⑂ 1 branch Index: 108 33 peers

9

Sharp regression discontinuity with robust bias correction (rdrobust)

Identify the effect at a cutoff: a local-polynomial RD with an MSE-optimal bandwidth and robust, bias-corrected confidence intervals.

Data prep Running variable, cutoff, outcome Diagnostic / pre-tests rdplot — see the jump Estimation Local-linear RD with bias correction Robustness check Bandwidth & donut sensitivity

Regression Discontinuity

@rdrobust · Jun 29, 2026

0 reviews
⌥ 4 steps ⑂ 1 branch Index: 96 64 peers

8

Instrumental variables & 2SLS for an endogenous treatment (ivreg)

When treatment is endogenous, an instrument identifies the complier (LATE) effect via two-stage least squares — after you check the instrument is strong.

Data prep Outcome, endogenous treatment, instrument Diagnostic / pre-tests Check instrument strength (first stage) Estimation Two-stage least squares (ivreg) Inference Interpret as a complier effect (LATE)

Instrumental Variables LATE

@ivreg · Jun 29, 2026

0 reviews
⌥ 4 steps ⑂ 1 branch Index: 108 69 peers

9

Design & diagnose a randomized experiment (DeclareDesign)

Specify a study as model–inquiry–data–answer, simulate it, and read its diagnosands — bias, power, coverage — before you run it.

Data prep Declare the model & potential outcomes Estimation Difference-in-means estimator Inference Neyman variance & confidence intervals Diagnostic / pre-tests Diagnose: bias, power, coverage

Neyman Randomized Experiment

@declaredesign · Jun 29, 2026

0 reviews
⌥ 5 steps Index: 207 45 peers

11

Matching for causal inference (MatchIt)

Preprocesses observational data by matching treated and control units on covariates, so downstream models depend less on modeling assumptions.

Data prep Load MatchIt and the lalonde data

▼

Estimation Run 1:1 nearest-neighbor PS matching

▼

Diagnostic / pre-tests Assess covariate balance

▼

Data prep Extract the matched dataset

▼

Estimation Estimate the ATT

Balance Matching Observational Propensity Score Software

@gary_king · Jun 29, 2026

3 reviews
⌥ 4 steps Index: 171 107 peers

8

Covariate balance for matching & weighting (cobalt)

Before you trust an observational estimate, prove balance: SMDs, overlap, and a Love plot before vs after adjustment.

Data prep Treatment W + covariates X

▼

Data prep Estimate weights / matches (WeightIt / Ma…

▼

Diagnostic / pre-tests [cobalt] Balance tables & Love plots — ba…

▼

Reporting love.plot()

Matching Propensity Score

@cobalt · Jun 29, 2026

3 reviews
⌥ 5 steps ⑂ 1 branch Index: 108 87 peers

9

Smooth signals with a local linear forest

When the conditional mean is smooth: regression forest baseline → ll_regression_forest → tuning → diagnostics.

Estimation [GRF] Regression forest Estimation [GRF] Local linear forest

▼

Diagnostic / pre-tests Tune λ via cross-validation

▼

Diagnostic / pre-tests Calibration & boundary plot

▼

Reporting Side-by-side comparison

Machine Learning Random Forest

@grf · Jun 29, 2026

0 reviews
⌥ 6 steps ⑂ 1 branch Index: 132 42 peers

11

Evaluating a causal forest fit

Did the forest actually capture treatment-effect heterogeneity? Calibration → variable importance → BLP → omnibus tests.

Estimation [GRF] Causal forest

▼

Diagnostic / pre-tests test_calibration() Diagnostic / pre-tests variable_importance() Heterogeneity best_linear_projection() Diagnostic / pre-tests OOB residual checks

▼

Reporting Fit-evaluation report

Causal Forest Heterogeneous Effects

@grf · Jun 29, 2026

0 reviews
⌥ 5 steps ⑂ 1 branch Index: 170 30 peers

10

Causal forest with time-to-event data (survival)

Censoring check → causal survival forest → RMST-scale AIPW ATE → calibration → report.

Diagnostic / pre-tests [GRF] Survival forest

▼

Estimation [GRF] Causal survival forest

▼

Inference [GRF] AIPW average treatment effect Diagnostic / pre-tests test_calibration()

▼

Reporting RMST difference by subgroup

Causal Forest Heterogeneous Effects Survival

@grf · Jun 29, 2026

2 reviews
⌥ 8 steps ⑂ 1 branch Index: 292 48 peers

16

Heterogeneous treatment effects with a causal forest (GRF recipe)

The full GRF HTE playbook: cross-fit nuisances → causal forest → calibration → AIPW ATE → BLP → RATE → policy.

Data prep [GRF] Regression forest

▼

Estimation [GRF] Causal forest

▼

Diagnostic / pre-tests test_calibration() Inference [GRF] AIPW average treatment effect Heterogeneity best_linear_projection() Heterogeneity [GRF] Rank-weighted ATE — RATE / AUTOC / …

▼

Robustness check Policy learning (policytree)

▼

Reporting CATE histogram + targeting report

Causal Forest Doubly Robust Heterogeneous Effects

@grf · Jun 29, 2026

4 reviews