publications | Lucius EJ Bynum

2025

Black Box Causal Inference: Effect Estimation via Meta Prediction

Lucius EJ Bynum , Aahlad Manas Puli , Diego Herrero-Quevedo , and 4 more authors

arXiv Preprint arXiv:2503.05985, 2025

Abstract Cite PDF

Causal inference and the estimation of causal effects plays a central role in decision-making across many areas, including healthcare and economics. Estimating causal effects typically requires an estimator that is tailored to each problem of interest. But developing estimators can take significant effort for even a single causal inference setting. For example, algorithms for regression-based estimators, propensity score methods, and doubly robust methods were designed across several decades to handle causal estimation with observed confounders. Similarly, several estimators have been developed to exploit instrumental variables (IVs), including two-stage least-squares (TSLS), control functions, and the method-of-moments. In this work, we instead frame causal inference as a dataset-level prediction problem, offloading algorithm design to the learning process. The approach we introduce, called black box causal inference (BBCI), builds estimators in a black-box manner by learning to predict causal effects from sampled dataset-effect pairs. We demonstrate accurate estimation of average treatment effects (ATEs) and conditional average treatment effects (CATEs) with BBCI across several causal inference problems with known identification, including problems with less developed estimators.
@article{bynum2025blackboxcausalinference, title = {Black Box Causal Inference: Effect Estimation via Meta Prediction}, author = {Bynum, Lucius EJ and Puli, Aahlad Manas and Herrero-Quevedo, Diego and Nguyen, Nhi and Fernandez-Granda, Carlos and Cho, Kyunghyun and Ranganath, Rajesh}, year = {2025}, eprint = {2503.05985}, journal = {arXiv Preprint arXiv:2503.05985}, }

2024

Causal Dependence Plots

Joshua R Loftus , Lucius EJ Bynum , and Sakina Hansen

In Advances in Neural Information Processing Systems , 2024

Abstract Cite PDF

To use artificial intelligence and machine learning models wisely we must understand how they interact with the world, including how they depend causally on data inputs. In this work we develop Causal Dependence Plots (CDPs) to visualize how a model’s predicted outcome depends on changes in a given predictor along with consequent causal changes in other predictor variables. Crucially, this differs from standard methods based on independence or holding other predictors constant, such as regression coefficients or Partial Dependence Plots (PDPs). Our explanatory framework generalizes PDPs, including them as a special case, as well as a variety of other interpretive plots that show, for example, the total, direct, and indirect effects of causal mediation. We demonstrate with simulations and real data experiments how CDPs can be combined in a modular way with methods for causal learning or sensitivity analysis. Since people often think causally about input-output dependence, CDPs can be powerful tools in the xAI or interpretable machine learning toolkit and contribute to applications like scientific machine learning and algorithmic fairness.
@inproceedings{NEURIPS2024_cc84bfab, author = {Loftus, Joshua R and Bynum, Lucius EJ and Hansen, Sakina}, booktitle = {Advances in Neural Information Processing Systems}, editor = {Globerson, A. and Mackey, L. and Belgrave, D. and Fan, A. and Paquet, U. and Tomczak, J. and Zhang, C.}, pages = {112656--112683}, publisher = {Curran Associates, Inc.}, title = {Causal Dependence Plots}, url = {https://proceedings.neurips.cc/paper_files/paper/2024/file/cc84bfabe6389d8883fc2071c848f62a-Paper-Conference.pdf}, volume = {37}, year = {2024}, }
A New Paradigm for Counterfactual Reasoning in Fairness and Recourse

Lucius E.J. Bynum , Joshua R. Loftus , and Julia Stoyanovich

In Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI-24 , Aug 2024

Main Track

Abstract Cite PDF

Counterfactuals and counterfactual reasoning underpin numerous techniques for auditing and understanding artificial intelligence (AI) systems. The traditional paradigm for counterfactual reasoning in this literature is the interventional counterfactual, where hypothetical interventions are imagined and simulated. For this reason, the starting point for causal reasoning about legal protections and demographic data in AI is an imagined intervention on a legally-protected characteristic, such as ethnicity, race, gender, disability, age, etc. We ask, for example, what would have happened had your race been different? An inherent limitation of this paradigm is that some demographic interventions – like interventions on race – may not translate into the formalisms of interventional counterfactuals. In this work, we explore a new paradigm based instead on the backtracking counterfactual, where rather than imagine hypothetical interventions on legally-protected characteristics, we imagine alternate initial conditions while holding these characteristics fixed. We ask instead, what would explain a counterfactual outcome for you as you actually are or could be? This alternate framework allows us to address many of the same social concerns, but to do so while asking fundamentally different questions that do not rely on demographic interventions.
@inproceedings{10.24963/ijcai.2024/784, title = {A New Paradigm for Counterfactual Reasoning in Fairness and Recourse}, author = {Bynum, Lucius E.J. and Loftus, Joshua R. and Stoyanovich, Julia}, booktitle = {Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, {IJCAI-24}}, publisher = {International Joint Conferences on Artificial Intelligence Organization}, editor = {Larson, Kate}, pages = {7092--7100}, year = {2024}, month = aug, note = {Main Track}, doi = {10.24963/ijcai.2024/784}, }
Language Models as Causal Effect Generators

Lucius EJ Bynum , and Kyunghyun Cho

arXiv Preprint arXiv:2411.08019, Aug 2024

Abstract Cite PDF

We present a framework for large language model (LLM) based data generation with controllable causal structure. In particular, we define a procedure for turning any language model and any directed acyclic graph (DAG) into a sequence-driven structural causal model (SD-SCM). Broadly speaking, an SD-SCM is a causal model with user-defined structure and LLM-defined structural equations. We characterize how an SD-SCM allows sampling from observational, interventional, and counterfactual distributions according to the desired causal structure. We then leverage this procedure to propose a new type of benchmark for causal inference methods, generating individual-level counterfactual data without needing to manually specify functional relationships between variables. We create an example benchmark consisting of thousands of datasets, and test a suite of popular estimation methods on these datasets for average, conditional average, and individual treatment effect estimation, both with and without hidden confounding. Apart from generating data, the same procedure also allows us to test for the presence of a causal effect that might be encoded in an LLM. This procedure can underpin auditing LLMs for misinformation, discrimination, or otherwise undesirable behavior. We believe SD-SCMs can serve as a useful tool in any application that would benefit from sequential data with controllable causal structure.
@article{bynum2024sdscm, title = {Language Models as Causal Effect Generators}, author = {Bynum, Lucius EJ and Cho, Kyunghyun}, year = {2024}, eprint = {2411.08019}, journal = {arXiv Preprint arXiv:2411.08019}, }

2023

Counterfactuals for the Future

Lucius EJ Bynum , Joshua R Loftus , and Julia Stoyanovich

Proceedings of the AAAI Conference on Artificial Intelligence, Jun 2023

Abstract Cite PDF

Counterfactuals are often described as ’retrospective,’ focusing on hypothetical alternatives to a realized past. This description relates to an often implicit assumption about the structure and stability of exogenous variables in the system being modeled — an assumption that is reasonable in many settings where counterfactuals are used. In this work, we consider cases where we might reasonably make a different assumption about exogenous variables; namely, that the exogenous noise terms of each unit do exhibit some unit-specific structure and/or stability. This leads us to a different use of counterfactuals — a forward-looking rather than retrospective counterfactual. We introduce "counterfactual treatment choice," a type of treatment choice problem that motivates using forward-looking counterfactuals. We then explore how mismatches between interventional versus forward-looking counterfactual approaches to treatment choice, consistent with different assumptions about exogenous noise, can lead to counterintuitive results.
@article{10.1609/aaai.v37i12.26655, title = {Counterfactuals for the Future}, volume = {37}, url = {https://ojs.aaai.org/index.php/AAAI/article/view/26655}, doi = {10.1609/aaai.v37i12.26655}, number = {12}, journal = {Proceedings of the AAAI Conference on Artificial Intelligence}, author = {Bynum, Lucius EJ and Loftus, Joshua R and Stoyanovich, Julia}, year = {2023}, month = jun, pages = {14144-14152}, }
The Possibility of Fairness: Revisiting the Impossibility Theorem in Practice

Andrew Bell , Lucius EJ Bynum , Nazarii Drushchak , and 3 more authors

In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency , Jun 2023

Abstract Cite PDF

The “impossibility theorem” — which is considered foundational in algorithmic fairness literature — asserts that there must be trade-offs between common notions of fairness and performance when fitting statistical models, except in two special cases: when the prevalence of the outcome being predicted is equal across groups, or when a perfectly accurate predictor is used. However, theory does not always translate to practice. In this work, we challenge the implications of the impossibility theorem in practical settings. First, we show analytically that, by slightly relaxing the impossibility theorem (to accommodate a practitioner’s perspective of fairness), it becomes possible to identify abundant sets of models that satisfy seemingly incompatible fairness constraints. Second, we demonstrate the existence of these models through extensive experiments on five real-world datasets. We conclude by offering tools and guidance for practitioners to understand when — and to what degree — fairness along multiple criteria can be achieved. This work has an important implication for the community: achieving fairness along multiple metrics for multiple groups (and their intersections) is much more possible than was previously believed.
@inproceedings{10.1145/3593013.3594007, author = {Bell, Andrew and Bynum, Lucius EJ and Drushchak, Nazarii and Zakharchenko, Tetiana and Rosenblatt, Lucas and Stoyanovich, Julia}, title = {The Possibility of Fairness: Revisiting the Impossibility Theorem in Practice}, year = {2023}, isbn = {9798400701924}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, url = {https://doi.org/10.1145/3593013.3594007}, doi = {10.1145/3593013.3594007}, booktitle = {Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency}, pages = {400–422}, numpages = {23}, keywords = {fairness, machine learning, public policy, responsible AI}, location = {Chicago, IL, USA}, series = {FAccT '23}, }

2022

An Interactive Introduction to Causal Inference

Lucius EJ Bynum , Falaah Arif Khan , Oleksandra Konopatska , and 2 more authors

IEEE VIS Workshop on Visualization for AI Explainability (VISxAI), Jun 2022

Abstract Cite Website

This work is a deep dive into the foundations of causal inference in the style of an interactive story. Learn all about randomization, causal graphical models, estimating treatment effects, and the assumptions behind causal inference.
@article{bynum2022interactive, author = {Bynum, Lucius EJ and Khan, Falaah Arif and Konopatska, Oleksandra and Loftus, Joshua R. and Stoyanovich, Julia}, title = {An Interactive Introduction to Causal Inference}, journal = {IEEE VIS Workshop on Visualization for AI Explainability (VISxAI)}, year = {2022}, url = {https://lbynum.github.io/interactive-causal-inference}, publisher = {IEEE}, }

2021

Disaggregated Interventions to Reduce Inequality

Lucius EJ Bynum , Joshua R Loftus , and Julia Stoyanovich

In Proceedings of the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization , Jun 2021

Abstract Cite PDF

A significant body of research in the data sciences considers unfair discrimination against social categories such as race or gender that could occur or be amplified as a result of algorithmic decisions. Simultaneously, real-world disparities continue to exist, even before algorithmic decisions are made. In this work, we draw on insights from the social sciences brought into the realm of causal modeling and constrained optimization, and develop a novel algorithmic framework for tackling pre-existing real-world disparities. The purpose of our framework, which we call the “impact remediation framework,” is to measure real-world disparities and discover the optimal intervention policies that could help improve equity or access to opportunity for those who are underserved with respect to an outcome of interest. We develop a disaggregated approach to tackling pre-existing disparities that relaxes the typical set of assumptions required for the use of social categories in structural causal models. Our approach flexibly incorporates counterfactuals and is compatible with various ontological assumptions about the nature of social categories. We demonstrate impact remediation with a hypothetical case study and compare our disaggregated approach to an existing state-of-the-art approach, comparing its structure and resulting policy recommendations. In contrast to most work on optimal policy learning, we explore disparity reduction itself as an objective, explicitly focusing the power of algorithms on reducing inequality.
@inproceedings{10.1145/3465416.3483286, author = {Bynum, Lucius EJ and Loftus, Joshua R and Stoyanovich, Julia}, title = {Disaggregated Interventions to Reduce Inequality}, year = {2021}, isbn = {9781450385534}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, url = {https://doi.org/10.1145/3465416.3483286}, doi = {10.1145/3465416.3483286}, booktitle = {Proceedings of the 1st ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization}, articleno = {2}, numpages = {13}, keywords = {social categories, inequality, fairness, causal modeling}, location = {--, NY, USA}, series = {EAAMO '21}, }

2020

Rotational Equivariance for Object Classification using xView

Lucius EJ Bynum , Timothy Doster , Tegan H Emerson , and 1 more author

In IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium , Jun 2020

Abstract Cite Link

With the recent addition of large, curated and labeled data sets to the remote sensing discipline, deep learning models have largely surpassed the performance of classical techniques. These deep models, typically Convolutional Neural Networks, are invariant to translation through the use of successive convolution layers which are themselves equivariant to translation. Further, the combination of multiple convolution and pooling layers means that in practice, the model is also approximately invariant to translation. However, until recently these models could only approach rotational invariance through data augmentation. Here we propose using a new model formulation which achieves rotational equaivariance without data augmentation for overhead imagery classification. We utilize the popular xView data set to compare the rotational equivariance formalization against a regular CNN and CNN with rotational data augmentation for the task of image classification.
@inproceedings{bynum2020rotational, title = {Rotational Equivariance for Object Classification using xView}, author = {Bynum, Lucius EJ and Doster, Timothy and Emerson, Tegan H and Kvinge, Henry}, booktitle = {IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium}, pages = {3684--3687}, year = {2020}, organization = {IEEE}, }
Argumentative Topology: Finding Loop(holes) in Logic

Sarah Tymochko , Zachary New , Lucius EJ Bynum , and 4 more authors

arXiv Preprint arXiv:2011.08952, Jun 2020

Abstract Cite Link

Advances in natural language processing have resulted in increased capabilities with respect to multiple tasks. One of the possible causes of the observed performance gains is the introduction of increasingly sophisticated text representations. While many of the new word embedding techniques can be shown to capture particular notions of sentiment or associative structures, we explore the ability of two different word embeddings to uncover or capture the notion of logical shape in text. To this end we present a novel framework that we call Topological Word Embeddings which leverages mathematical techniques in dynamical system analysis and data driven shape extraction (i.e. topological data analysis). In this preliminary work we show that using a topological delay embedding we are able to capture and extract a different, shape-based notion of logic aimed at answering the question "Can we find a circle in a circular argument?"
@article{tymochko2020argumentative, title = {Argumentative Topology: Finding Loop(holes) in Logic}, author = {Tymochko, Sarah and New, Zachary and Bynum, Lucius EJ and Purvine, Emilie and Doster, Timothy and Chaput, Julien and Emerson, Tegan}, journal = {arXiv Preprint arXiv:2011.08952}, year = {2020}, }