Identification and Multiply Robust Estimation of Causal Effects via Instrumental Variables from an Auxiliary Population
发布时间:2026-05-25浏览量:
一、论文摘要
Estimating causal effects in a target population with unmeasured confounders is challenging, especially when instrumental variables (IVs) are unavailable. However, IVs from auxiliary populations with similar problems can help infer causal effects in the target population. While the homogeneous conditional average treatment effect assumption has been widely used for effect transportability, it has not been explored in IV-based data fusion. We include it as a basic approach, though it may be biased when treatment effect heterogeneity exists. As an alternative approach, we introduce the equi-confounding assumption that the unmeasured confounding bias remains the same after adjusting for observed covariates, while allowing conditional average treatment effects to differ across populations. This allows us to identify the confounding bias in the auxiliary population and remove it from the treatment-outcome association in the target population to recover the causal effect. We develop multiply robust estimators under both approaches and demonstrate them through simulation studies and a real data application.
二、论文发表截图

三、作者介绍

李伟,中国人民大学统计学院副教授,中国人民大学吴玉章青年学者,教育部青年长江学者,入选北京市通州区“运河英才计划”领军人才。研究方向是因果推断、缺失数据及其在生物医学、社会经济学等领域中的应用,已在JRSSB, JASA,Biometrika等国际统计学顶级期刊发表多篇文章。主持国家自然科学基金青年项目和面上项目、北京市自然科学基金面上项目等多项科研课题。个人主页:https://weiliruc.github.io/

刘佳朋,波士顿大学统计学博士研究生,2025年硕士毕业于中国人民大学统计学院。主要研究方向为因果推断,数据融合等。

丁鹏,加州大学伯克利分校统计系副教授。2015年5月在哈佛大学统计系获得博士学位,随后在哈佛大学陈曾熙公共卫生学院(Harvard T. H. Chan School of Public Health)流行病学系从事博士后研究工作,直至2015年12月。此前,他在北京大学获得数学学士、经济学学士以及统计学硕士学位。

耿直,北京工商大学数学与统计学院,教授;创新书院院长。1989年至2021年在北京大学任教,2022年至今组建北京工商大学数理统计及因果推断团队。研究领域为因果推断、不完全数据统计分析、生物医学统计等。曾任中国现场统计研究会理事长、中国概率统计学会理事长、IMS-China主席、中国统计学会副会长;现任中国现场统计研究会因果推断分会理事长,中国人工智能学会不确定性人工智能专委会副主任委员。
更多文章
- Reduced Rank Multivariate Spatial Autoregressive Model for Large-scale Networks
- Neural Network-Based Dynamic Prediction for Interval-Censored Data with Time-Varying Covariates: Application to Alzheimer's Disease
- Co-regularized optimal high-order graph embedding for multi-view clustering.
- Joint identification of spatially variable genes via a network-assisted Bayesian regularization approach