Statistical inference for high-dimensional regression with proxy data
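  • Speaker: Sai Li (李赛), Renmin University of China
  • Time: March 8, 2024, 14:00
  • Venue: Lecture Hall 1417, Administration Building, Zijingang Campus, Zhejiang University
  • Host: Center for Data Science, Zhejiang University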


We study estimation and inference for high-dimensional linear models with two types of “proxy data”. The first type comprises marginal statistics and sample covariance matrices computed from distinct sets of individuals. We develop a rate-optimal method for estimation and inference for the regression coefficient vector and its linear functionals based on these proxy data. We also show an intrinsic limitation of proxy-data-based inference: the minimax optimal rate for estimation is slower than in the conventional setting where individual-level data are observed. The second type of proxy data is differentially private data. For this setting, we propose a method for private estimation and inference in high-dimensional regression with false discovery rate (FDR) control.
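
To make the first type of proxy data concrete, the sketch below illustrates why marginal statistics and a sample covariance matrix can stand in for individual-level data: the Lasso objective depends on the data only through X'y/n and X'X/n. This is a generic illustration under stated assumptions, not the method presented in the talk. The function name `lasso_from_summary`, the simulation settings, and the tuning value `lam` are all hypothetical, and the low-dimensional setup (p = 5) is chosen only so that the reference covariance is well conditioned.

```python
import numpy as np

def soft_threshold(z, t):
    # Scalar soft-thresholding operator used in Lasso coordinate descent.
    return np.sign(z) * max(abs(z) - t, 0.0)

def lasso_from_summary(Sigma, b, lam, n_iter=200):
    # Hypothetical helper: coordinate descent on
    #   0.5 * beta' Sigma beta - b' beta + lam * ||beta||_1,
    # which uses only a covariance matrix Sigma and marginal statistics b.
    # Assumes every diagonal entry of Sigma is strictly positive.
    p = len(b)
    beta = np.zeros(p)
    for _ in range(n_iter):
        for j in range(p):
            # Correlation of feature j with the partial residual,
            # computed without touching individual-level data.
            r_j = b[j] - Sigma[j] @ beta + Sigma[j, j] * beta[j]
            beta[j] = soft_threshold(r_j, lam) / Sigma[j, j]
    return beta

# Simulated example (assumed settings): two distinct sets of individuals.
rng = np.random.default_rng(0)
p, n = 5, 2000
beta_true = np.array([2.0, -1.0, 0.0, 0.0, 0.0])

# Sample 1 supplies only the marginal statistics b = X1' y1 / n.
X1 = rng.standard_normal((n, p))
y1 = X1 @ beta_true + rng.standard_normal(n)
b = X1.T @ y1 / n

# Sample 2 (different individuals) supplies only the sample covariance.
X2 = rng.standard_normal((n, p))
Sigma = X2.T @ X2 / n

beta_hat = lasso_from_summary(Sigma, b, lam=0.05)
print(beta_hat)
```

Because `Sigma` and `b` come from different individuals, the mismatch between the two samples adds error on top of the usual noise, which is consistent in spirit with the slower estimation rate mentioned in the abstract. In a genuinely high-dimensional setting the reference covariance is singular, and a plain objective like this one may need additional care.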

