Data Integration Framework for Heterogeneous Sources

Published:Dec 30, 2025 16:50
1 min read
ArXiv

Analysis

This paper addresses a crucial problem in data science: integrating data from diverse sources, especially when dealing with summary-level data and relaxing the assumption of random sampling. The proposed method's ability to estimate sampling weights and calibrate equations is significant for obtaining unbiased parameter estimates in complex scenarios. The application to cancer registry data highlights the practical relevance.

Reference

The proposed approach estimates study-specific sampling weights using auxiliary information and calibrates the estimating equations to obtain the full set of model parameters.