Academic Advising Center Advisors Database
Academic Advising Center
Arts & Sciences
Duke University

 HOME > Arts & Sciences > advising > Advisors    Search Help Login 

Publications [#352655] of Jerome P. Reiter

Chapters

  1. Tang, J; Reiter, JP; Steorts, RC, Bayesian Modeling for Simultaneous Regression and Record Linkage, Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, vol. 12276 LNCS (January, 2020), pp. 209-223, ISBN 9783030575205 [doi]
    (last updated on 2026/01/21)

    Abstract:
    Often data analysts use probabilistic record linkage techniques to match records across two data sets. Such matching can be the primary goal, or it can be a necessary step to analyze relationships among the variables in the data sets. We propose a Bayesian hierarchical model that allows data analysts to perform simultaneous linear regression and probabilistic record linkage. This allows analysts to leverage relationships among the variables to improve linkage quality. Further, it enables analysts to propagate uncertainty in a principled way, while also potentially offering more accurate estimates of regression parameters compared to approaches that use a two-step process, i.e., link the records first, then estimate the linear regression on the linked data. We propose and evaluate three Markov chain Monte Carlo algorithms for implementing the Bayesian model, which we compare against a two-step process.


Duke University * Arts & Sciences * Advisors * Peer advisors * Staff * Reload * Login
x