Methodology and theory for unbiased Markov chain Monte Carlo and alternatives

Papp, Tamas and Sherlock, Chris (2025) Methodology and theory for unbiased Markov chain Monte Carlo and alternatives. PhD thesis, Lancaster University.

[thumbnail of 2025PappPhD]
Text (2025PappPhD)
2025PappPhD.pdf - Published Version
Available under License Creative Commons Attribution.

Download (4MB)

Abstract

Markov chain Monte Carlo (MCMC) is the default inference method in Bayesian statistics. However, MCMC algorithms are biased, which can limit the extent to which these algorithms can be parallelized, and which can complicate the subsequent statistical inference. Recently-proposed methods provide ways of entirely eliminating this bias, based on simulating a coupling of two Markov chains that evolve in tandem. This thesis contributes to the unbiased MCMC literature by improving the efficiency and the theoretical understanding of coupling methods. Additionally, it provides a competitive alternative to coupling methods. The first contribution is an effective coupling for the random walk Metropolis algorithm, a widely-used practical MCMC algorithm. We design this coupling with the explicit aim of scalability to high dimensions, and analyze this coupling in a theoretical framework that can quantify the efficiency of couplings in high dimensions. This framework may be useful for designing and analyzing couplings of other MCMC algorithms. The second contribution is a study on tuning the scalar parameters of coupling methods. We argue that the so-called time-lag parameter is crucial to the efficiency and the robustness of these methods. Even though unbiased MCMC estimators can be noisier than standard MCMC ones, we demonstrate how judicious tuning can ensure that unbiased MCMC is nearly as efficient as standard MCMC. The final contribution is a pair of estimators of the squared Euclidean 2-Wasserstein distance, a strong measure of the discrepancy between two distributions. These estimators are based on approximately independent samples from the distributions of interest. We show that the estimators are often upper and lower bounds on the underlying discrepancy, and we demonstrate that they often outperform coupling methods in statistical applications.

Item Type:
Thesis (PhD)
Uncontrolled Keywords:
Research Output Funding/yes_externally_funded
Subjects:
?? yes - externally fundedno ??
ID Code:
233553
Deposited By:
Deposited On:
08 Nov 2025 00:34
Refereed?:
No
Published?:
Published
Last Modified:
20 Dec 2025 00:30