Bayesian Modelling and Inference for Multiple Network Data

Mantziou, Anastasia and Lunagomez Coria, Simon and Mitra, Robin and Fearnhead, Paul (2022) Bayesian Modelling and Inference for Multiple Network Data. PhD thesis, Lancaster University.

[thumbnail of 2022MantziouPhD]
Text (2022MantziouPhD)
2022MantziouPhD.pdf - Published Version

Download (14MB)

Abstract

There is a growing need for analysing network data due to their prevalence in applications arising from various scientific fields. A broad literature has been developed for the statistical analysis of networks as single observations, while the formulation of statistical frameworks for modelling multiple network data has only recently been considered by researchers. This thesis contributes to the statistical analysis of multiple network data sets, where now each observation in the data comprises a network rather than a scalar quantity. Our first contribution is the development of a Bayesian model-based approach for clustering multiple network data with respect to similarities detected in the connectivity patterns among the networks' nodes. Our model-based approach allows us to interpret the clusters with respect to a parameterisation, notably, through a network representative for each cluster. Our framework can also be formulated to detect networks in a population that are different from a majority group of networks. Extensive simulation studies show our model performs well in both clustering multiple network data and inferring the model parameters. We further apply our model on two real-world multiple network data sets resulting from the fields of Computing (Human Tracking Systems) and Neuroscience. Our second contribution is twofold. First, we introduce a new network distance metric that measures dissimilarities between networks with respect to their cycles, motivated by an ecological application. Second, we propose a new Markov Chain Monte Carlo (MCMC) scheme for inferring the parameters of the intractable Spherical Network Family (SNF) model for multiple network data. Specifically, we introduce an Importance Sampling (IS) step within a Metropolis-Hastings (MH) algorithm that allows the approximation of the intractable normalising constant of the SNF model within the MH ratio. We explore the behaviour of the newly proposed distance metric and the performance of our MCMC scheme through simulation studies, and apply our algorithm on a real-world ecological application.

Item Type:
Thesis (PhD)
Uncontrolled Keywords:
Data Sharing Template/no
Subjects:
?? no ??
ID Code:
171585
Deposited By:
Deposited On:
14 Jun 2022 16:50
Refereed?:
No
Published?:
Published
Last Modified:
21 Nov 2024 01:52