sourceR : Classification and source attribution of infectious agents among heterogeneous populations

Miller, Poppy and Marshall, Jonathan and French, Nigel and Jewell, Christopher Parry (2017) sourceR : Classification and source attribution of infectious agents among heterogeneous populations. PLoS Computational Biology, 13 (5): e1005564. ISSN 1553-734X

Full text not available from this repository.


Zoonotic diseases are a major cause of morbidity, and productivity losses in both human and animal populations. Identifying the source of food-borne zoonoses (e.g. an animal reservoir or food product) is crucial for the identification and prioritisation of food safety interventions. For many zoonotic diseases it is difficult to attribute human cases to sources of infection because there is little epidemiological information on the cases. However, microbial strain typing allows zoonotic pathogens to be categorised, and the relative frequencies of the strain types among the sources and in human cases allows inference on the likely source of each infection. We introduce sourceR, an R package for quantitative source attribution, aimed at food-borne diseases. It implements a Bayesian model using strain-typed surveillance data from both human cases and source samples, capable of identifying important sources of infection. The model measures the force of infection from each source, allowing for varying survivability, pathogenicity and virulence of pathogen strains, and varying abilities of the sources to act as vehicles of infection. A Bayesian non-parametric (Dirichlet process) approach is used to cluster pathogen strain types by epidemiological behaviour, avoiding model overfitting and allowing detection of strain types associated with potentially high “virulence”. sourceR is demonstrated using Campylobacter jejuni isolate data collected in New Zealand between 2005 and 2008. Chicken from a particular poultry supplier was identified as the major source of campylobacteriosis, which is qualitatively similar to results of previous studies using the same dataset. Additionally, the software identifies a cluster of 9 multilocus sequence types with abnormally high ’virulence’ in humans. sourceR enables straightforward attribution of cases of zoonotic infection to putative sources of infection. As sourceR develops, we intend it to become an important and flexible resource for food-borne disease attribution studies.

Item Type:
Journal Article
Journal or Publication Title:
PLoS Computational Biology
Additional Information:
© 2017 Miller et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Uncontrolled Keywords:
?? bayesian non-parametricsource attributionmcmcpublic healthfood poisoningecologycellular and molecular neuroscienceecology, evolution, behavior and systematicsgeneticsmodelling and simulationcomputational theory and mathematicsmolecular biologydiscipline-b ??
ID Code:
Deposited By:
Deposited On:
24 May 2017 08:06
Last Modified:
15 Jul 2024 17:00