Authors
- Nicola F. Müller1,2,†
- Ryan R. Wick3
- Louise M. Judd4
- Deborah A. Williamson5
- Trevor Bedford2,6
- Benjamin P. Howden3,4
- Sebastián Duchêne3,7,‡
- Danielle J. Ingle3,‡,†
1Division of HIV, ID and Global Medicine, University of California San Francisco, CA, USA
2Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
3Department of Microbiology and Immunology at the Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC, AUS
4Centre for Pathogen Genomics, University of Melbourne (Doherty Institute)
5Department of Infectious Diseases at the Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC, AUS
6Howard Hughes Medical Institute, Seattle, WA, USA
7EDID unit, Department of Computational Biology, Institut Pasteur, Paris, France
†Corresponding authors: nicola.felix.mueller@gmail.com, danielle.ingle@unimelb.edu.au
‡These authors contributed equally to this work
The 'silent pandemic' of antimicrobial resistance (AMR) represents a significant global public health threat. These AMR genes are often carried on mobile elements, such as plasmids. The horizontal movement of these plasmids allows AMR genes, and the resistance they confer to key therapeutics, to disseminate throughout a population. However, quantifying the movement of plasmids remains challenging with existing computational approaches.
Here, we introduce a novel method for reconstructing and quantifying the movement of plasmids in bacterial populations over time. To do this, we model the co-evolution of chromosomal and plasmid DNA using a joint coalescent and plasmid transfer process in a Bayesian phylogenetic network framework. Our approach captures differences in the evolutionary histories of plasmids and chromosomes to identify instances where plasmids likely moved between bacterial lineages, while rigorously accounting for uncertainty in the data.
We apply this new method to a five-year dataset of Shigella, exploring transfer rates for five different plasmids that harbor various AMR and virulence profiles. In doing so, we reconstruct the co-evolution of the large Shigella virulence plasmid with the chromosomal DNA, quantify the higher transfer rates of three small plasmids that circulate among lineages of Shigella sonnei, and describe the recent dissemination of a multidrug-resistant plasmid between S. sonnei and S. flexneri lineages. Notably, this plasmid appears to have transferred in multiple independent events alongside an overall increase in its prevalence since 2010.
Our approach provides a powerful framework for understanding the evolutionary dynamics of plasmids carrying AMR genes as they are introduced, circulate, and are maintained in bacterial populations.
.
├── Software/ # Analysis software tools (JAR files)
├── Applications/ # Real-world data analyses
│ └── Shigella/ # Shigella analysis pipeline and results
├── Simulations/ # Simulation studies
│ ├── PlasmidEvol/ # Plasmid evolution simulations
│ └── Rates/ # Rate parameter simulations
└── Validation/ # Method validation tests
The Software/ directory contains custom Java tools for plasmid phylodynamic analysis:
- PlasmidTreeMapper.jar - Maps plasmid trees onto chromosomal phylogenies
- TransferCounter.jar - Counts and analyzes plasmid transfer events
- LossCounter.jar - Identifies and quantifies plasmid loss events
- Summarizer.jar - Summarizes posterior distributions from MCMC analyses
- CoalPT.jar - Coalescent and plasmid transfer inference
- LTT.jar - Lineage-through-time analysis
- Java 8 or higher
- BEAST 2.7+ (for XML-based analyses)
- R 4.0+ (for visualization and data analysis)
- MATLAB (for XML generation scripts)
The Applications/Shigella/ directory contains the complete analysis pipeline for Shigella data:
MATLAB Scripts:
createXml.m- Generate BEAST XML files for individual analysescreateSonneiFlexXml.m- Generate XMLs for S. sonnei and S. flexneri analysescreateChromosomePlasmidTrees.m- Create tanglegram analysescreateSupplementalReassortmentXml.m- Generate supplemental reassortment analysessplitFlex.m- Split S. flexneri data for analysiscompute.m- Main computational pipeline
R Scripts:
plotShigellaData.R- Main visualization of Shigella resultsplotLBIandClusters.R- Plot local branching index and cluster analysisplotDensityTree.R- Create density tree visualizationsplotSupplementalAnalyses.R- Generate supplemental figuresdensitreecopy.R- Additional density tree utilities
Python Notebooks:
plasmid_vis.ipynb- Interactive plasmid visualizationtanglegram.ipynb- Chromosome-plasmid tanglegram generationChromosomePlasmidTanglegrams.ipynb- Advanced tanglegram analyses
data/- Contains sequence alignments and phylogenetic treesSonnei/- S. sonnei genomic dataFlex/- S. flexneri genomic datapksr100/- Plasmid sequence data
combined/- Results from combined chromosome-plasmid analysesxmls/- BEAST XML configuration filessupplementalxmls/- Supplemental analysis configurations
Simulate plasmid transfer and evolution under various evolutionary scenarios.
Key files:
makeXmls.m- Generate simulation XML filesrunSimulations.sh- Execute simulation pipelinerunInferences.sh- Run inference on simulated datacountSNPs.m- Count SNPs in simulation outputsplotRates.R- Visualize inferred rates
Validate rate parameter estimation under known conditions.
Usage:
cd Simulations/Rates/
bash runSimulations.sh
bash runInferences.sh
Rscript plotRates.RThe Validation/ directory contains tests to validate the method's accuracy.
Test cases:
- Serial sampling with 5 taxa and 5 plasmids
- Serial sampling with 5 taxa and 10 plasmids
Run validation:
cd Validation/
beast testAll_serial5taxon5plasmids.xml
beast testAll_serial5taxon10plasmids.xml
Rscript plotValidation.R