Please use this identifier to cite or link to this item: https://ahro.austin.org.au/austinjspui/handle/1/35065
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBuultjens, Andrew H-
dc.contributor.authorVandelannoote, Koen-
dc.contributor.authorMercoulia, Karolina-
dc.contributor.authorBallard, Susan-
dc.contributor.authorSloggett, Clare-
dc.contributor.authorHowden, Benjamin P-
dc.contributor.authorSeemann, Torsten-
dc.contributor.authorStinear, Timothy P-
dc.date2024-
dc.date.accessioned2024-02-07T04:14:31Z-
dc.date.available2024-02-07T04:14:31Z-
dc.date.issued2024-01-30-
dc.identifier.citationApplied and Environmental Microbiology 2024-01-30en_US
dc.identifier.issn1098-5336-
dc.identifier.urihttps://ahro.austin.org.au/austinjspui/handle/1/35065-
dc.description.abstractFundamental to effective Legionnaires' disease outbreak control is the ability to rapidly identify the environmental source(s) of the causative agent, Legionella pneumophila. Genomics has revolutionized pathogen surveillance, but L. pneumophila has a complex ecology and population structure that can limit source inference based on standard core genome phylogenetics. Here, we present a powerful machine learning approach that assigns the geographical source of Legionnaires' disease outbreaks more accurately than current core genome comparisons. Models were developed upon 534 L. pneumophila genome sequences, including 149 genomes linked to 20 previously reported Legionnaires' disease outbreaks through detailed case investigations. Our classification models were developed in a cross-validation framework using only environmental L. pneumophila genomes. Assignments of clinical isolate geographic origins demonstrated high predictive sensitivity and specificity of the models, with no false positives or false negatives for 13 out of 20 outbreak groups, despite the presence of within-outbreak polyclonal population structure. Analysis of the same 534-genome panel with a conventional phylogenomic tree and a core genome multi-locus sequence type allelic distance-based classification approach revealed that our machine learning method had the highest overall classification performance-agreement with epidemiological information. Our multivariate statistical learning approach maximizes the use of genomic variation data and is thus well-suited for supporting Legionnaires' disease outbreak investigations.IMPORTANCEIdentifying the sources of Legionnaires' disease outbreaks is crucial for effective control. Current genomic methods, while useful, often fall short due to the complex ecology and population structure of Legionella pneumophila, the causative agent. Our study introduces a high-performing machine learning approach for more accurate geographical source attribution of Legionnaires' disease outbreaks. Developed using cross-validation on environmental L. pneumophila genomes, our models demonstrate excellent predictive sensitivity and specificity. Importantly, this new approach outperforms traditional methods like phylogenomic trees and core genome multi-locus sequence typing, proving more efficient at leveraging genomic variation data to infer outbreak sources. Our machine learning algorithms, harnessing both core and accessory genomic variation, offer significant promise in public health settings. By enabling rapid and precise source identification in Legionnaires' disease outbreaks, such approaches have the potential to expedite intervention efforts and curtail disease transmission.en_US
dc.language.isoeng-
dc.subjectLegionella pneumophilaen_US
dc.subjectLegionnaires' diseaseen_US
dc.subjectbacterial genomicsen_US
dc.subjectmachine learningen_US
dc.subjectoutbreak controlen_US
dc.subjectpublic healthen_US
dc.subjectsource attributionen_US
dc.titleHigh performance Legionella pneumophila source attribution using genomics-based machine learning classification.en_US
dc.typeJournal Articleen_US
dc.identifier.journaltitleApplied and Environmental Microbiologyen_US
dc.identifier.affiliationDepartment of Microbiology and Immunology, Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia.;Center for Pathogen Genomics, University of Melbourne, Melbourne, Victoria, Australia.en_US
dc.identifier.affiliationBacterial Phylogenomics Group, Institut Pasteur du Cambodge, Phnom Penh, Cambodia.en_US
dc.identifier.affiliationInfectious Diseasesen_US
dc.identifier.affiliationCenter for Pathogen Genomics, University of Melbourne, Melbourne, Victoria, Australia.;Department of Microbiology and Immunology, Microbiology Diagnostic Unit, Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia.en_US
dc.identifier.doi10.1128/aem.01292-23en_US
dc.type.contentTexten_US
dc.identifier.orcid0000-0002-5984-1328en_US
dc.identifier.orcid0000-0003-0150-123Xen_US
dc.identifier.pubmedid38289130-
dc.description.startpagee0129223-
item.openairetypeJournal Article-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.fulltextNo Fulltext-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.languageiso639-1en-
crisitem.author.deptInfectious Diseases-
crisitem.author.deptMicrobiology-
Appears in Collections:Journal articles
Show simple item record

Page view(s)

44
checked on Nov 24, 2024

Google ScholarTM

Check


Items in AHRO are protected by copyright, with all rights reserved, unless otherwise indicated.