TY - JOUR
T1 - Benchmarking of methods for identification of antimicrobial resistance genes in bacterial whole genome data
AU - Clausen, Philip T. L. C.
AU - Zankari, Ea
AU - Aarestrup, Frank Møller
AU - Lund, Ole
PY - 2016
Y1 - 2016
N2 - Next generation sequencing (NGS) may be an alternative to phenotypic susceptibility testing for surveillance and clinical diagnosis. However, current bioinformatics methods may be associated with false positives and negatives. In this study, a novel mapping method was developed and benchmarked to two different methods in current use for identification of antibiotic resistance genes in bacterial WGS data. A novel method, KmerResistance, which examines the co-occurrence of k-mers between the WGS data and a database of resistance genes, was developed. The performance of this method was compared with two previously described methods; ResFinder and SRST2, which use an assembly/BLAST method and BWA, respectively, using two datasets with a total of 339 isolates, covering five species, originating from the Oxford University Hospitals NHS Trust and Danish pig farms. The predicted resistance was compared with the observed phenotypes for all isolates. To challenge further the sensitivity of the in silico methods, the datasets were also down-sampled to 1% of the reads and reanalysed. The best results were obtained by identification of resistance genes by mapping directly against the raw reads. This indicates that information might be lost during assembly. KmerResistance performed significantly better than the other methods, when data were contaminated or only contained few sequence reads. Read mapping is superior to assembly-based methods and the new KmerResistance seemingly outperforms currently available methods particularly when including datasets with few reads.
AB - Next generation sequencing (NGS) may be an alternative to phenotypic susceptibility testing for surveillance and clinical diagnosis. However, current bioinformatics methods may be associated with false positives and negatives. In this study, a novel mapping method was developed and benchmarked to two different methods in current use for identification of antibiotic resistance genes in bacterial WGS data. A novel method, KmerResistance, which examines the co-occurrence of k-mers between the WGS data and a database of resistance genes, was developed. The performance of this method was compared with two previously described methods; ResFinder and SRST2, which use an assembly/BLAST method and BWA, respectively, using two datasets with a total of 339 isolates, covering five species, originating from the Oxford University Hospitals NHS Trust and Danish pig farms. The predicted resistance was compared with the observed phenotypes for all isolates. To challenge further the sensitivity of the in silico methods, the datasets were also down-sampled to 1% of the reads and reanalysed. The best results were obtained by identification of resistance genes by mapping directly against the raw reads. This indicates that information might be lost during assembly. KmerResistance performed significantly better than the other methods, when data were contaminated or only contained few sequence reads. Read mapping is superior to assembly-based methods and the new KmerResistance seemingly outperforms currently available methods particularly when including datasets with few reads.
U2 - 10.1093/jac/dkw184
DO - 10.1093/jac/dkw184
M3 - Journal article
C2 - 27365186
SN - 0305-7453
VL - 71
SP - 2484
EP - 2488
JO - Journal of Antimicrobial Chemotherapy
JF - Journal of Antimicrobial Chemotherapy
IS - 9
ER -