Introduction
Streptococcus pyogenes, also referred to as Group A Streptococcus (GAS) for harboring Lancefield group A antigen, is a clinically important human pathogen. Despite its limited prevalence in modern times as compared to other pathogens, the myriad of infections it causes are still commonly lethal. Streptococcal infections range from localized throat infections such as tonsillitis or pharyngitis, to invasive infections such as sepsis, necrotizing fasciitis and streptococcal toxic shock syndrome (STSS). Most of the infections are seen in children aged four to seven years, with penicillin still being effectively used for treatment. Drug resistant clones however, have been increasingly reported globally , and were attributed to environmental and intracellular resistance mechanisms . Treatment complications usually arise with the use of other antimicrobial agents when the patient is allergic to penicillin.
A number of genome-encoded virulence factors such as pili, M proteins, leukocidins, streptolysins, complement inhibiting proteins, immunoglobulin-degrading enzymes, and superantigens have been detected in S. pyogenes, in addition to efflux pumps and leukocyte evasion strategies. Interestingly horizontal gene transfer (HGT) and prophage integration are common amongst S. pyogenes genomes giving them plasticity and genomic variation. These factors collectively confer additional virulence and resistance capabilities and alter the regulation of existing genes. The emm gene, encoding the M protein contains conserved, semi-conserved, and hypervariable regions, and is thus used as an epidemiological marker for GAS. emm occurrence amongst GAS strains is linked to geographic localization.
Materials and Methods
Ethical Approval
Ethical approval was not required as clinical isolates were collected and stored as part of routine clinical care. Clinical isolates and patient records/information were anonymous and de-identified prior to analysis.
Bacterial isolates and genomic DNA extraction
This study was conducted on nine S. pyogenes bacterial isolates previously collected from the American University of Beirut Medical Center (AUB-MC). The samples were recovered from throat and pus swabs of patients with streptococcal infections during the period from August 2010 to November 2011, and were chosen to cover the common emm types in the country (Table 1). The isolates were cultured overnight on Trypticase Soy Agar (TSA) (Bio-Rad, USA) medium. DNA was extracted using the Nucleospin Tissue kit (Macherey-Nagel, Germany) following the manufacturer’s instructions.
Genome sequencing
DNA extracted from each S. pyogenes isolate (50-ng/sample) was prepared for sequencing with the use of the Nextera XT DNA Sample Prep Kit (Illumina). Clean up was performed using the AMPure XP PCR purification beads (Agencourt, Brea, CA, USA). The resulting individual DNA libraries with fragment sizes ranging from 500–1000 bp were quantified by quantitative PCR on a CFX96 (Bio-Rad, USA) in triplicate at two concentrations, 1:1000 and 1:2000, using the Kapa library quantification kit (Kapa Biosystems, Woburn, MA, USA). Based on the individual library concentrations, equimolar pools of the indexed libraries were prepared at a concentration of at least 1 nM using 10 mM Tris-HCl (pH 8.0) and 0.05% Tween 20. Pooled paired-end libraries were sequenced to a read length of at least 250 bp.
Genome assembly
De novo assembly of the sequenced genomes was done using A5 assembler (v. 20130627) with default assembly parameters. This pipeline automates the processes of data cleaning, error correction, contig assembly, scaffolding, and quality control .
Genome annotation and gene detection
The assembled genomes were annotated using the RAST server (http://rast.nmpdr.org) that uses subsystems technology to assign gene function. RAST was also used for the identification of protein encoding genes, rRNAs, and tRNAs . The SEED viewer service from RAST in combination with the web tools provided by the Center for Genomic Epidemiology (CGE) website (www.genomicepidemiology.org) were used to generate a detailed list of the genetic elements in question. The ResFinder 2.1 web server was used to identify acquired antimicrobial resistance genes present on the bacterial genome . Due to ResFinder’s inability to confirm the functional integrity, and levels of gene expression and resistance arising to acquired mutations in housekeeping genes, a hybrid resistance profile was generated using the ResFinder hits in addition to phenotypical data from previously published studies. The VirulenceFinder 1.2 service from the same website was used to identify additional virulence-specific genes . The PathogenFinder 1.1 service from CGE, was used to obtain an overview of the genomic pathogenic gene families .
Multi-locus sequence typing (MLST)
MLST typing of the isolates was carried out using CGE’s MLST 1.7 server to detect sequence polymorphisms within the gki, gtr, muri, muts, recp, xpt, and yqil genes .
Phage and mobile element detection
Phage detection was done using the publicly available Phage Search Tool (PHAST) (http://phast.wishartlab.com/index.html) . This tool provides the closest identity match for detected phages in addition to their site of integration. Putative insertion elements were double checked using BLASTx with an identity threshold of 80%. Putative phage insertion sequences were then annotated using RAST in order to determine the genes they encode.
Phylogenetic analysis
To determine the phylogenetic relatedness a concatenated marker gene maximum-likelihood tree was constructed using a number of S. pyogenes reference genomes chosen based on BLAST similarity results, clonal complexes and sequence types (MGAS6180 CP000056, MGAS10394 CP000003, M1476 AP012491, M1GAS SF370 NC002737, NZ131 CP000829, A20 CP003901, 7F7 PRJNA238516, and S. pneumoniae R6 AE007317 as an outlier strain). The genomes were first processed with PhyloSift , the tree was then constructed using FastTree , visualized and edited with Dendroscope . Pairwise alignment and visualization of our selected genomes with the respective reference strains was achieved through the Mauve aligner using defaults settings.
Results and Discussion
Sequencing resulted in an average of 2,503,465 paired-end reads per isolate, with the average being 1,976,732 high-quality reads following quality trimming and error correction. The average sequence coverage of the whole genomes was 293X and the minimum sequence coverage was 85X for the SP6 isolate. The average N50 for the assemblies was 196,439 bp with the lowest being 98,990 bp for SP7. The initial assemblies resulted in an average of 77 contigs per isolate all of which greater or equal to 500 bp in length. During scaffolding, some contigs were merged based on short overlaps and read-pair information, yielding a reduced final average of 70 contigs per isolate. The complete details and statistics of the sequencing and assembly results for each isolate are shown in Tables 2 & 3 .
how to find lineage of Streptococcal pyogenes via Using BLAST. lineage means it's ancestor/roots.
Find an example of Oxime preparation via a conventional heating method from literature and comment on how it differs from using a microwave. Discuss how oximes can be cleaved.
#2 sample means, and find the mean of this distribution of sample means How does it compare to the mean of the population? Exercise 2. The distribution of walking times for babies has a mean of μ-14 months and a standard deviation of σ-3 months. What is the distribution of sample means for samples of size n = 36? Exercise 3. For the problem considered in section 8.3, assume we got M15. What is the s-score, and do we eject...
How to find out the verbs in the text with part of speech tags using the constrains in Stanford CoreNLP. For example, in the text Supplementary_NNP Data_NNP are_VBP available_JJ at_IN NAR_NNP Online_NNP ._. I want to find out the verb of the sentence which must contain the word `online` I have many thses sentences, which has been added with part of speech suffix to each word. And I would like to use regex or other means to find out verb...
HOW DID THEY GET 1.833 USING EXCEL PLEASE HELP! Find the 90% confidence interval for difference between the two population means. The-critical value at the 90% confidence level with 9 degrees of freedom is, 1.833. (2.28) (1.79) (9)1.833),( 2+2.1693 (-0.1693,4.1693) Hence, the required confidence interva is, (-0.1693,4.1693)
9 Geometry via calculus In this exercise you will see one way to use calculus to do grometry a) Here is one way to find the perpendicsler distance from a point to a line L (no caleulus yet) Let's say L has equation y-3r+2 and the point is (2.1) First, make a graph (picture) of the situation 2Now find an equation for the line AM through (2, 1) perpendicular to L (draw it first, of course). 3. Find the (coordinates...
with job satisfaction? CC-12. Using job characteristics theory, explain why the present system of job design may be con- tributing to employee dissatisfaction. Describe some ways you could help employees feel more satisfied with their work by redesigning their jobs. 1 me idea about **** Repairing Jobs That Fail to Sati Learning Goals Companies often divid ciency, but speciali quences. Drain sed specialization petitors' costs often divide work as a way to impro Secialization can lead to negative DrainFlow is...
How can we assess whether a project is a success or a failure? This case presents two phases of a large business transformation project involving the implementation of an ERP system with the aim of creating an integrated company. The case illustrates some of the challenges associated with integration. It also presents the obstacles facing companies that undertake projects involving large information technology projects. Bombardier and Its Environment Joseph-Armand Bombardier was 15 years old when he built his first snowmobile...