Wiki » History » Version 5
Katie Lennard, 09/20/2022 11:01 AM
| 1 | 1 | Katie Lennard | # Wiki |
|---|---|---|---|
| 2 | |||
| 3 | # Data location: |
||
| 4 | |||
| 5 | The data was transferred from Athena medmicro): |
||
| 6 | |||
| 7 | ``` |
||
| 8 | /MedMicro/Clinton/CRE Pfizer Feb 2022/CRE study_1A_results_17022022 |
||
| 9 | /MedMicro/Clinton/CRE Pfizer Feb 2022/CRE study_1B_results_21022022 |
||
| 10 | ``` |
||
| 11 | |||
| 12 | to Ilifu: |
||
| 13 | |||
| 14 | ``` |
||
| 15 | /scratch3/users/katiel/Clinton/CRE_study_August_2022/ |
||
| 16 | ``` |
||
| 17 | |||
| 18 | 4 | Katie Lennard | # Reference data: |
| 19 | 1 | Katie Lennard | |
| 20 | Klebsiella pneumoniae – strain HS11286 (GenBank accession no. CP003200.1) (n=18); |
||
| 21 | Serratia marcescens – strain KS10 (GenBank accession no. CP027798.1) (n=3); |
||
| 22 | 2 | Katie Lennard | Escherichia coli – strain ATCC 25922 (GenBank accession no. CP009072.1) (n=1); and |
| 23 | 1 | Katie Lennard | Enterobacter cloacae – strain ATCC 13047 (GenBank accession no. NC_014121.1) (n=1). |
| 24 | |||
| 25 | 2 | Katie Lennard | ``` |
| 26 | /scratch3/users/katiel/Clinton/CRE_study_August_2022/ref_genomes |
||
| 27 | ``` |
||
| 28 | |||
| 29 | 4 | Katie Lennard | # Objectives workflow: |
| 30 | 2 | Katie Lennard | ![workflow.png]() |
| 31 | 3 | Katie Lennard | |
| 32 | 4 | Katie Lennard | # QC: |
| 33 | 3 | Katie Lennard | 11 sample failed QC phred scores before trimming and filtering; none failed after filtering and trimming. Filtering and trimming were executed as follows: |
| 34 | |||
| 35 | ``` |
||
| 36 | 1 | Katie Lennard | nextflow run kviljoen/fastq_QC --reads '/scratch3/users/katiel/Clinton/CRE_study_August_2022/raw/study_1A_B_combined/*_R{1,2}_001.fastq.gz' -profile ilifu |
| 37 | ``` |
||
| 38 | QC reports can be found in the 'files' tab |
||
| 39 | 4 | Katie Lennard | |
| 40 | # AMR profiling |
||
| 41 | The preference from Clinton is to do AMR profiling with the ResFinder DB. I'm getting errors there that I think relate to the header formatting though so in the interim have run with the ARG_annot DB that we used for previous projects as: |
||
| 42 | ``` |
||
| 43 | nextflow run kviljoen/uct-srst2 --reads '/scratch3/users/katiel/Clinton/CRE_study_August_2022/2022-09-19-fastq_QC/bbduk/*_{1,2}.fq' -profile ilifu --gene_db /cbio/users/katie/Nicol/Ps_aerug_srst2_MLST/ARGannot_r3.fasta --outdir /scratch3/users/katiel/Clinton/CRE_study_August_2022/srst2_resFinder/coverage_80_run --min_gene_cov 80 |
||
| 44 | ``` |
||
| 45 | 5 | Katie Lennard | |
| 46 | CARD DB: This database is the recommended by srst2 and has been formatted by them already. The DB was downloaded with: |
||
| 47 | |||
| 48 | ``` |
||
| 49 | wget https://github.com/katholt/srst2/blob/master/data/CARD_v3.0.8_SRST2.fasta?raw=true -O CARD_v3.0.8_SRST2.fasta |
||
| 50 | ``` |