Project

General

Profile

Wiki » History » Version 1

Ephie Geza, 02/01/2024 06:36 AM

1 1 Ephie Geza
# Wiki
2
3
## Data location:
4
The data in FASTQ format was sent via WeTransfer, downloaded and uploaded to ilifu
5
``` shell
6
/cbio/projects/032/rawdata/fastq
7
```
8
9
## Reference data:
10
Currently in project 033 on ilifu: Helicobacter pylori strain: MT5135 (RefSeq accession no. CP071982.1) (n=2)
11
``` shell
12
/cbio/projects/033/refs/Helicobacter_pylori_MT5135_refseq.fa 
13
```
14
15
## Workflow
16
17
### 1. QC:
18
Raw reads were quality checked and trimmed using the  kviljoen/fastq_QC pipeline which also trim adpters and filter quality based bbduk
19
``` shell
20
# For nextflow DSL1 pipeline
21
module load nextflow/22.10.7
22
23
nextflow run kviljoen/fastq_QC --reads '/cbio/projects/032/rawdata/fastq/*_R{1,2}_001.fastq.gz' -profile ilifu -resume --email "ephie.geza@uct.ac.za"
24
```
25
QC reports for raw reads and after trimming can be found in the 
26
### 2. AMR profiling
27
The preference from Clinton is to do AMR profiling with the ResFinder DB. I'm getting errors there that I think relate to the header formatting though so in the interim have run with the ARG_annot DB that we used for previous projects as:
28
29
ARGannot