mirror of https://github.com/MillironX/taxprofiler.git synced 2024-11-25 10:29:54 +00:00

Apply review suggestion

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
Sofia Stamouli 2022-12-20 15:42:41 +01:00 committed by GitHub
parent d7c4554b18
commit c6710db7a9

@@ -99,15 +99,23 @@ Note that the FASTQ files may _not_ always be the 'final' reads that go into tax
 ### BBDuk
+[BBDuk](https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbduk-guide/) stands for Decontamination Using Kmers. BBDuk was developed to combine most common data-quality-related trimming, filtering, and masking operations into a single high-performance tool.
+It is used in nf-core/taxprofiler for complexity filtering using different algorithms. This means that it will remove reads with low sequence diversity (e.g. mono- or dinucleotide repeats).
 <details markdown="1">
 <summary>Output files</summary>
-- `bbduk`
-- `<sample_id>.bbduk.log`
-- `<sample_id>.fastq.gz`
+- `bbduk/`
+- `<sample_id>.bbduk.log`: log file containing filtering statistics
+- `<sample_id>.fastq.gz`: resulting FASTQ file without low-complexity reads
 </details>
+By default nf-core/taxprofiler will only provide the `.log` file if BBDuk is selected as the complexity filtering tool. You will only find the complexity filtered reads in your results directory if you provide `--save_complexityfiltered_reads`.
+Note that the FASTQ file(s) may _not_ always be the 'final' reads that go into taxprofiling if you also run other steps such as host removal, run merging, etc.
 ### PRINSEQ++
 <details markdown="1">