2021-03-27 02:25:40 +03:00
name : gatk4_markduplicates
description : This tool locates and tags duplicate reads in a BAM or SAM file, where duplicate reads are defined as originating from a single fragment of DNA.
keywords :
- markduplicates
- bam
- sort
tools :
- gatk4 :
description :
Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools
with a primary focus on variant discovery and genotyping. Its powerful processing engine
and high-performance computing features make it capable of taking on projects of any size.
homepage : https://gatk.broadinstitute.org/hc/en-us
documentation : https://gatk.broadinstitute.org/hc/en-us/articles/360037052812-MarkDuplicates-Picard-
tool_dev_url : https://github.com/broadinstitute/gatk
doi : 10.1158/1538-7445.AM2017-3590
licence : [ "MIT" ]
input :
- meta :
type : map
description : |
Groovy Map containing sample information
e.g. [ id:'test', single_end:false ]
- bam :
type : file
description : Sorted BAM file
pattern : "*.{bam}"
output :
- meta :
type : map
description : |
Groovy Map containing sample information
e.g. [ id:'test', single_end:false ]
- versions :
type : file
description : File containing software versions
pattern : "versions.yml"
- bam :
type : file
description : BAM file with duplicate reads marked
pattern : "*.{bam}"
- metrics :
type : file
description : Duplicate metrics file generated by GATK
pattern : "*.{metrics.txt}"
authors :
- "@ajodeh-juma"
- "@FriederikeHanssen"
- "@maxulysse"