2021-03-26 23:25:40 +00:00
name : gatk4_markduplicates
description : This tool locates and tags duplicate reads in a BAM or SAM file, where duplicate reads are defined as originating from a single fragment of DNA.
keywords :
- markduplicates
- bam
- sort
tools :
- gatk4 :
2022-02-15 11:15:27 +00:00
description :
Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools
2021-03-26 23:25:40 +00:00
with a primary focus on variant discovery and genotyping. Its powerful processing engine
and high-performance computing features make it capable of taking on projects of any size.
homepage : https://gatk.broadinstitute.org/hc/en-us
documentation : https://gatk.broadinstitute.org/hc/en-us/articles/360037052812-MarkDuplicates-Picard-
tool_dev_url : https://github.com/broadinstitute/gatk
doi : 10.1158 /1538-7445.AM2017-3590
2022-02-15 11:15:27 +00:00
licence : [ "MIT" ]
2021-03-26 23:25:40 +00:00
input :
- meta :
type : map
description : |
Groovy Map containing sample information
e.g. [ id:'test', single_end:false ]
- bam :
type : file
description : Sorted BAM file
pattern : "*.{bam}"
output :
- meta :
type : map
description : |
Groovy Map containing sample information
e.g. [ id:'test', single_end:false ]
2021-10-03 07:20:26 +00:00
- versions :
2021-03-26 23:25:40 +00:00
type : file
2021-10-03 07:20:26 +00:00
description : File containing software versions
2021-09-27 08:41:24 +00:00
pattern : "versions.yml"
2021-03-26 23:25:40 +00:00
- bam :
type : file
description : Marked duplicates BAM file
pattern : "*.{bam}"
- metrics :
type : file
description : Duplicate metrics file generated by GATK
pattern : "*.{metrics.txt}"
authors :
- "@ajodeh-juma"
2021-10-29 11:01:05 +00:00
- "@FriederikeHanssen"