Commit graph

48 commits

Author SHA1 Message Date
Jose Espinosa-Carrasco
fd1189c2f5
Updating test file for optitype (#782)
* Provide an exisiting bam file for optitype

* Update main.nf

Attempt at fixing this with new testing data

* Trying slightly different approach

* Mini fixes, not sure whats wrong here

* Add bam file with NM tags in all reads for optitype

Co-authored-by: Alexander Peltzer <apeltzer@users.noreply.github.com>
Co-authored-by: Alexander Peltzer <alexander.peltzer@boehringer-ingelheim.com>
2021-10-28 14:24:52 +02:00
James A. Fellows Yates
bd2baa1e7c
New module: maxbin2 (#895)
* Specify more guidelines on input channels

* Linting

* Updates based on code review

* Update README.md

* Fix broken sentence

* feat: add megahit module, currently decompressed output

* Update main.nf

* Update tests/modules/megahit/test.yml

Co-authored-by: Maxime Borry <maxibor@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* feat: compress all outputs, remove md5sums due to gz stochasicity

* fix: wrong conda channel for pigz

* fix: broken singleend tests and update meta.yml

* Missed one

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* fix: pigz formatting

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

* Added, just need to finish tests once we have bacterial data

* Add prelim test data

* Fix version reporting

* Add tests based on proposed test-dataset

* Finalise new testdata

* Fix md5sum issue by removing it...

* Update main.nf

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Maxime Borry <maxibor@users.noreply.github.com>
2021-10-26 22:07:33 +01:00
Gregor Sturm
2ad98162f3
Notebook modules (#617)
* Draft rmarkdown module

* stub jupyter notebook module

* Create yaml file with params

* Update meta.yml for rmarkdown module

* Add comment on YAML

* Update notebooks module, clean up parametrize.nf

* Two separate channels for parameters and input files

* Fix Rmd render script

* Add tests for rmarkdown

* Fix tests for rmarkdown module

* Update checksums

* Fix tests for jupyter

* Test without Grab()

* Update software versions

* update rmarkdown dependencies

* Draft for multiple versions

* Fix indent of script

* Fix indent in rmarkdown script

* Emit version.syml

* Update modules/rmarkdown/main.nf

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

* Update modules/rmarkdown/meta.yml

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

* Update modules/rmarkdown/meta.yml

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

* Rename rmarkdown to rmarkdownnotebook

* Add rmarkdown mulled biocontainer

* Write sessionInfo to separate log file

* Update rmarkdownnotebook

* Sessioninfo does not have a stable md5sum

* Update jupyternotebook

* Update meta

* Add jupyternotebook biocontainers

* Handle Groovy Gstrings in parameterize

* Update to versions.yml

* Update functions.nf

* Fix versions yaml

* Fix EC lint

* Update modules/rmarkdownnotebook/main.nf

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

* Update modules/jupyternotebook/main.nf

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

* Use official test data

* Harshilify

* Make parameters channel clearer

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

* Update main.nf

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-24 20:51:56 +02:00
Sébastien Guizard
d3369789da
New module: gstama/merge (#813)
* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: Re add 10000 data test

* 👌 IMPROVE: Add some pbindex

* 🐛 FIX: Add pbi extension to files

* 📦 NEW: Add galgal6 chr30 test data

* 📦 NEW: Add bamtools module

* 👌 IMPROVE: ignore test data

* 👌 IMPROVE : add test bed files

* 📦 NEW: Add gstama/merge module

* 🐛 FIX: Change process label

* 👌 IMPROVE: do not merge empty bed

* 🐛 FIX: Change 0 lines files detection

* 🐛 FIX: replace spaces by tab

* 🐛 FIX: Remove tuple for report channel and add version output channel

* 👌 IMPROVE: Update to last templates version

* 👌 IMPROVE: Update module to last template version

* 👌 IMPROVE: Final version of test datasets config

* 👌 IMPROVE: Update test

* 👌 IMPROVE: Remove useless index + Fix Typos

* 👌 IMPROVE: Fix Typos

* 👌 IMPROVE: Updates + clean code

- Update to last versions.yml file
- Better output channels
- Update meta.yml

* 👌 IMPROVE: Correct typo

* 👌 IMPROVE: Remove included filelist creation and add an input channel

* 🐛 FIX: Correct typo

* 👌 IMPROVE: Add filelist file

* 🐛 FIX: tama_merge.py emit a version number

* Update modules/gstama/merge/meta.yml

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

* 👌 IMPROVE: Update meta.yml

* Update main.nf

* Apply suggestions from code review

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-23 19:00:39 +01:00
Jose Espinosa-Carrasco
32f6191aca
New module: genrich (#877)
* Add genrich module

* Rearrange genrich module from genrich/genrich to genrich

* Remove copy/paste code

* Fix meta.yml

* Implement save_duplicates independently of -r opt
2021-10-22 12:22:02 +02:00
Benjamin Wingfield
4ed5e4eff3
New module: ucsc/liftover (#868)
* add liftOver module

* add liftover module tests

* fix getProcessName

* fix tests

* fix out of date function

* version numbers should be numeric

* drop versions.yml from test.yml

* Update modules/ucsc/liftover/main.nf

Remove software name variable

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* Update tests/modules/ucsc/liftover/main.nf

Use test chain file

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* add genome_chain_gz to test data config

* update md5sum for new chain test data

* Fix indentation in file declaration

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-10-21 18:04:15 +02:00
Chris Cheshire
4d89d6b2f0
I accidently deleted a branch before the PR merged for PR 800 (#881)
* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* Updated seacr callpeak to include a control threshold

* Whitespace

Co-authored-by: Sviatoslav Sidorov <sviatoslav.sidorov@crick.ac.uk>
Co-authored-by: Svyatoslav Sidorov <svet.sidorov@gmail.com>
2021-10-21 12:28:59 +01:00
James A. Fellows Yates
97fe899f79
fix: reduce number of required input files for damage profiler (#612)
* Reduce number of required input files for damage profiler

* Remove rebugging

* Add optional species list file.

* Working pending updated test-dataset update

* Add genome header to config
2021-10-20 10:02:30 +02:00
FriederikeHanssen
b6486ef8b4
Add compressed bed files (#864)
* add cram files to config

* Update tests/config/test_data.config

* Add compressed bed file

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-15 10:10:53 +01:00
FriederikeHanssen
d0e4ee3ccd
Add cram files to config (#862)
* add cram files to config

* Update tests/config/test_data.config

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-15 09:40:16 +01:00
Moritz E. Beber
de997825de
Add a new module for fasterq-dump (#807)
* chore: use template to create fasterq module

* feat: add fasterq-dump process module

* docs: provide input and output descriptions

* docs: add comment on `--temp`

* fix: use correct variable

* tests: define test output

* refactor: address review comments

* refactor: remove vdb-config input

* chore: add new test data to config

* tests: define single-end and paired-end cases

* refactor: choose specific output

* tests: do not expect single FASTQ for paired-end

* feat: add compression

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* tests: revert the test data name

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-12 14:20:58 +01:00
GCJMackenzie
f479d4fb8d
Learnreadorientationmodel (#794)
* files for learnreadorientationmodel initialised for first commit

* finished scripts and yml files. test working locally but needs an f1r2 test data on nf-core before it can be submitted

* updated test data location

* versions file updated, test data added

* updated versions file, edited test file

* small formatting update to main.nf

* Update main.nf

* Update test_data.config

* updated tests main.nf

* Update test_data.config

* Apply suggestions from code review

* Update modules/gatk4/learnreadorientationmodel/main.nf

* Update modules/gatk4/learnreadorientationmodel/meta.yml

* fixed tests failing

Co-authored-by: GCJMackenzie <gavin.mackenzie@nibsc.org>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-07 13:55:06 +01:00
GCJMackenzie
aa32a8a72e
new module: gatk4/calculatecontamination (#778)
* initiated files for calculate contamination

* pushing local repo to remote

* created script, filled in meta yml, created tests and test yml. local checks passing, needs repo side test data

* added option and tests for outputting optional segmentation file

* saving for test push

* versions updated, test data added

* Update main.nf

* fixed versions info, should report correctly now

* small update to main.nf outputs formatting

* Apply suggestions from code review

* Update test_data.config

* Apply suggestions from code review

Co-authored-by: GCJMackenzie <gavin.mackenzie@nibsc.org>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-10-05 22:52:00 +01:00
Robert A. Petit III
5c463ca6b4
few version and test data config fixes (#758)
* update kleborate version info

* fix stderr capture

* few more version fixes

* fix version info on more tools
2021-09-28 20:42:04 +01:00
Sébastien Guizard
4ec8b025bd
New module: LIMA (#719)
* 📦 NEW: Add module lima

* 👌 IMPROVE: Move .pbi output to reports channel

* 🐛 FIX: Fix report channel definition

* 👌IMPROVE; Remove options from command line

update test script with removed options

* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: Re add 10000 data test

* 🐛 FIX: Add pbi input

* 👌 IMPROVE: Add parallelization to lima

* 👌 IMPROVE: Add some pbindex

* 🐛 FIX: Add pbi extension to files

* 👌 IMPROVE: The accept one channel (primers move into the first channel)

* 👌 IMPROVE: Assign a value channel for pimers 

Improve code workflow readability

* 👌 IMPROVE: Update .gitignore

* 👌 IMPROVE: Update module to last template version

* 🐛 FIX: Correct Singularity and Docker URL

* 👌 IMPROVE: Update to the last version of modules template

* 👌 IMPROVE: Update test_data.config

* 👌 IMPROVE: Remove pbi from input files

* 👌 IMPROVE: Final version of test datasets config

* 👌 IMPROVE: Remove useless index + Fix Typos

* 🐛 FIX: Fill contains args

* 📦 NEW: Add module lima

* 👌 IMPROVE: Move .pbi output to reports channel

* 🐛 FIX: Fix report channel definition

* 👌IMPROVE; Remove options from command line

update test script with removed options

* 🐛 FIX: Add pbi input

* 👌 IMPROVE: Add parallelization to lima

* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: Re add 10000 data test

* 👌 IMPROVE: Add some pbindex

* 🐛 FIX: Add pbi extension to files

* 👌 IMPROVE: The accept one channel (primers move into the first channel)

* 👌 IMPROVE: Assign a value channel for pimers 

Improve code workflow readability

* 👌 IMPROVE: Update .gitignore

* 👌 IMPROVE: Update module to last template version

* 🐛 FIX: Correct Singularity and Docker URL

* 👌 IMPROVE: Update to the last version of modules template

* 👌 IMPROVE: Update test_data.config

* 👌 IMPROVE: Remove pbi from input files

* 👌 IMPROVE: Final version of test datasets config

* 👌 IMPROVE: Remove useless index + Fix Typos

* 🐛 FIX: Fill contains args

* 👌 IMPROVE: Add channel for each output

* 👌 IMPROVE: Remove comments

* 📦 NEW: Add module lima

* 👌 IMPROVE: Move .pbi output to reports channel

* 🐛 FIX: Fix report channel definition

* 👌IMPROVE; Remove options from command line

update test script with removed options

* 🐛 FIX: Add pbi input

* 👌 IMPROVE: Add parallelization to lima

* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: Re add 10000 data test

* 👌 IMPROVE: Add some pbindex

* 🐛 FIX: Add pbi extension to files

* 👌 IMPROVE: The accept one channel (primers move into the first channel)

* 👌 IMPROVE: Assign a value channel for pimers 

Improve code workflow readability

* 👌 IMPROVE: Update module to last template version

* 🐛 FIX: Correct Singularity and Docker URL

* 👌 IMPROVE: Update to the last version of modules template

* 👌 IMPROVE: Update test_data.config

* 👌 IMPROVE: Remove pbi from input files

* 🐛 FIX: Fill contains args

* 📦 NEW: Add module lima

* 👌 IMPROVE: Move .pbi output to reports channel

* 🐛 FIX: Fix report channel definition

* 👌IMPROVE; Remove options from command line

update test script with removed options

* 🐛 FIX: Add pbi input

* 👌 IMPROVE: Add parallelization to lima

* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: Re add 10000 data test

* 👌 IMPROVE: Add some pbindex

* 🐛 FIX: Add pbi extension to files

* 👌 IMPROVE: The accept one channel (primers move into the first channel)

* 👌 IMPROVE: Assign a value channel for pimers 

Improve code workflow readability

* 👌 IMPROVE: Update module to last template version

* 🐛 FIX: Correct Singularity and Docker URL

* 👌 IMPROVE: Update to the last version of modules template

* 👌 IMPROVE: Update test_data.config

* 👌 IMPROVE: Remove pbi from input files

* 👌 IMPROVE: Final version of test datasets config

* 👌 IMPROVE: Remove useless index + Fix Typos

* 🐛 FIX: Fill contains args

* 👌 IMPROVE: Add channel for each output

* 👌 IMPROVE: Remove comments

* 🐛 FIX: Clean test_data.config

* Update modules/lima/main.nf

Add meta to each output

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update modules/lima/main.nf

Remove useless parenthesis

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* 🐛 FIX: Keep version number only

* 🐛 FIX: Reintegrate prefix variable and use it to define output file name

* 👌 IMPROVE: add suffix arg to check output files names

* 👌 IMPROVE: Use prefix for output filename

* 🐛 FIX: Set optional output

Allow usage of different input formats

* 👌 IMPROVE: Update meta file

* 👌 IMPROVE: Update test

One test for each input file type

* 👌 IMPROVE: add fasta, fastq.gz, fastq, fastq.gz test files

* 👌 IMPROVE: Update with last templates / Follow new version.yaml rule

* 🐛 FIX: Fix typos and include getProcessName function

* 👌 IMPROVE: Update .gitignore

* 👌 IMPROVE: Using suffix to manage output was not a my best idea

Add a bash code to detect extension and update output file name

* 👌 IMPROVE: clean code

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Gregor Sturm <mail@gregor-sturm.de>
Co-authored-by: Mahesh Binzer-Panchal <mahesh.binzer-panchal@nbis.se>
2021-09-27 16:14:35 +01:00
James A. Fellows Yates
7830a4a80c
add: MALTEXTRACT (#725)
* Specify more guidelines on input channels

* Linting

* Updates based on code review

* Update README.md

* Fix broken sentence

* Start maltextract module

* start tests

* Get tests working now we have test data

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Changes after review

* Update tests/modules/maltextract/main.nf

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* Update tests/modules/maltextract/main.nf

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* Update tests/modules/maltextract/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-09-21 19:57:27 +01:00
Sébastien Guizard
bbf268c5d3
new module: pbccs (#688)
* 📦 NEW: First commit of pbccs module

* 👌 IMPROVE: Remove option from command + rename output (ccs -> bam)

* 👌 IMPROVE: Move .pbi output  into report channel

* 🐛FIX: Correct code after --rq option removal from command line module

- module main.nf: Remove ramaining rq input channel
- Test main.nf: Transfert rq into addParams
- Test test.yml: Update md5sums

* 🐛FIX: Repair additionnal option usage

* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: CCS is run in parallel with --chunk option

* 👌 IMPROVE: Add Pbindex in bam ouput channel

* 👌 IMPROVE: Change label to process_low

* 👌 IMPROVE: Define reports files names + add json version of txt report

* 🐛 FIX: Add missing backslashes

* 🐛 FIX: Add missing gz extension

* 🐛 FIX: update ouput channel

* 🐛 FIX: output file name

* 👌 IMPROVE: .gitignore

* 👌 IMPROVE: Update function.nf to last version

* 👌 IMPROVE: Update saveAs in main.nf

* 👌 IMPROVE: Add pbccs module

* 🐛 FIX: Fix Broken test

* 👌 IMPROVE: Update test_data.config

* 🐛 FIX: Fix test

* 👌 IMPROVE: Update path of test dataset files

* 👌 IMPROVE: Remove useless index + Fix Typos

* 📦 NEW: First commit of pbccs module

* 👌 IMPROVE: Remove option from command + rename output (ccs -> bam)

* 👌 IMPROVE: Move .pbi output  into report channel

* 🐛FIX: Correct code after --rq option removal from command line module

- module main.nf: Remove ramaining rq input channel
- Test main.nf: Transfert rq into addParams
- Test test.yml: Update md5sums

* 🐛FIX: Repair additionnal option usage

* 👌 IMPROVE: Add some pacbio test files

* 🐛 FIX: Add Pacbio index to test_data.config

* 👌 IMPROVE: CCS is run in parallel with --chunk option

* 👌 IMPROVE: Add Pbindex in bam ouput channel

* 👌 IMPROVE: Change label to process_low

* 👌 IMPROVE: Define reports files names + add json version of txt report

* 🐛 FIX: Add missing backslashes

* 🐛 FIX: Add missing gz extension

* 🐛 FIX: update ouput channel

* 🐛 FIX: output file name

* 👌 IMPROVE: .gitignore

* 👌 IMPROVE: Update function.nf to last version

* 👌 IMPROVE: Update saveAs in main.nf

* 👌 IMPROVE: Add pbccs module

* 🐛 FIX: Fix Broken test

* 👌 IMPROVE: Update test_data.config

* 🐛 FIX: Fix test

* 👌 IMPROVE: Update path of test dataset files

* 👌 IMPROVE: Remove useless index + Fix Typos

* 🐛 FIX: fill contains args

* 👌 IMPROVE: One output => One Channel

* 👌 IMPROVE: One input => One channel

* 🐛 FIX: Update tests

* 🐛 FIX: Remove TODOs from test.yaml

* 👌 IMPROVE: Revert and keep bam and pbi together

* 🐛 FIX: Remove old rq input from meta.yml

* 👌 IMPROVE: Update test to match input channels

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
2021-09-16 11:48:18 +01:00
Abhinav Sharma
c5634ba99b
Add bcftools/view module (#669)
* initial commit [ci skip]

* add stubs [ci skip]

* Finalize the bcftools/view module

* accomodate optional files

* add optional region file

* refer the targets file in the tests

* all tests passing

* documen the optional parameters
2021-09-12 19:51:40 +02:00
Anders Jemt
b261c1f549
Expansionhunter (#666)
Please enter the commit message for your changes. Lines starting

* adds expansionhunter module

Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
2021-08-16 17:47:42 +02:00
James A. Fellows Yates
e01a98a704
module: unzip (#642)
* Specify more guidelines on input channels

* Linting

* Updates based on code review

* Update README.md

* Fix broken sentence

* Add unzip module

* Remove missing TODOs update mtea

* Apply changes after code-review from @grst

* Account for user trying to supply two input archives

* Remove debugging test

* Update modules/unzip/main.nf

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* Correct output path

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-08-02 09:40:57 +02:00
James A. Fellows Yates
3cc43838e1
module: picard filtersamreads (#602)
* Start work filtersamreads

* Refactored to allow optional input

* Use proper readlist test data

* Remove typo

* Fix if else condition

* Remove debugging code

* Fix container URLs

* Add required input specification meta

* Cleanup

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Fix suffixing

* Additional formatting tweaks

* Update modules/picard/filtersamreads/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update modules/picard/filtersamreads/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-21 15:22:19 +01:00
Harshil Patel
e4df336294
Add hifiasm module (#596)
* Add hifiasm module

* Tidy up module

* Add pacbio files to test_data.config
2021-07-20 21:50:22 +01:00
Kevin Menden
fce4e60864
Fix rsem (#555)
* bump bcftools filter

* fix bcftools

* rsem/preparereference

* added calculateexpression

* meta.yml and pytest_software

* paired end

* add test.yml
2021-07-02 13:09:48 +02:00
Charles Plessy
207930139a
New last/lastal module to align query sequences on a target index (#510)
* New last/lastal to align query sequences on a target index

`lastal` is the main program of the [LAST](https://gitlab.com/mcfrith/last)
suite.  It align query DNA sequences in FASTA or FASTQ format to a
target index of DNA or protein sequences.  The index is produced by
the `lastdb` program (module `last/lastdb`).  The score matrix for
evaluating the alignment can be chosen among preset ones or computed
iteratively by the `last-train` program (module `last/train`).  For
this reason, the `last/lastal` module proposed here has one input
channel containing an optional file, that has to be dummy when not used.

The LAST aligner outputs MAF files that can be very large (up to
hundreds of gigabytes), therefore this module unconditionally compresses
its output with gzip.

This new module is part of the work described in Issue #464.  During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Un-hardcode the path to the LAST index.

Among multiple alternatives I have chosen the following command to
detect the sample name of the index, because it fails in situations
where there is no index files in the index folder, and in situations
were there are two indexes files in the folder.  Not failing would
result in feeding garbage information in the INDEX_NAME variable.

    basename \$(ls $index/*.bck) .bck

In case of missing file, a clear error message is given by `ls`.  In
case of more than one file, the error message of `basename` is more
cryptic, unfortunately.  (`basename: extra operand ‘.bck’`)

Alternatives that do not fail if there is no .bck file:

    basename $index/*bck .bck
    find $index -name '*bck' | sed 's/.bck//'

Alternatives that do not fail if there are more than one .bck file:

    basename -s .bck $index/*bck
    ls $index/*.bck | xargs basename -s .bck
    find $index -name '*bck' | sed 's/.bck//'

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-25 22:10:48 +01:00
Charles Plessy
e75f88c68a
New module last/mafswap to reorder sequences in alignments (#500)
* New module last/mafswap to reorder sequences in alignments

The `maf-swap` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
reorders sequences in alignment files in Multiple Alignment Format.
When run without command-line arguments, it will swap the target and the
query sequences.  This is useful when turning a many-to-many alignment
into a many-to-one and then a one-to-one alignment in conjunction with
the `last-split` command (split, swap, split and swap again).

The LAST aligner outputs MAF files, but other tools also use this
format.  As MAF files can be very large (up to hundreds of gigabytes),
the module expects its input to be compressed with gzip and will
compress its output.

This new module is part of the work described in Issue #464.  During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).

* Update MD5 sum.

Actually, 7029066c27ac6f5ef18d660d5741979a is the MD5 sum of
an empty file compressed with `gzip --no-name`…  This happened
because I forgot to update the config file after correcting the
module… sorry !

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Change name as suggested in pull request.

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-19 08:59:23 +01:00
aleksandrabliznina
b592cea30b
New last/train module to train alignment parameters. (#492)
* New last/train module to train alignment parameters.

The last-train command creates a parameter file that
will be used by last/lastal module for sequence alignment.
It takes indexed sequences and query sequences as input
and we use the metadata of both to create an id of the
parameter output file.

Submission of the LAST modules is discussed in more
details in the issue #464. For consistancy, we use LAST
version 1219 for this whole development and will upgrade later.

* Corrected files according to the nf-core v1.14 standards.

* Fixed function.nf file for the last-train module.

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Find index name.

* Correct after the input channels were changed.

* Use double underscore as a name separator.

Single underscores can happen in ids, therefore, we would like to keep two underscores.

* Remove extra spaces.

* Fixed the passing of the "score matrix" line.

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/last/train/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-19 08:37:08 +01:00
Jose Espinosa-Carrasco
f8ea9828cd
Add artic minion (#486)
* Add artic minion module

* Add fast5 to test data configuration

* Add test for artic minion
2021-05-07 16:37:35 +01:00
Kevin Menden
5e86245388
updated paths to test data (#478)
* updated paths to test data

* Update test_data.config

changed file names
2021-05-03 07:13:07 +01:00
Harshil Patel
466ab67808
Fixes for nanoseq modules (#479)
* Fix minimap2 index module

* Fix minimap2 index tests

* Fix graphmap2 index module

* Fix graphmap2 module

* Fix ECLint

* Fix bedtools bamtobed module

* Fix tests for bedtools bamtobed module

* Add tag for graphmap2 align module

* Fix EClint

* Fix qcat module

* Add md5sum for graphmap2/align module

* Remove non-started test data file

* Remove md5sum for graphmap2 align
2021-04-30 15:57:43 +01:00
Yuk Kei Wan
05f479f03a
add qcat module (from nanoseq modules) (#469)
* add qcat module

* remove md5sum(nom-reproducible)
2021-04-30 13:20:56 +01:00
FriederikeHanssen
9ce4427275
Add gvcf index files (#472) 2021-04-28 20:23:10 +01:00
FriederikeHanssen
ae154b8c3f
Add human data paths (#458)
* Add new human data and fix sarscov paths

* Fix filename typo

* Apply code review

* replace index with to match sarscov data

* lower case

* indent everythin

* Adapt sarscov keys to new naming convention

* Update test_data.config

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-19 15:30:43 +01:00
Jose Espinosa-Carrasco
750bd8c3e3
Finish deeptools modules (#442)
* Adding bigWig and deeptools computeMatrix files to config

* Adding meta.yml for deeptools modules

* Add test for deeptools modules

* Fixing and reordering tags

* Fixing conda test that worked in local...

* Apply suggestions from code review

* Changing bigwig file pattern to include bigwig extension

* Saving after last change is a good practice

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-13 21:48:43 +01:00
Jose Espinosa-Carrasco
a9fcbd93cc
Move assembly test files from genome to illumina (#441)
* FIx plasmidid tests for new contigs.fasta file

* Fixing two md5sum hashes

* Update test path in config for illumina assembly files

* Update modules using assembly files

* Correctly setting path of assembly files

* Update tests/config/test_data.config

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-13 12:52:11 +02:00
Anthony Underwood
df8910de4e
Iqtree (#435)
* Resolve suggests after PR review

* add newline to functions

* need variable interpolation using double quotes; remove unnecessary tag

* add a more resilient link to raw github files

* remove trailing slash

* Update software/iqtree/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-12 17:01:26 +01:00
Jose Espinosa-Carrasco
3ac21ff0dc
Add abacas module (#431)
* Add abacas module

* Add test for abacas module

* Add Harshil to authorship

* Updating test with the data uploaded to nf-core/datasets

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-12 15:37:08 +01:00
Kevin Menden
0b402ffda8
Move test data to nf-core/test-datasets (#432)
* initial 'modules create' of minia

* fixed tests

* finished meta.yml

* fixed filters.yml

* resolved issues in pytest_software.yml

* add newline

* Update software/minia/main.nf

* fixing a bunch of module tests

* remove vscode

* fixed minia

* move test data directory to nf-core/test-datasets

* bump multiqc version

* remove the test data

* updated test data link

* update README

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-12 12:32:39 +01:00
Harshil Patel
5e18e36895
Update and check tests are running for all modules (#420)
* Fixes for PR 371

* Fix tags

* Add md5sum

* Add md5sums for stringtie

* Fix tests for modules where subtool = build

* Fix pytest_software name conflicts

* Change _ to / in test.yml tags

* Nope...that didnt work

* Fix sequenzautils/bam2seqz tests
2021-04-09 13:47:05 +01:00
Yuk Kei Wan
16e8d21477
Nanoplot and sequencing_summary.txt for Nanopore test data (#364)
* add nanoplot

* add Nanopore 100 read subset bam and fastq

* add nanoplot module and sequencing_summary.txt

* Update pytest_software.yml

* Update main.nf

* test directory path

* testing one output file

* Update main.nf

* fix typo

* remove path to test outputs (it worked locally)

* use test_data.config in test

* add meta.yml

* allowing either summary.txt or fastq.gz inputs

* fix linting and test.yml

* test whether github check can catch the output

* add summary.txt test output paths

* check Nanoplot-report.html output
2021-04-09 08:31:38 +01:00
Anthony Underwood
63b3975292
Update gubbins module to use new files (#417)
* new gubbins module

* new gubbins module

* new gubbins module

* Update software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/gubbins/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* Update tests/config/test_data.config

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* new gubbins module

* new gubbins module

* new snpsites module

* new snpsites module

* new snpsites module

* new snpsites module

* adding fasttree module

* correct trailing whitespace

* using sarscov2 as a test dir

* use sars-cov-2 alignment

* remove old test alignment

* new snpsites module

* new snpsites module

* new snpsites module

* updated test file names

* new snpsites module

* remove snpsites from gubbins branch

* remove redundant test_fas alignment

* update md5 sums

Co-authored-by: avantonder <avt@sanger.ac.uk>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: avantonder <ajv37@cam.ac.uk>
2021-04-09 07:54:11 +01:00
Jose Espinosa-Carrasco
f90dde0d88
Add plasmidid module (#410)
* Adding plasmidid module

* Adding test for plasmidid module

* Update plasmidid version to 1.6.4

* Adding contigs.fasta file

* Update tests

* Adding output files on test.yml
2021-04-08 22:44:30 +01:00
Anthony Underwood
be390a25e5
adding fasttree module (#412)
* adding fasttree module

* correct trailing whitespace

* using sarscov2 as a test dir

* remove TODO

* update test data naming

* further test data naming updates

* remove options in favour of $options.args

* ensure non standard exit codes don't cause an issue

* update md5sum
2021-04-08 18:15:23 +01:00
avantonder
710ab76b90
new module: gubbins (#377)
* new gubbins module

* new gubbins module

* new gubbins module

* Update software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/gubbins/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* new gubbins module

* Update tests/config/test_data.config

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/gubbins/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* new gubbins module

* new gubbins module

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-07 16:31:24 +01:00
Robert A. Petit III
c9256616b9
update tests for new config (#384)
* update tests for new config

* fix quast, update more

* more updates!

* more tests updated

* fix tests, added tabix gff3.gz

* Delete main.nf

Removing SPAdes from PR to avoid merge conflicts with https://github.com/nf-core/modules/pull/396

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-04-01 20:22:06 +01:00
Ravneet Bhuller
c5c59d44a1
PR for sequenzautils/bam2seqz (#395)
* added files

* edited files

* removed file

* README.md edited

Co-authored-by: kaurravneet4123 <kaurravneet4123@yahoo.com@users.noreply.github.com>
2021-04-01 07:08:27 +02:00
Nicholas Toda
e526eae472
Add tests for hisat2 (#366)
* initial commit hisat2/build

* initial commit hisat2/build

* changed names for hisat2

* fixed directory structure and args

* added splice site test data

* added splice site inputs

* replaced list with individual args

* fixed removed commas

* added test yml file

* updated hisat2 conda version

* added meta.yml

* added meta.yml description

* added meta.yml inputs

* added meta.yml outputs

* update conda version for hisat2

* removed trailing whitespace meta.yml

* fixed version number for containers

* added test data to test config

* updated for new test logic

* fix pytest issue?

* fix pytest issue

* fixed wrong tool in meta.yaml

* updated tets.yaml name

* handle build bug for testing

* handle build bug for testing in yaml

* moved test folder to fix build bug

* use old hisat2 version to avoid conda giving inconsistent md5sum

* initial commit

* removed temp file

* added meta yaml

* add to pytest

* added tests

* added test yml

* add align meta yaml

* add hisat2 align to pytest

* remove need for splice data by calling process

* add hisat2 align se test

* add hisat2 align pe test

* update names hisat2 align

* update software pytest for using mutiple modules

* remove splice site test data since using module instead

* remove splice site from config since using module instead

* fixed extra brace

* added hisat2 align test.yml

* removed md5sum for bam files

* updated build md5sums

* Apply suggestions from code review

Co-authored-by: Nicholas TODA <nicholas.toda@mnhn.fr>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-03-25 21:08:46 +00:00
Jose Espinosa-Carrasco
17fbdf96cb
Fix some module tests after switching to config data (#386)
* Adding tar.gz kraken2 db to test data

* Update test path files for untar module

* Update test path files for kraken2/run module

* Update test path files for cat/fastq module
2021-03-24 18:28:07 +00:00
Harshil Patel
83a500e6b0
Use a config containing all test data for module tests (#365)
* Re-organise all test data

* Fix ECLint

* Fix ECLint agaaainn

* Now is not the time EClint

* Initial commit for test data config

* Rename test data

* Include test config

* Update indents

* Update test for FastQC via config

* Remove quotes of bottom-level variables

* Use underscores in key names

* Get tests working for fastp

* Remove whitespace at beginning of file
2021-03-24 13:07:37 +00:00