* initial commit [ci skip]
* reuse the modules code from nf-core/mag [ci skip]
* add contextual information for the module [ci skip]
* add stubs to avoid downloading db [ci skip]
* trigger test
* iterate on tests [ci skip]
* itereate tests [ci skip]
* add bins [ci skip]
* fix stubs [ci skip]
* interation on tests with stubs [ci skip]
* use the existing pattern and fasta for input
* accomodate the new version file format
* use variable for the stub [ci skip]
* update the versions file in meta.yml
* Accomodate code review regarding publishDir function [ci skip]
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* remove extra newline
* use bioconda channel
* update the description for filtered file
* Apply suggestions from code review
* Update main.nf
* Update main.nf
* Update modules/gtdbtk/classifywf/meta.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Robert A. Petit III <robbie.petit@gmail.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* added classic mlst module
* removed nf-core TODO comments
* included drpatelh suggestions
* adjust version capture identation
* update main to pass lint
* follow output expected by test.yml
* suggested prefix change from rpetit3
* Apply suggestions from code review
Co-authored-by: Gregor Sturm <mail@gregor-sturm.de>
Co-authored-by: Robert A. Petit III <robbie.petit@gmail.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Start work, continue once on non-mobile internet
* finished and working on conda
* Update modules/amps/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Apply suggestions from code review
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* 📦 NEW: Add module lima
* 👌 IMPROVE: Move .pbi output to reports channel
* 🐛 FIX: Fix report channel definition
* 👌IMPROVE; Remove options from command line
update test script with removed options
* 👌 IMPROVE: Add some pacbio test files
* 🐛 FIX: Add Pacbio index to test_data.config
* 👌 IMPROVE: Re add 10000 data test
* 🐛 FIX: Add pbi input
* 👌 IMPROVE: Add parallelization to lima
* 👌 IMPROVE: Add some pbindex
* 🐛 FIX: Add pbi extension to files
* 👌 IMPROVE: The accept one channel (primers move into the first channel)
* 👌 IMPROVE: Assign a value channel for pimers
Improve code workflow readability
* 👌 IMPROVE: Update .gitignore
* 👌 IMPROVE: Update module to last template version
* 🐛 FIX: Correct Singularity and Docker URL
* 👌 IMPROVE: Update to the last version of modules template
* 👌 IMPROVE: Update test_data.config
* 👌 IMPROVE: Remove pbi from input files
* 👌 IMPROVE: Final version of test datasets config
* 👌 IMPROVE: Remove useless index + Fix Typos
* 🐛 FIX: Fill contains args
* 📦 NEW: Add module lima
* 👌 IMPROVE: Move .pbi output to reports channel
* 🐛 FIX: Fix report channel definition
* 👌IMPROVE; Remove options from command line
update test script with removed options
* 🐛 FIX: Add pbi input
* 👌 IMPROVE: Add parallelization to lima
* 👌 IMPROVE: Add some pacbio test files
* 🐛 FIX: Add Pacbio index to test_data.config
* 👌 IMPROVE: Re add 10000 data test
* 👌 IMPROVE: Add some pbindex
* 🐛 FIX: Add pbi extension to files
* 👌 IMPROVE: The accept one channel (primers move into the first channel)
* 👌 IMPROVE: Assign a value channel for pimers
Improve code workflow readability
* 👌 IMPROVE: Update .gitignore
* 👌 IMPROVE: Update module to last template version
* 🐛 FIX: Correct Singularity and Docker URL
* 👌 IMPROVE: Update to the last version of modules template
* 👌 IMPROVE: Update test_data.config
* 👌 IMPROVE: Remove pbi from input files
* 👌 IMPROVE: Final version of test datasets config
* 👌 IMPROVE: Remove useless index + Fix Typos
* 🐛 FIX: Fill contains args
* 👌 IMPROVE: Add channel for each output
* 👌 IMPROVE: Remove comments
* 📦 NEW: Add module lima
* 👌 IMPROVE: Move .pbi output to reports channel
* 🐛 FIX: Fix report channel definition
* 👌IMPROVE; Remove options from command line
update test script with removed options
* 🐛 FIX: Add pbi input
* 👌 IMPROVE: Add parallelization to lima
* 👌 IMPROVE: Add some pacbio test files
* 🐛 FIX: Add Pacbio index to test_data.config
* 👌 IMPROVE: Re add 10000 data test
* 👌 IMPROVE: Add some pbindex
* 🐛 FIX: Add pbi extension to files
* 👌 IMPROVE: The accept one channel (primers move into the first channel)
* 👌 IMPROVE: Assign a value channel for pimers
Improve code workflow readability
* 👌 IMPROVE: Update module to last template version
* 🐛 FIX: Correct Singularity and Docker URL
* 👌 IMPROVE: Update to the last version of modules template
* 👌 IMPROVE: Update test_data.config
* 👌 IMPROVE: Remove pbi from input files
* 🐛 FIX: Fill contains args
* 📦 NEW: Add module lima
* 👌 IMPROVE: Move .pbi output to reports channel
* 🐛 FIX: Fix report channel definition
* 👌IMPROVE; Remove options from command line
update test script with removed options
* 🐛 FIX: Add pbi input
* 👌 IMPROVE: Add parallelization to lima
* 👌 IMPROVE: Add some pacbio test files
* 🐛 FIX: Add Pacbio index to test_data.config
* 👌 IMPROVE: Re add 10000 data test
* 👌 IMPROVE: Add some pbindex
* 🐛 FIX: Add pbi extension to files
* 👌 IMPROVE: The accept one channel (primers move into the first channel)
* 👌 IMPROVE: Assign a value channel for pimers
Improve code workflow readability
* 👌 IMPROVE: Update module to last template version
* 🐛 FIX: Correct Singularity and Docker URL
* 👌 IMPROVE: Update to the last version of modules template
* 👌 IMPROVE: Update test_data.config
* 👌 IMPROVE: Remove pbi from input files
* 👌 IMPROVE: Final version of test datasets config
* 👌 IMPROVE: Remove useless index + Fix Typos
* 🐛 FIX: Fill contains args
* 👌 IMPROVE: Add channel for each output
* 👌 IMPROVE: Remove comments
* 🐛 FIX: Clean test_data.config
* Update modules/lima/main.nf
Add meta to each output
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Update modules/lima/main.nf
Remove useless parenthesis
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* 🐛 FIX: Keep version number only
* 🐛 FIX: Reintegrate prefix variable and use it to define output file name
* 👌 IMPROVE: add suffix arg to check output files names
* 👌 IMPROVE: Use prefix for output filename
* 🐛 FIX: Set optional output
Allow usage of different input formats
* 👌 IMPROVE: Update meta file
* 👌 IMPROVE: Update test
One test for each input file type
* 👌 IMPROVE: add fasta, fastq.gz, fastq, fastq.gz test files
* 👌 IMPROVE: Update with last templates / Follow new version.yaml rule
* 🐛 FIX: Fix typos and include getProcessName function
* 👌 IMPROVE: Update .gitignore
* 👌 IMPROVE: Using suffix to manage output was not a my best idea
Add a bash code to detect extension and update output file name
* 👌 IMPROVE: clean code
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Gregor Sturm <mail@gregor-sturm.de>
Co-authored-by: Mahesh Binzer-Panchal <mahesh.binzer-panchal@nbis.se>
* initial commit [ci skip]
* remove todo from the module files [ci skip]
* add a sample test case [ci skip]
* push the latest work [ci skip]
* bump kleborate build
* test passing with the new build for kleborate [ci skip]
* ready for review
* Apply suggestions from code review
Co-authored-by: Robert A. Petit III <robbie.petit@gmail.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Start maltextract module
* start tests
* Get tests working now we have test data
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Changes after review
* Update tests/modules/maltextract/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Update tests/modules/maltextract/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Update tests/modules/maltextract/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* bbmap/align done
* Tests for single end and prebuilt index
* Write bam file directly
* Forgot to use all cpus for bbmap
* Test md5sums
* Added pigz support
* Update modules/bbmap/align/meta.yml
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
* process_medium and fastq
* cat/cat module
* Remove filter from CAT_CAT
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* adding plink module using nf-core tool [ci skip]
* Restructures the project for plink/vcf (#1)
* Add version string for plink
* Create a plink/vcf module
* small tweaks on main.nf and started to test [ci skip]
* small changes on test args, local test with docker passed!
* Update plink/vcf module listing
* Update tag
* fix tags as per linting guidelines
* revert to the original state of tags
* adding --threads to `main.nf` and `meta.yml` information
Co-authored-by: Abhinav Sharma <abhi18av@users.noreply.github.com>
* mutect2 files added, first draft of module code entered
* removing comment from main.nf
* removing comment from main.nf
* test added, commit made before editing yaml
* tests added, still needs final check and info/comments added
* gatk4 version changed to gatk4=4.2.0.0
* multiple sample support added, information added to module yaml file
* Update meta.yml
fixed linting error
* add keywords to meta.yml
* Corrections made to meta.yml
* removed whitespace from meta.yml
Co-authored-by: Gavin.Mackenzie <gavin.mackenzie@nibsc.org>
Co-authored-by: Maxime U. Garcia <max.u.garcia@gmail.com>
* first commit, added template files for new module
* created getpileupsummaries script, tests and both yml files
* fixed typo in meta.yml
* Update modules/gatk4/getpileupsummaries/meta.yml
changed gz_tbi to gz.tbi as suggested
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
Co-authored-by: GCJMackenzie <gavin.mackenzie@nibsc.org>
Co-authored-by: Maxime U. Garcia <max.u.garcia@gmail.com>
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
* feat(homer): Add initial makeTagDirectory
* feat(homer): Add initial findPeaks module
* feat(homer): Update with new options
See 1d30e2c21a
* fix(homer): Correct findpeaks process name
* fix(homer): Takes a bam file instead of bed
* feat(homer): Add initial makeTagDirectory test
* fix(homer): Hardcode genome and configureHomer
I'd like to modularize configureHomer, but I need to figure out how
exactly the genomes work.
* fix(homer): bam => bed
Bam requires samtools to be present, which it's not in this docker image
* feat(homer): Add initial configureHomer script
* ci(homer): Add initial test
* test(homer): Reproducible configuration workaround
- I can't run both tests(one file and two files) at the same time because it breaks
- I can't copy the genome stuff from the configurehomer module because it's read only
- So I can't make the makeTagDirectory module depend on configureHomer
* test(homer): Add placeholder annotatepeaks
The required inputs are necessarily required for all workflows from what
I've used, but I'll need to look at the actual docs
* test(homer): Add missing B.bed
* test(homer): Rename two => groseq
Then all of the various workflows that homer provides can be e2e tested
* feat(homer): Add initial makeUCSCfile module
* test(homer): Add start to makeUCSCfile testing
* chore(homer): Add various cleanups
* test(homer): Rewrite annotatepeaks
Not passing yet
* test(homer): Rewrite configurehomer
* test(homer): Rewrite findpeaks
Still failing
* test(homer): Rewrite makeucscfile
Not passing yet
* test(homer): Rewrite maketagdirectory
All homer modules now follow the new structure. Time to make them pass.
* test(homer): Fix typo for workflow name
* fix(homer): Use correct container
* fix(homer): Accept fasta in maketagdirectory
Apparently all of the homer stuff can just take any old fasta and you
don't need to configure the genome ahead of time with configureHomer
* test(homer): makeTagDirectory passes now
* fix(homer): Update containers in makeucscfile
* test(homer): Rewrite makeucscfile
Takes input from maketagdirectory which is how the module should be used
* fix(homer): Update makeUCSCFile bedgraph path
* test(homer): Update makeucscfile expected output
* fix(homer): Update containers in findpeaks
* fix(homer): Change findpeaks args
The user is just going to have to know what they're doing for now
* test(homer): findPeaks rewrite with tagDir input
* test(homer): Update expected files for findPeaks
And bump filters
* style: Appease editorconfig
* ci: Remove old workflow
* tests(homer): Add md5sums
* test(homer): Add meta test
* style(homer): Capitalize HOMER
* docs(homer): Add maketagdirectory meta.yml
* docs(homer): Add makeucscfile meta.yml
* docs(homer): Add findpeaks meta.yml
* test(homer): Update to new test data standards
* chore: Remove stuff that got revived in the rebase
* chore: software => modules
* test(homer): Update tags
* test(homer): Update annotatepeaks
* ci: Fix uploading of artifacts
GitHub actions doesn't like the / in the tags
* test(homer): Remove annotate md5sum
This is failing and breaking new tests
* test(homer): Use bams instead of beds
* test(homer): Fix meta maketagdirectory
* test(homer): Fix input in all tests
* test(homer): Move back to bed files
Forgot samtools isn't present
* chore(homer): Add TODOs for tests
* test(homer): Add bed format arg
* test(homer): Update md5sums
* test(homer): Fix tags tsvs
* style(homer): Appease nf-core linting
* docs(homer): Be in line with what is in the main.nf file
Co-authored-by: Kevin Menden <kevin.menden@live.com>
Co-authored-by: Kevin Menden <kevin.menden@live.com>
Please enter the commit message for your changes. Lines starting
* adds expansionhunter module
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
* Initial draft [ci skip]
* trigger first test
* update output file path
* Tests passing
* finishing touches for meta.yml and update checksum
* tweak checksum
* add threads to the module
* skip version info for matching test md5sum [ci skip]
* Add ref fasta and finalize the module
Co-authored-by: Gregor Sturm <mail@gregor-sturm.de>
* draft for bcftools modules [ci skip]
* initial test for bcftools concat
* Update the params for testing
* fix tests
* Accomodate code review [ci skip]
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
* Update the meta file and open PR for review
* Update the keyword
* Update the tags for module [ci skip[
* add threads
Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Add MALT with incomplete tests
* Parameter typo fix
* Clean up test yaml
* Finish MALT module prior UNZIP and MALT_BUILD modiules
* Add required modules for tests
* Sync test out with malt-build
* Fix input parameters in tests based on final build module
* Update modules/malt/run/meta.yml
Co-authored-by: Gregor Sturm <mail@gregor-sturm.de>
Co-authored-by: Gregor Sturm <mail@gregor-sturm.de>
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Add bwa/aln module
* Also output reads as required with SAI
* Add sampe
* Fix container paths
* Update based on code review from @grst
* Update input docs
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Add bwa/aln module
* Also output reads as required with SAI
* Add bwa samse
* Fix container paths
* remove TODO comment
* Updated based on code from from @grst on bwa/sampe
* Clarify output docs
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Add unzip module
* Remove missing TODOs update mtea
* Apply changes after code-review from @grst
* Account for user trying to supply two input archives
* Remove debugging test
* Update modules/unzip/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Correct output path
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Updated the version of STAR in align and genomegenerate modules
* Changes in test.yml
* Changes in test.yml
* Added module arriba
* Changes in test configs
* Added module Arriba for fusion detection
* Fixed review comments
* Added an output option for discarded fusions
* Resolved some conflits
* conflicts
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Update README
* Rename pytest_software.yml to pytest_modules.yml
* Rename main software directory to modules
* Remove deprecated modules
* Rename tests software to modules
* Replace paths for tests in pytest_modules.yml
* Replace software with modules in Github Actions
* Replace software with modules in main.nf tests
* Rename software to modules in test.yml
* testing works but need to fix paths and add exclude list
* edited meta file and removed TODOs
* left over TODOs
* test passed, final push
* resolving initial comments
* Update software/delly/call/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* created templates
* updated functions.nf to the one on dev
* Created test main
* Fasta added and docker, conda, singularity passing
* Fixed output version lint
* Created module template (#558)
* Created meta and implemented main (#558)
* Conda test and lint passing (#558)
* Updated functions.nf to the one on dev (#558)
* Update software/lofreq/filter/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* add software/pairtools
* create a branch for pairtools/select
* fix the different output of conda and docker.
* rmove the md5sum for gzip files.
* Update software/pairtools/select/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* update test file and documentation.
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* hifiasm copied from fastqc
* hifiasm tests init from fastqc
* meta.yml init; test.yml and main.nf for printing version
* Add hifiasm version printing
* Removed spaced on an empty line
* Reverted hifiasm from main
* hifiasm copied from fastqc
* hifiasm tests init from fastqc
* meta.yml init; test.yml and main.nf for printing version
* Add hifiasm version printing
* Removed spaced on an empty line
* Reverted hifiasm from main
* Added seqtk/subseq and checking for seed in seqtk/sample
* Separate authors in software/seqtk/sample/meta.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Align commans in output channesl software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Define prefix in software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Use prefix in output file name software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Define suffix in options in tests/software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Change output file name in tests/software/seqtk/subseq/test.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Remove a to-do point from tests/software/seqtk/subseq/test.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Added --no-name into gzip commands
* Update samtools from 1.10 to 1.12 (#530)
* feat: remove social preview image to use GitHub OpenGraph
* feat: update samtools from 1.10 to 1.12
* fix: CI tests
* fix: add meta.yml file for samtools/merge
* Update software/samtools/merge/meta.yml
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Update software/samtools/merge/meta.yml
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* hifiasm copied from fastqc
* hifiasm tests init from fastqc
* meta.yml init; test.yml and main.nf for printing version
* Add hifiasm version printing
* Removed spaced on an empty line
* Reverted hifiasm from main
* Added seqtk/subseq and checking for seed in seqtk/sample
* hifiasm copied from fastqc
* hifiasm tests init from fastqc
* meta.yml init; test.yml and main.nf for printing version
* Add hifiasm version printing
* Removed spaced on an empty line
* Reverted hifiasm from main
* Separate authors in software/seqtk/sample/meta.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Align commans in output channesl software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Define prefix in software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Use prefix in output file name software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Define suffix in options in tests/software/seqtk/subseq/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Change output file name in tests/software/seqtk/subseq/test.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Remove a to-do point from tests/software/seqtk/subseq/test.yml
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Added --no-name into gzip commands
* Replaced functions.nf in seqtk/subseq
* Refreshed tests for sample and subseq
* Corrected paired-end test and YAML description for sample
Co-authored-by: Sviatoslav Sidorov <sviatoslav.sidorov@crick.ac.uk>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Maxime U. Garcia <max.u.garcia@gmail.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* add software/pairtools
* create a branch for pairtools/restrict
* fix the different output of conda and docker
* remove customized code.
* add newline to Frag.bed file.
* change the folder of frag.bed.
* change \n to \r\n
* Remove work.frag.bed
Co-authored-by: JoseEspinosa <kadomu@gmail.com>
* New last/mafconvert module to convert MAF alignments.
The `maf-convert` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
reads alignmnts in [MAF](https://genome-asia.ucsc.edu/FAQ/FAQformat.html#format5)
format and converts them in another format (axt, blast, blasttab, chain,
gff, html, psl, sam, tab).
This new module is part of the work described in Issue #464. During this
development, we fix the versiob of LAST to 1219 to ensure consistency.
We will upgrade it later.
* Delete white space.
* Update the function.nf file to the dev version.
The `last-postmask` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
filters alignments in a MAF file to remove those with too many masked
(lower-case) positions compared with their score.
As other filter modules like `last/split`, its output file risks to
overwrite its input file as their names are constructed from the sample
ID when multiple filters are chained in the pipeline. I added a check
that gives a clearer error message in this case. Please let me know
what you think about; I can add this test to the existing LAST modules
as well.
This new module is part of the work discribed in Issue #464. During this
development, we fix the version of LAST to 1219 to ensure consistency.
We will upgrade it later.
* New last/dotplot module for pairwise similarity plots
The `last-dotplot` tool takes a pairwise alignment in
[MAF](http://genome.ucsc.edu/FAQ/FAQformat.html#format5) format,
possibly compressed with gzip, or in a tabular format produced by the
`maf-convert` tool, and produces a similarity dot-plot of the two
sequences in one of the graphical formats supported by the Python
Imaging Library.
A the tool guesses the output format by the file extension of the file,
which is constructed by the module at run time, I have used the `args2`
option to convey this information to the module.
This new module is part of the work described in Issue #464. During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence please ignore lint's version warning).
* Update the functions.nf file to the dev branch.
https://raw.githubusercontent.com/nf-core/tools/dev/nf_core/module-template/software/functions.nf
* add software/pairtools
* create a branch for pairtools/sort
* fix the different output of conda and docker.
* remove customized code.
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* add software/pairtools
* create a branch for pairtools/parse
* fix the issue of bioconda output is different from docker.
* remove customized code from test.
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* add software/pairtools
* create a branch for pairtools/flip
* fix the issue of PG line in output
* remove custom code from test.
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* New last/lastal to align query sequences on a target index
`lastal` is the main program of the [LAST](https://gitlab.com/mcfrith/last)
suite. It align query DNA sequences in FASTA or FASTQ format to a
target index of DNA or protein sequences. The index is produced by
the `lastdb` program (module `last/lastdb`). The score matrix for
evaluating the alignment can be chosen among preset ones or computed
iteratively by the `last-train` program (module `last/train`). For
this reason, the `last/lastal` module proposed here has one input
channel containing an optional file, that has to be dummy when not used.
The LAST aligner outputs MAF files that can be very large (up to
hundreds of gigabytes), therefore this module unconditionally compresses
its output with gzip.
This new module is part of the work described in Issue #464. During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Un-hardcode the path to the LAST index.
Among multiple alternatives I have chosen the following command to
detect the sample name of the index, because it fails in situations
where there is no index files in the index folder, and in situations
were there are two indexes files in the folder. Not failing would
result in feeding garbage information in the INDEX_NAME variable.
basename \$(ls $index/*.bck) .bck
In case of missing file, a clear error message is given by `ls`. In
case of more than one file, the error message of `basename` is more
cryptic, unfortunately. (`basename: extra operand ‘.bck’`)
Alternatives that do not fail if there is no .bck file:
basename $index/*bck .bck
find $index -name '*bck' | sed 's/.bck//'
Alternatives that do not fail if there are more than one .bck file:
basename -s .bck $index/*bck
ls $index/*.bck | xargs basename -s .bck
find $index -name '*bck' | sed 's/.bck//'
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* New last/split module to find split alignments.
The `last-split` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
finds split or spliced alignments in a MAF file that is produced with, for
example, LAST `lastal` command.
This new module is part of the work discribed in Issue #464. During this
development, we fix the versiob of LAST to 1219 to ensure consistency. We will
upgrade it later.
* Update software/last/split/main.nf
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* add additional ucsc tools
* Update software/ucsc/wigtobigwig/meta.yml
Co-authored-by: Kevin Menden <kevin.menden@live.com>
* Update the functions.nf and software name for ucsc/wigtobigwig and bigwigaverageoverbed.
Co-authored-by: Kevin Menden <kevin.menden@live.com>
* New module last/mafswap to reorder sequences in alignments
The `maf-swap` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
reorders sequences in alignment files in Multiple Alignment Format.
When run without command-line arguments, it will swap the target and the
query sequences. This is useful when turning a many-to-many alignment
into a many-to-one and then a one-to-one alignment in conjunction with
the `last-split` command (split, swap, split and swap again).
The LAST aligner outputs MAF files, but other tools also use this
format. As MAF files can be very large (up to hundreds of gigabytes),
the module expects its input to be compressed with gzip and will
compress its output.
This new module is part of the work described in Issue #464. During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).
* Update MD5 sum.
Actually, 7029066c27ac6f5ef18d660d5741979a is the MD5 sum of
an empty file compressed with `gzip --no-name`… This happened
because I forgot to update the config file after correcting the
module… sorry !
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Change name as suggested in pull request.
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* New last/train module to train alignment parameters.
The last-train command creates a parameter file that
will be used by last/lastal module for sequence alignment.
It takes indexed sequences and query sequences as input
and we use the metadata of both to create an id of the
parameter output file.
Submission of the LAST modules is discussed in more
details in the issue #464. For consistancy, we use LAST
version 1219 for this whole development and will upgrade later.
* Corrected files according to the nf-core v1.14 standards.
* Fixed function.nf file for the last-train module.
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Find index name.
* Correct after the input channels were changed.
* Use double underscore as a name separator.
Single underscores can happen in ids, therefore, we would like to keep two underscores.
* Remove extra spaces.
* Fixed the passing of the "score matrix" line.
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Update software/last/train/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* added intervallisttools module
* add intervallisttools module
* arguments are now supplied using options.args
* removed java heapsize settings
* changes in main.nf and it is tested
* comment added
* Update software/gatk4/intervallisttools/meta.yml
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
* Update tests/software/gatk4/intervallisttools/test.yml
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
* review comment on tags in bedtointerval
* modified the test to get input from bedtointerval module
* Update software/gatk4/intervallisttools/meta.yml
* Apply suggestions from code review
Co-authored-by: Kevin Menden <kevin.menden@live.com>
* Apply suggestions from code review
* Update tests/config/pytest_software.yml
Co-authored-by: Kevin Menden <kevin.menden@live.com>
* Apply suggestions from code review
* Apply suggestions from code review
Co-authored-by: @praveenraj2018 <praveen.raj.somarajan@ki.se>
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
Co-authored-by: Kevin Menden <kevin.menden@live.com>
* Bump pangolin version
* Add nextclade to software list
* Add nextclade module
* Update md5sum for Pangolin due to version bump
* Adding some URL to meta.yml
* Adding new line at end of file
Co-authored-by: JoseEspinosa <kadomu@gmail.com>
* New last/lastdb module to index sequences before alignment.
The `lastdb` command creates a sequence index for the LAST aligner
(https://gitlab.com/mcfrith/last). Input can be in FASTA or FASTQ
format, and compression is handled automagically. DNA or protein
sequences can be indexed.
The sequence index is a collection of files sharing the same basename.
This module sets the basename to the sample identifier (`$meta.id`) and
creates the index in a directory always called `lastdb`. The module's
output channel then conveys a copy of the metadata and the path to the
`lastdb` directory.
Other modules will follow (see Issue #464). The LAST aligner can align
proteins to proteins, DNA to DNA and can translate DNA align to
proteins.
* Remove trailing whitespace.
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Update as suggested in PR.
* Attempt to pass linting.
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>