* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Start maltextract module
* start tests
* Get tests working now we have test data
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Changes after review
* Update tests/modules/maltextract/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Update tests/modules/maltextract/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Update tests/modules/maltextract/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
Please enter the commit message for your changes. Lines starting
* adds expansionhunter module
Co-authored-by: Maxime U. Garcia <maxime.garcia@scilifelab.se>
* Specify more guidelines on input channels
* Linting
* Updates based on code review
* Update README.md
* Fix broken sentence
* Add unzip module
* Remove missing TODOs update mtea
* Apply changes after code-review from @grst
* Account for user trying to supply two input archives
* Remove debugging test
* Update modules/unzip/main.nf
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* Correct output path
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
* New last/lastal to align query sequences on a target index
`lastal` is the main program of the [LAST](https://gitlab.com/mcfrith/last)
suite. It align query DNA sequences in FASTA or FASTQ format to a
target index of DNA or protein sequences. The index is produced by
the `lastdb` program (module `last/lastdb`). The score matrix for
evaluating the alignment can be chosen among preset ones or computed
iteratively by the `last-train` program (module `last/train`). For
this reason, the `last/lastal` module proposed here has one input
channel containing an optional file, that has to be dummy when not used.
The LAST aligner outputs MAF files that can be very large (up to
hundreds of gigabytes), therefore this module unconditionally compresses
its output with gzip.
This new module is part of the work described in Issue #464. During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Un-hardcode the path to the LAST index.
Among multiple alternatives I have chosen the following command to
detect the sample name of the index, because it fails in situations
where there is no index files in the index folder, and in situations
were there are two indexes files in the folder. Not failing would
result in feeding garbage information in the INDEX_NAME variable.
basename \$(ls $index/*.bck) .bck
In case of missing file, a clear error message is given by `ls`. In
case of more than one file, the error message of `basename` is more
cryptic, unfortunately. (`basename: extra operand ‘.bck’`)
Alternatives that do not fail if there is no .bck file:
basename $index/*bck .bck
find $index -name '*bck' | sed 's/.bck//'
Alternatives that do not fail if there are more than one .bck file:
basename -s .bck $index/*bck
ls $index/*.bck | xargs basename -s .bck
find $index -name '*bck' | sed 's/.bck//'
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* New module last/mafswap to reorder sequences in alignments
The `maf-swap` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
reorders sequences in alignment files in Multiple Alignment Format.
When run without command-line arguments, it will swap the target and the
query sequences. This is useful when turning a many-to-many alignment
into a many-to-one and then a one-to-one alignment in conjunction with
the `last-split` command (split, swap, split and swap again).
The LAST aligner outputs MAF files, but other tools also use this
format. As MAF files can be very large (up to hundreds of gigabytes),
the module expects its input to be compressed with gzip and will
compress its output.
This new module is part of the work described in Issue #464. During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).
* Update MD5 sum.
Actually, 7029066c27ac6f5ef18d660d5741979a is the MD5 sum of
an empty file compressed with `gzip --no-name`… This happened
because I forgot to update the config file after correcting the
module… sorry !
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Change name as suggested in pull request.
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* New last/train module to train alignment parameters.
The last-train command creates a parameter file that
will be used by last/lastal module for sequence alignment.
It takes indexed sequences and query sequences as input
and we use the metadata of both to create an id of the
parameter output file.
Submission of the LAST modules is discussed in more
details in the issue #464. For consistancy, we use LAST
version 1219 for this whole development and will upgrade later.
* Corrected files according to the nf-core v1.14 standards.
* Fixed function.nf file for the last-train module.
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Find index name.
* Correct after the input channels were changed.
* Use double underscore as a name separator.
Single underscores can happen in ids, therefore, we would like to keep two underscores.
* Remove extra spaces.
* Fixed the passing of the "score matrix" line.
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Update software/last/train/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Add new human data and fix sarscov paths
* Fix filename typo
* Apply code review
* replace index with to match sarscov data
* lower case
* indent everythin
* Adapt sarscov keys to new naming convention
* Update test_data.config
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Adding bigWig and deeptools computeMatrix files to config
* Adding meta.yml for deeptools modules
* Add test for deeptools modules
* Fixing and reordering tags
* Fixing conda test that worked in local...
* Apply suggestions from code review
* Changing bigwig file pattern to include bigwig extension
* Saving after last change is a good practice
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Resolve suggests after PR review
* add newline to functions
* need variable interpolation using double quotes; remove unnecessary tag
* add a more resilient link to raw github files
* remove trailing slash
* Update software/iqtree/main.nf
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Add abacas module
* Add test for abacas module
* Add Harshil to authorship
* Updating test with the data uploaded to nf-core/datasets
* Apply suggestions from code review
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* initial 'modules create' of minia
* fixed tests
* finished meta.yml
* fixed filters.yml
* resolved issues in pytest_software.yml
* add newline
* Update software/minia/main.nf
* fixing a bunch of module tests
* remove vscode
* fixed minia
* move test data directory to nf-core/test-datasets
* bump multiqc version
* remove the test data
* updated test data link
* update README
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* adding fasttree module
* correct trailing whitespace
* using sarscov2 as a test dir
* remove TODO
* update test data naming
* further test data naming updates
* remove options in favour of $options.args
* ensure non standard exit codes don't cause an issue
* update md5sum
* initial commit hisat2/build
* initial commit hisat2/build
* changed names for hisat2
* fixed directory structure and args
* added splice site test data
* added splice site inputs
* replaced list with individual args
* fixed removed commas
* added test yml file
* updated hisat2 conda version
* added meta.yml
* added meta.yml description
* added meta.yml inputs
* added meta.yml outputs
* update conda version for hisat2
* removed trailing whitespace meta.yml
* fixed version number for containers
* added test data to test config
* updated for new test logic
* fix pytest issue?
* fix pytest issue
* fixed wrong tool in meta.yaml
* updated tets.yaml name
* handle build bug for testing
* handle build bug for testing in yaml
* moved test folder to fix build bug
* use old hisat2 version to avoid conda giving inconsistent md5sum
* initial commit
* removed temp file
* added meta yaml
* add to pytest
* added tests
* added test yml
* add align meta yaml
* add hisat2 align to pytest
* remove need for splice data by calling process
* add hisat2 align se test
* add hisat2 align pe test
* update names hisat2 align
* update software pytest for using mutiple modules
* remove splice site test data since using module instead
* remove splice site from config since using module instead
* fixed extra brace
* added hisat2 align test.yml
* removed md5sum for bam files
* updated build md5sums
* Apply suggestions from code review
Co-authored-by: Nicholas TODA <nicholas.toda@mnhn.fr>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
* Adding tar.gz kraken2 db to test data
* Update test path files for untar module
* Update test path files for kraken2/run module
* Update test path files for cat/fastq module
* Re-organise all test data
* Fix ECLint
* Fix ECLint agaaainn
* Now is not the time EClint
* Initial commit for test data config
* Rename test data
* Include test config
* Update indents
* Update test for FastQC via config
* Remove quotes of bottom-level variables
* Use underscores in key names
* Get tests working for fastp
* Remove whitespace at beginning of file