Commit graph

1639 commits

Author SHA1 Message Date
Maxime U. Garcia
c8877835c2
fix: update to gatk4.2.0.0 (#579) 2021-07-16 11:24:29 +02:00
Maxime U. Garcia
e7be649029
fix: update to gatk4.2.0.0 (#578) 2021-07-16 11:23:08 +02:00
Maxime U. Garcia
9fb21e1a84
fix: update to gatk4.2.0.0 (#577) 2021-07-16 11:22:01 +02:00
Maxime U. Garcia
bee23ba899
fix: update to gatk4.2.0.0 (#576) 2021-07-16 11:18:27 +02:00
Harshil Patel
e937c7950a
Rename software/ directory to modules/ to re-organise module structure (#567)
* Update README

* Rename pytest_software.yml to pytest_modules.yml

* Rename main software directory to modules

* Remove deprecated modules

* Rename tests software to modules

* Replace paths for tests in pytest_modules.yml

* Replace software with modules in Github Actions

* Replace software with modules in main.nf tests

* Rename software to modules in test.yml
2021-07-07 10:10:18 +01:00
Svyatoslav Sidorov
95dfa7bc8d
Enhance bedtools (#557)
* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* Generalized intersect and genomecov

* Finished subtract and tests for intersect and genomecov

* Finished up meta YAMLs

* Replaced functions.nf in subtract

* do not add "." in prefix when suffix is set (#550)

when using the suffix option in the fastqc module it gets added with a "." to the ${meta.id}, this leads to sample name clashing when passing the fastqc output of raw and processed files suffixed with "trim" or "clean" to multiqc.
I suggest to remove the "." this would also be more consistent with the prefix def in most of the other nf-core modules.

##' Ref:
https://nfcore.slack.com/archives/CJRH30T6V/p1625129138043900

* Modules TLC (#551)

* Modules TLC

* Fix all the tests

* Fix linting failures (#552)

* Modules TLC

* Fix all the tests

* Fix linting failures

* Soft-link build files to bypass lint error

* Update software/rseqc/junctionannotation/main.nf

* Fix rsem (#555)

* bump bcftools filter

* fix bcftools

* rsem/preparereference

* added calculateexpression

* meta.yml and pytest_software

* paired end

* add test.yml

* fix rseqc/junctionsaturation (#553)

* bump bcftools filter

* junctionsaturation meta.yml and test

* pytest config entry

* bump bcftools version down again

* fix test

* fix bcftools

* Adapt linting action (#556)

* bump bcftools filter

* fix bcftools

* adapt module linting

* Update software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/intersect/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/subtract/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/subtract/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/subtract/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/bedtools/subtract/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/bedtools/genomecov/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/bedtools/intersect/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/bedtools/intersect/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Fixed variable names in intersect and genomecov

Co-authored-by: Sviatoslav Sidorov <sviatoslav.sidorov@crick.ac.uk>
Co-authored-by: riederd <dietmar.rieder@i-med.ac.at>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Kevin Menden <kevin.menden@t-online.de>
2021-07-07 09:44:09 +01:00
Maxime U. Garcia
e0d820f9f4
Add snpeff (#546)
* feat: add snpeff

* fix: linting

* fix: tests

* fix: add csv output

* fix: add params information

* fix: improve script

* Update software/snpeff/environment.yml

* Update software/snpeff/environment.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

* fix test

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-06 13:40:55 +01:00
Maxime U. Garcia
7b4a28b6e8
feat: add VEP module (#547)
* feat: add VEP module

* fix: name

* fix: EC lint]

* feat: add info about params

* fix: params as params, not input

* fix: improve script

* Update software/ensemblvep/environment.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Apply suggestions from code review

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-06 14:11:11 +02:00
Mei Wu
3f4eccbf0f
Delly/call (#565)
* testing works but need to fix paths and add exclude list

* edited meta file and removed TODOs

* left over TODOs

* test passed, final push

* resolving initial comments

* Update software/delly/call/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-06 12:54:14 +01:00
Johnathan D
79f3a55f6e
Lofreq call (#564)
* created templates

* updated functions.nf to the one on dev

* Created test main

* Fasta added and docker, conda, singularity passing

* Fixed output version lint
2021-07-06 10:41:00 +01:00
Ravneet Bhuller
42adce7af1
Pull request for module: lofreq/callparallel (#562)
* first commit

* updated file

* edited files

* Update test.yml

* Update software/lofreq/callparallel/main.nf

* Update test.yml

* Update tests/software/lofreq/callparallel/test.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-06 09:51:38 +01:00
Charles Plessy
45f2f1ee5f
Input a triple (id, fasta, params) to last/lastal (#563)
The `last/lastal` submodule takes query sequences to align to a target
index, and optionally takes one set of alignment parameters (including a
score matrix) computed by the `last/train` module for each of the
sequences.

In the previous implementation the sequences and the alignment
parameters were provided in different channels, causing them to be
sometimes desynchronised.

In the patched implementation, `last/lastal` takes a 3-tuple as
input to ensure synchronicity.  To produce this tuple in a pipeline,
one can use the `join` command as in the following example.

     LAST_TRAIN  ( query,
                   target )
     LAST_LASTAL ( query.join(LAST_TRAIN.out.param_file),
                   target )

In case no parameter file is computed one can pass a dummy file
to the module as follows:

     LAST_LASTAL ( query.map { row -> [ row[0], row[1], [] ] },
                   target )
2021-07-06 09:35:04 +01:00
Johnathan D
fbab00238f
Lofreq filter (#559)
* Created module template (#558)

* Created meta and implemented main (#558)

* Conda test and lint passing (#558)

* Updated functions.nf to the one on dev (#558)

* Update software/lofreq/filter/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-05 10:24:45 +01:00
Ravneet Bhuller
cd8bacc90a
PR for lofreq/indelqual module (#539)
* first commit

* edited functions.nf

* edited functions.nf

* version line updated

* Update software/lofreq/indelqual/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/lofreq/indelqual/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/lofreq/indelqual/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/lofreq/indelqual/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/lofreq/indelqual/test.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update tests/software/lofreq/indelqual/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* updated files

* updated file

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-07-05 10:17:57 +01:00
Kevin Menden
2395734492
Adapt linting action (#556)
* bump bcftools filter

* fix bcftools

* adapt module linting
2021-07-02 14:15:32 +02:00
Kevin Menden
b59e8054e4
fix rseqc/junctionsaturation (#553)
* bump bcftools filter

* junctionsaturation meta.yml and test

* pytest config entry

* bump bcftools version down again

* fix test

* fix bcftools
2021-07-02 13:31:09 +02:00
Kevin Menden
fce4e60864
Fix rsem (#555)
* bump bcftools filter

* fix bcftools

* rsem/preparereference

* added calculateexpression

* meta.yml and pytest_software

* paired end

* add test.yml
2021-07-02 13:09:48 +02:00
Harshil Patel
b5f6a1da79
Fix linting failures (#552)
* Modules TLC

* Fix all the tests

* Fix linting failures

* Soft-link build files to bypass lint error

* Update software/rseqc/junctionannotation/main.nf
2021-07-01 19:50:14 +01:00
Harshil Patel
b3278a1e63
Modules TLC (#551)
* Modules TLC

* Fix all the tests
2021-07-01 16:13:01 +01:00
riederd
601bf5eb64
do not add "." in prefix when suffix is set (#550)
when using the suffix option in the fastqc module it gets added with a "." to the ${meta.id}, this leads to sample name clashing when passing the fastqc output of raw and processed files suffixed with "trim" or "clean" to multiqc.
I suggest to remove the "." this would also be more consistent with the prefix def in most of the other nf-core modules.

##' Ref:
https://nfcore.slack.com/archives/CJRH30T6V/p1625129138043900
2021-07-01 11:14:58 +01:00
James A. Fellows Yates
394273f173
damageprofiler (#545)
* Add damageprofiler module

* Fix tests

* Bump container hopefully with font fix

* Following code review
2021-06-30 09:46:24 +02:00
MGordon09
af4b8814b8
update muscle (#542) 2021-06-28 08:20:02 +02:00
Maxime U. Garcia
81aba5734c
Add modules bgziptabix (#544)
* feat: add modules bgziptabix

* fix: add module to config

* fix: lint

* fix: tests
2021-06-25 10:52:02 +02:00
MGordon09
4d711a1428
Add metaphlan3 module (#543)
* Add metaphlan3 module

* remove whitespace
2021-06-25 08:15:15 +02:00
JIANHONG OU
e0b7952b56
Pairtools select (#523)
* add software/pairtools

* create a branch for pairtools/select

* fix the different output of conda and docker.

* rmove the md5sum for gzip files.

* Update software/pairtools/select/main.nf

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* update test file and documentation.

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-06-23 15:58:31 +02:00
Jose Espinosa-Carrasco
e621f78757
Update salmon/quant to provide '--libType A' option (#536)
* Update salmon/quant to provide '--libType A' option

* Modify lib_type description to  be a string

* Update tests

* lyb_type can be explicitly overwrite

* Update main.nf

* Update main.nf

* Update main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-06-17 15:13:47 +02:00
Charles Plessy
c8168bc351
Update last module (#533)
* Update LAST to version 1238.

* Update functions.nf to the latest devel version.

* Update test MD5sums after updating software version.

* Make portable on MacOS

* Allow input alignments to be uncompressed.

While the strategy in this family of modules is to make all inputs and
outputs compressed, this change might be useful to some users.

As of LAST 2138, `last/split` does not allow its input to be compressed.

* Search for .des file, that is guaranteed to be unique.

Some LAST indexes have more than one .bck file and it makes the name
detection crash.

In this commit, I also standardise how the names are detected.

* Use value input channel and optional output channels to handle formats.

As discussed on Slack, it is preferred to use a value input channel
instead of sneaking options through `params.args2` or `params.format`
as we did.

Likewise, optional output channels with clearly labeled format are
preferred to 'catch-all' wildcards.
2021-06-14 12:27:27 +01:00
Charles Plessy
894349b9b5
Fix 459 (#534)
* Update test data names.

Closes: #459

Thanks: @FriederikeHanssen

(`samtools` tests are already fixed by another commit)

* Fix file names in tests.

The update of test data name required changes in the test yml files.

I also updated the `bismark/summary/main.nf` file to get its data
from the `params.test_data` map.
2021-06-14 12:24:59 +01:00
Svyatoslav Sidorov
01dc6a289d
Feat seqtk update (#503)
* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* Added seqtk/subseq and checking for seed in seqtk/sample

* Separate authors in software/seqtk/sample/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Align commans in output channesl software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Define prefix in software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Use prefix in output file name software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Define suffix in options in tests/software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Change output file name in tests/software/seqtk/subseq/test.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Remove a to-do point from tests/software/seqtk/subseq/test.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Added --no-name into gzip commands

* Update samtools from 1.10 to 1.12 (#530)

* feat: remove social preview image to use GitHub OpenGraph

* feat: update samtools from 1.10 to 1.12

* fix: CI tests

* fix: add meta.yml file for samtools/merge

* Update software/samtools/merge/meta.yml

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* Update software/samtools/merge/meta.yml

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* Added seqtk/subseq and checking for seed in seqtk/sample

* hifiasm copied from fastqc

* hifiasm tests init from fastqc

* meta.yml init; test.yml and main.nf for printing version

* Add hifiasm version printing

* Removed spaced on an empty line

* Reverted hifiasm from main

* Separate authors in software/seqtk/sample/meta.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Align commans in output channesl software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Define prefix in software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Use prefix in output file name software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Define suffix in options in tests/software/seqtk/subseq/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Change output file name in tests/software/seqtk/subseq/test.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Remove a to-do point from tests/software/seqtk/subseq/test.yml

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Added --no-name into gzip commands

* Replaced functions.nf in seqtk/subseq

* Refreshed tests for sample and subseq

* Corrected paired-end test and YAML description for sample

Co-authored-by: Sviatoslav Sidorov <sviatoslav.sidorov@crick.ac.uk>
Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
Co-authored-by: Maxime U. Garcia <max.u.garcia@gmail.com>
Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-06-14 12:23:15 +01:00
JIANHONG OU
d9e48b70ca
Pairtools restrict (#522)
* add software/pairtools

* create a branch for pairtools/restrict

* fix the different output of conda and docker

* remove customized code.

* add newline to Frag.bed file.

* change the folder of frag.bed.

* change \n to \r\n

* Remove work.frag.bed

Co-authored-by: JoseEspinosa <kadomu@gmail.com>
2021-06-10 19:39:35 +02:00
Jose Espinosa-Carrasco
3fce3f4950
Bump new nextclade version (0.14.4) (#532) 2021-06-09 14:12:21 +01:00
Jose Espinosa-Carrasco
d71c1b65eb
Pangolin bump new version (3.0.5) (#531)
* Bump new pangolin version

* Update tests
2021-06-09 14:11:59 +01:00
JIANHONG OU
580d1e194a
Cooler digest (#517)
* add software/cooler

* fix the wrong files uploaded.

* create a branch for cooler/digest

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-06-09 11:39:22 +02:00
JIANHONG OU
8d4c6de50d
Cooler dump (#516)
* add software/cooler

* fix the wrong files uploaded.

* create a branch for cooler/dump

* remove the bin size from metadata.
2021-06-09 11:14:17 +02:00
aleksandrabliznina
0745213729
New last/mafconvert module to convert MAF alignments. (#527)
* New last/mafconvert module to convert MAF alignments.

The `maf-convert` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
reads alignmnts in [MAF](https://genome-asia.ucsc.edu/FAQ/FAQformat.html#format5)
format and converts them in another format (axt, blast, blasttab, chain,
gff, html, psl, sam, tab).

This new module is part of the work described in Issue #464. During this
development, we fix the versiob of LAST to 1219 to ensure consistency.
We will upgrade it later.

* Delete white space.

* Update the function.nf file to the dev version.
2021-06-09 08:38:57 +02:00
Charles Plessy
ca321ce69d
New module last/postmask to filter alignment files (#526)
The `last-postmask` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
filters alignments in a MAF file to remove those with too many masked
(lower-case) positions compared with their score.

As other filter modules like `last/split`, its output file risks to
overwrite its input file as their names are constructed from the sample
ID when multiple filters are chained in the pipeline.  I added a check
that gives a clearer error message in this case.  Please let me know
what you think about; I can add this test to the existing LAST modules
as well.

This new module is part of the work discribed in Issue #464. During this
development, we fix the version of LAST to 1219 to ensure consistency.
We will upgrade it later.
2021-06-08 11:14:08 +02:00
Charles Plessy
f7ebc2fc48
New last/dotplot module for pairwise similarity plots (#529)
* New last/dotplot module for pairwise similarity plots

The `last-dotplot` tool takes a pairwise alignment in
[MAF](http://genome.ucsc.edu/FAQ/FAQformat.html#format5) format,
possibly compressed with gzip, or in a tabular format produced by the
`maf-convert` tool, and produces a similarity dot-plot of the two
sequences in one of the graphical formats supported by the Python
Imaging Library.

A the tool guesses the output format by the file extension of the file,
which is constructed by the module at run time, I have used the `args2`
option to convey this information to the module.

This new module is part of the work described in Issue #464.  During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence please ignore lint's version warning).

* Update the functions.nf file to the dev branch.

https://raw.githubusercontent.com/nf-core/tools/dev/nf_core/module-template/software/functions.nf
2021-06-08 11:13:51 +02:00
Maxime U. Garcia
25f88dee3c
Update samtools from 1.10 to 1.12 (#530)
* feat: remove social preview image to use GitHub OpenGraph

* feat: update samtools from 1.10 to 1.12

* fix: CI tests

* fix: add meta.yml file for samtools/merge

* Update software/samtools/merge/meta.yml

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>

* Update software/samtools/merge/meta.yml

Co-authored-by: Jose Espinosa-Carrasco <kadomu@gmail.com>
2021-06-07 13:07:32 +02:00
Michael L Heuer
68c678de93
Remove docker.runOptions in favor of docker.userEmulation. (#525) 2021-05-27 18:29:17 +01:00
JIANHONG OU
a303c6ed5b
Pairtools sort (#524)
* add software/pairtools

* create a branch for pairtools/sort

* fix the different output of conda and docker.

* remove customized code.

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-27 18:25:23 +01:00
JIANHONG OU
9842a10833
Pairtools parse (#521)
* add software/pairtools

* create a branch for pairtools/parse

* fix the issue of bioconda output is different from docker.

* remove customized code from test.

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-27 18:15:54 +01:00
JIANHONG OU
cc8439a167
Pairtools flip (#520)
* add software/pairtools

* create a branch for pairtools/flip

* fix the issue of PG line in output

* remove custom code from test.

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-27 18:09:28 +01:00
JIANHONG OU
912f30e95a
Pairtools dedup (#519)
* add software/pairtools

* create a branch for pairtools/dedup

* remove dedup from the filename.
2021-05-27 17:51:27 +01:00
JIANHONG OU
18440df87a
add genmap/mappability (#509)
* add genmap/mappability

* Add index module.
Remove -w parameter from mappability module.

* change the output of genmap/index

* fix the lint error which can not handle stageAs.

* Apply suggestions from code review

* Update main.nf

* Update test.yml

* Update main.nf

* Update test.yml

* Update tests/software/genmap/mappability/main.nf

* Update software/genmap/mappability/main.nf

* Update software/genmap/mappability/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-25 22:12:10 +01:00
Charles Plessy
207930139a
New last/lastal module to align query sequences on a target index (#510)
* New last/lastal to align query sequences on a target index

`lastal` is the main program of the [LAST](https://gitlab.com/mcfrith/last)
suite.  It align query DNA sequences in FASTA or FASTQ format to a
target index of DNA or protein sequences.  The index is produced by
the `lastdb` program (module `last/lastdb`).  The score matrix for
evaluating the alignment can be chosen among preset ones or computed
iteratively by the `last-train` program (module `last/train`).  For
this reason, the `last/lastal` module proposed here has one input
channel containing an optional file, that has to be dummy when not used.

The LAST aligner outputs MAF files that can be very large (up to
hundreds of gigabytes), therefore this module unconditionally compresses
its output with gzip.

This new module is part of the work described in Issue #464.  During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Un-hardcode the path to the LAST index.

Among multiple alternatives I have chosen the following command to
detect the sample name of the index, because it fails in situations
where there is no index files in the index folder, and in situations
were there are two indexes files in the folder.  Not failing would
result in feeding garbage information in the INDEX_NAME variable.

    basename \$(ls $index/*.bck) .bck

In case of missing file, a clear error message is given by `ls`.  In
case of more than one file, the error message of `basename` is more
cryptic, unfortunately.  (`basename: extra operand ‘.bck’`)

Alternatives that do not fail if there is no .bck file:

    basename $index/*bck .bck
    find $index -name '*bck' | sed 's/.bck//'

Alternatives that do not fail if there are more than one .bck file:

    basename -s .bck $index/*bck
    ls $index/*.bck | xargs basename -s .bck
    find $index -name '*bck' | sed 's/.bck//'

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-25 22:10:48 +01:00
JIANHONG OU
34f555a26a
add software/pairix (#508)
* add software/pairix

* Update the functions.nf to new version.
Remove -p parameter and fix version output command.
fix the duplicated documentation.
2021-05-25 21:22:57 +01:00
aleksandrabliznina
4575e5455c
New last/split module to find split alignments. (#511)
* New last/split module to find split alignments.

The `last-split` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
finds split or spliced alignments in a MAF file that is produced with, for
example, LAST `lastal` command.

This new module is part of the work discribed in Issue #464. During this
development, we fix the versiob of LAST to 1219 to ensure consistency. We will
upgrade it later.

* Update software/last/split/main.nf

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-24 20:15:57 +01:00
JIANHONG OU
ce68395240
add additional ucsc tools (#506)
* add additional ucsc tools

* Update software/ucsc/wigtobigwig/meta.yml

Co-authored-by: Kevin Menden <kevin.menden@live.com>

* Update the functions.nf and software name for ucsc/wigtobigwig and bigwigaverageoverbed.

Co-authored-by: Kevin Menden <kevin.menden@live.com>
2021-05-20 15:39:33 -04:00
Charles Plessy
e75f88c68a
New module last/mafswap to reorder sequences in alignments (#500)
* New module last/mafswap to reorder sequences in alignments

The `maf-swap` tool distributed with [LAST](https://gitlab.com/mcfrith/last)
reorders sequences in alignment files in Multiple Alignment Format.
When run without command-line arguments, it will swap the target and the
query sequences.  This is useful when turning a many-to-many alignment
into a many-to-one and then a one-to-one alignment in conjunction with
the `last-split` command (split, swap, split and swap again).

The LAST aligner outputs MAF files, but other tools also use this
format.  As MAF files can be very large (up to hundreds of gigabytes),
the module expects its input to be compressed with gzip and will
compress its output.

This new module is part of the work described in Issue #464.  During
this development, we fix the version of LAST to 1219 to ensure
consistency (hence ignore lint's version warning).

* Update MD5 sum.

Actually, 7029066c27ac6f5ef18d660d5741979a is the MD5 sum of
an empty file compressed with `gzip --no-name`…  This happened
because I forgot to update the config file after correcting the
module… sorry !

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Change name as suggested in pull request.

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-19 08:59:23 +01:00
aleksandrabliznina
b592cea30b
New last/train module to train alignment parameters. (#492)
* New last/train module to train alignment parameters.

The last-train command creates a parameter file that
will be used by last/lastal module for sequence alignment.
It takes indexed sequences and query sequences as input
and we use the metadata of both to create an id of the
parameter output file.

Submission of the LAST modules is discussed in more
details in the issue #464. For consistancy, we use LAST
version 1219 for this whole development and will upgrade later.

* Corrected files according to the nf-core v1.14 standards.

* Fixed function.nf file for the last-train module.

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Find index name.

* Correct after the input channels were changed.

* Use double underscore as a name separator.

Single underscores can happen in ids, therefore, we would like to keep two underscores.

* Remove extra spaces.

* Fixed the passing of the "score matrix" line.

* Apply suggestions from code review

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>

* Update software/last/train/main.nf

Co-authored-by: Harshil Patel <drpatelh@users.noreply.github.com>
2021-05-19 08:37:08 +01:00