1
0
Fork 0
mirror of https://github.com/MillironX/nf-configs.git synced 2024-11-22 00:26:03 +00:00
nf-configs/docs/nihbiowulf.md
2022-03-21 19:23:28 -05:00

2 KiB

nf-core/configs: nihbiowulf Configuration

nf-core pipelines have been configured for use on the Biowulf cluster at the NIH.

To use, run the pipeline with -profile nihbiowulf. This will download and launch the profile.config which has been pre-configured with a setup suitable for the Biowulf cluster at NIH. Using this profile, a docker image containing all of the required software will be downloaded, and converted to a Singularity image before execution of the pipeline.

Below are non-mandatory information e.g. on modules to load etc

Before running the pipeline you will need to load Nextflow and Singularity using the environment module system on Biowulf. You can do this by issuing the commands below:

## Load Nextflow and Singularity environment modules
module purge
module load nextflow
module load singularity

Below are non-mandatory information on iGenomes specific configuration

A partial local copy of the iGenomes resource is available on Biowulf. This is a copy of the Illumina iGenomes resource from several years ago and is not up-to-date with the s3-hosted nf-core iGenomes. There are some structural differences of note. In particular, if using BWA, the igenomes.conf should be modified to specify the BWA version folder, otherwise the BWA module will fail to find an appropriate index. To date, this is the only issue, however functionality has not been extensively tested with iGenomes on Biowulf. Nonetheless, you should, in theory, be able to run the pipeline against any reference available in the igenomes.config specific to the nf-core pipeline.

You can do this by simply using the --genome <GENOME_ID> parameter.

NB: You will need an account to use the HPC cluster on Biowulf in order to run the pipeline. If in doubt contact CIT. NB: Nextflow will need to submit the jobs via the job scheduler to the HPC cluster. The master process submitting jobs should be run either as a batch job or on an interactive node - not on the biowulf login node. If in doubt contact Biowulf staff.