# Remember - The workflow file is always named 'workflow/main.f'
# The workflow must publish all final output into $baseDir
# A list of clueter environment modules that this workflow requires to run.
# Specify versioned module names to ensure reproducability.
workflow_modules:
-'trimgalore/0.4.1'
-'cutadapt/1.9.1'
-'hisat2/2.0.1-beta-intel'
-'samtools/intel/1.3'
-'picard/1.127'
-'subread/1.5.0-intel'
-'stringtie/1.1.2-intel'
-'speedseq/20160506'
# A list of parameters used by the workflow, defining how to present them,
# options etc in the web interface. For each parameter:
#
# REQUIRED INFORMATION
# id: The name of the parameter in the NEXTFLOW workflow
# type: The type of the parameter, one of:
# string - A free-format string
# integer - An integer
# real - A real number
# file - A single file from user data
# files - One or more files from user data
# select - A selection from a list of values
# required: true/false, must the parameter be entered/chosen?
# description: A user friendly description of the meaning of the parameter
#
# OPTIONAL INFORMATION
# default: A default value for the parameter (optional)
# min: Minium value/characters/files for number/string/files types
# max: Maxumum value/characters/files for number/string/files types
# regex: A regular expression that describes valid entries / filenames
#
# SELECT TYPE
# choices: A set of choices presented to the user for the parameter.
# Each choice is a pair of value and description, e.g.
#
# choices:
# - [ 'myval', 'The first option']
# - [ 'myval', 'The second option']
#
# NOTE - All parameters are passed to NEXTFLOW as strings... but they
# are validated by astrocyte using the information provided above
workflow_parameters:
-id:fastqs
type:files
required:true
description:|
One or more input paired-end FASTQ files from a RNASeq experiment and a design file with the link between the same name and the sample group
regex:".*(fastq|fq)*"
min:1
-id:stranded
type:select
required:true
choices:
-['0','Unstranded']
-['1','Stranded']
-['2','ReverseStranded']
description:|
In the case that the sequence libraries where generated using a stranded specific protocol.
-id:pairs
type:select
required:true
choices:
-['pe','PairedEnd']
-['se','SingleEnd']
description:|
In single-end sequencing, the sequencer reads a fragment from only one end to the other, generating the sequence of base pairs. In paired-end reading it starts at one read, finishes this direction at the specified read length, and then starts another round of reading from the opposite end of the fragment.
-id:align
type:select
required:true
choices:
-['hisat','HiSAT2']
-['star','STAR']
description:|
Alignment tool
-id:markdups
type:select
required:true
choices:
-['mark','RemoveDuplicates']
-['keep','KeepAllSequences']
description:|
Duplicate reads are defined as originating from the same original fragment of DNA. Duplicates are identified as read pairs having identical 5-prime positions (coordinate and strand) for both reads in a mate pair and optionally, matching unique molecular identifier reads.
-id:design
type:file
required:true
regex:".*txt"
description:|
A design file listing pairs of sample name and sample group.
Columns must include: SampleID,SampleName,SampleGroup,FullPathToFqR1,FullPathToFqR2