Skip to content

Commit 6de58a6

Browse files
authored
Merge pull request #79 from fulaibaowang/ae_skip-instead-of-run-and-documentation
Ae skip instead of run and documentation
2 parents 38835fb + 08ccd04 commit 6de58a6

File tree

14 files changed

+55
-41
lines changed

14 files changed

+55
-41
lines changed

README.md

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -62,9 +62,6 @@ nextflow run nf-core/drop \
6262
--outdir <OUTDIR> \
6363
--genome hg19 \
6464
--gene_annotation <path/to/gene/annotation/yaml> \
65-
--ae_run true \
66-
--as_run true \
67-
--mae_run true \
6865
--ucsc_fasta <path/to/fasta>
6966
```
7067

assets/schema_input.json

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@
3535
"pattern": "^\\S+$",
3636
"errorMessage": "DNA ID must be provided and cannot contain spaces",
3737
"meta": ["dna_id"],
38-
"description": "Unique identifier for the DNA sample. Must not contain spaces."
38+
"description": "Identifier for the DNA sample. Must not contain spaces."
3939
},
4040
"DNA_VCF_FILE": {
4141
"type": "string",
@@ -96,8 +96,8 @@
9696
"type": "string",
9797
"format": "file-path",
9898
"exists": true,
99-
"pattern": "^\\S+\\.tsv\\.gz$",
100-
"errorMessage": "The gene counts file has to exist, cannot contain spaces and must have extension '.tsv.gz'",
99+
"pattern": "^\\S+\\.tsv(\\.gz)?$",
100+
"errorMessage": "The gene counts file has to exist, cannot contain spaces and must have extension '.tsv' or '.tsv.gz'",
101101
"description": "Path to the gene counts file. Must exist and cannot contain spaces."
102102
},
103103
"GENE_ANNOTATION": {

conf/test.config

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,6 @@ params {
3838
ncbi_fai = "https://github.com/gagneurlab/drop_demo_data/raw/refs/heads/main/Data/chr21_ncbi.fa.fai"
3939

4040
// Aberrant expression
41-
ae_run = true
4241
ae_groups = 'outrider,outrider_external'
4342
ae_genes_to_test = 'https://github.com/gagneurlab/drop_demo_data/raw/refs/heads/main/Data/genes_to_test.yaml'
4443
ae_fpkm_cutoff = 1
@@ -49,7 +48,6 @@ params {
4948
ae_yield_size = 2000000
5049

5150
// Aberrant splicing
52-
as_run = true
5351
as_groups = 'fraser,fraser_external'
5452
as_recount = true
5553
as_long_read = false
@@ -67,7 +65,6 @@ params {
6765
as_delta_psi_cutoff = 0.05
6866

6967
// MAE
70-
mae_run = true
7168
mae_groups = 'mae'
7269
mae_gatk_header_check = false
7370
mae_padj_cutoff = 0.5

docs/output.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,7 @@ Additionally the `mae` subworkflow creates the following files:
7979
<summary>Output files</summary>
8080

8181
- `multiqc/`
82-
- `multiqc_report.html`: a standalone HTML file that can be viewed in your web browser.
82+
- `multiqc_report.html`: a merged report for all subworkflows.
8383
- `multiqc_data/`: directory containing parsed statistics from the different tools used in the pipeline.
8484
- `multiqc_plots/`: directory containing static images from the report in various formats.
8585

docs/usage.md

Lines changed: 22 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
77
## Introduction
88

9-
nf-core/drop allows controlling which subworkflows to run via parameters (`--ae_run` (Aberrant Expression), `--as_run` (Aberrant Splicing), `--mae_run` (Mono-Allelic Expression)). By default, each subworkflow is set to false. We describe different global and module-specific parameters in the [parameter documentation](https://nf-co.re/drop/parameters).
9+
nf-core/drop allows controlling which subworkflows to run. By default, all subworkflows will run. You can skip subsworkflow via parameters (`--ae_skip` (Aberrant Expression), `--as_skip` (Aberrant Splicing), `--mae_skip` (Mono-Allelic Expression)). We describe different global and module-specific parameters in the [parameter documentation](https://nf-co.re/drop/parameters).
1010

1111
## Samplesheet input
1212

@@ -100,12 +100,32 @@ the `DROP_GROUP` BLOOD_AS for the aberrant expression module (containing S10R, E
100100

101101
An [example samplesheet](../assets/samplesheet.tsv) has been provided with the pipeline.
102102

103+
| Column | Description |
104+
| ------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
105+
| `RNA_ID` | Unique identifier for the RNA sample. |
106+
| `RNA_BAM_FILE` | Path to the RNA BAM file. |
107+
| `RNA_BAI_FILE` | Path to the RNA BAM index file. It will be automatically generated from the BAM files if not given. |
108+
| `DNA_ID` | Identifier for the DNA sample. It shall be the sample header in the VCF file. |
109+
| `DNA_VCF_FILE` | Path to the DNA VCF file. |
110+
| `DNA_TBI_FILE` | Path to the DNA VCF index file. It will be automatically generated from the VCF files if not given. |
111+
| `DROP_GROUP` | See [above](#drop_group). |
112+
| `PAIRED_END` | Indicates if the input is paired-end or single-end. Default: `true` (paired-end). Refer to the documentation of [HTSeq](https://htseq.readthedocs.io/en/latest/). |
113+
| `COUNT_MODE` | Count mode. Default: `IntersectionStrict`. Options: `union`, `IntersectionStrict`, `IntersectionNotEmpty`. Refer to the documentation of [HTSeq](https://htseq.readthedocs.io/en/latest/). |
114+
| `COUNT_OVERLAPS` | Indicates if overlaps should be counted. Default: `true`. Refer to the documentation of [HTSeq](https://htseq.readthedocs.io/en/latest/). |
115+
| `STRAND` | Samples within each `DROP_GROUP` should either be stranded (`yes`, `reverse`, or a combination of `yes` and `reverse`) or unstranded (only `no`), and this can vary between different `DROP_GROUP`s. |
116+
| `HPO_TERMS` | Comma-separated list of HPO terms associated with the sample. |
117+
| `GENE_COUNTS_FILE` | Path to the gene counts file (`.tsv` or `.tsv.gz`). See details also [above](#aberrant-expression). |
118+
| `GENE_ANNOTATION` | Gene annotation in YAML format, e.g. [assets/gene_annotation.yaml](../assets/gene_annotation.yaml). |
119+
| `GENOME` | See details [above](#mae). Default: `ucsc`. Options: `ncbi`, `ucsc`. |
120+
| `SPLICE_COUNTS_DIR` | Path to the splice counts directory. See details [above](#aberrant-splicing). |
121+
| `SEX` | Sex of the sample. Samples of `m`, `male`, `f`, `femal` are analysed for sex bias aberrant expression report. |
122+
103123
## Running the pipeline
104124

105125
The typical command for running the pipeline is as follows:
106126

107127
```bash
108-
nextflow run nf-core/drop --input ./samplesheet.tsv --outdir ./results --genome hg19 --gene_annotation ./gene_annotation.yaml -profile docker --ae_run true
128+
nextflow run nf-core/drop --input ./samplesheet.tsv --outdir ./results --genome hg19 --gene_annotation ./gene_annotation.yaml -profile docker
109129
```
110130

111131
This will launch the pipeline with the `docker` configuration profile. See below for more information about profiles.

main.nf

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -141,18 +141,18 @@ workflow {
141141
ec_exclude_groups,
142142

143143
// Aberrant expression parameters
144-
params.ae_run,
144+
params.ae_skip,
145145
ae_groups,
146146
ae_genes_to_test,
147147

148148
// Aberrant splicing parameters
149-
params.as_run,
149+
params.as_skip,
150150
as_groups,
151151
params.as_fraser_version,
152152
as_genes_to_test,
153153

154154
// Mono Allelic Expression parameters
155-
params.mae_run,
155+
params.mae_skip,
156156
mae_groups,
157157
mae_qc_groups
158158
)

nextflow.config

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ params {
2929
ec_exclude_groups = null
3030

3131
// Aberrant expression
32-
ae_run = false
32+
ae_skip = false
3333
ae_groups = ""
3434
ae_min_ids = 1
3535
ae_fpkm_cutoff = 1
@@ -41,7 +41,7 @@ params {
4141
ae_genes_to_test = null
4242

4343
// Aberrant splicing
44-
as_run = false
44+
as_skip = false
4545
as_groups = ""
4646
as_min_ids = 1
4747
as_recount = false
@@ -60,7 +60,7 @@ params {
6060
as_genes_to_test = null
6161

6262
// Mono-allelic expression
63-
mae_run = false
63+
mae_skip = false
6464
mae_groups = null
6565
mae_gatk_header_check = false
6666
mae_padj_cutoff = 0.05

nextflow_schema.json

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -89,9 +89,9 @@
8989
"fa_icon": "fas fa-file-export",
9090
"description": "Aberrant expression options.",
9191
"properties": {
92-
"ae_run": {
92+
"ae_skip": {
9393
"type": "boolean",
94-
"description": "Run aberrant expression analysis.",
94+
"description": "Skip aberrant expression analysis.",
9595
"fa_icon": "fas fa-file-export"
9696
},
9797
"ae_groups": {
@@ -163,9 +163,9 @@
163163
"fa_icon": "fas fa-file-export",
164164
"description": "Aberrant splicing options.",
165165
"properties": {
166-
"as_run": {
166+
"as_skip": {
167167
"type": "boolean",
168-
"description": "Run aberrant splicing analysis.",
168+
"description": "Skip aberrant splicing analysis.",
169169
"fa_icon": "fas fa-file-export"
170170
},
171171
"as_groups": {
@@ -282,9 +282,9 @@
282282
"fa_icon": "fas fa-file-export",
283283
"description": "Mono-allelic expression options.",
284284
"properties": {
285-
"mae_run": {
285+
"mae_skip": {
286286
"type": "boolean",
287-
"description": "Run mono-allelic expression analysis.",
287+
"description": "Skip mono-allelic expression analysis.",
288288
"fa_icon": "fas fa-file-export"
289289
},
290290
"mae_groups": {

ro-crate-metadata.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

subworkflows/local/utils_nfcore_drop_pipeline/main.nf

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -199,7 +199,7 @@ workflow PIPELINE_COMPLETION {
199199
def validateInputParameters() {
200200
genomeExistsError()
201201

202-
if (params.ae_run && params.gene_annotation == null) {
202+
if ((!params.ae_skip) && params.gene_annotation == null) {
203203
error("Please provide a gene annotation file using the --gene_annotation parameter when running the aberrant expression analysis.")
204204
}
205205
}

0 commit comments

Comments
 (0)