Skip to content
Snippets Groups Projects
Commit 4430629a authored by Gervaise Henry's avatar Gervaise Henry :cowboy:
Browse files

Merge branch 'develop' into 'master'

Develop

See merge request !32
parents e1d62b1f cb957223
Branches
2 merge requests!64Master,!32Develop
Pipeline #3349 passed with stages
in 4 minutes and 24 seconds
|*master*|*develop*|
|:-:|:-:|
|[![Build Status](https://git.biohpc.swmed.edu/BICF/Astrocyte/cellranger_count/badges/master/build.svg)](https://git.biohpc.swmed.edu/BICF/Astrocyte/cellranger_count/commits/master)|[![Build Status](https://git.biohpc.swmed.edu/BICF/Astrocyte/cellranger_count/badges/develop/build.svg)](https://git.biohpc.swmed.edu/BICF/Astrocyte/cellranger_count/commits/develop)|
10x Genomics scRNA-Seq (cellranger) count Pipeline 10x Genomics scRNA-Seq (cellranger) count Pipeline
======================================== ==================================================
Introduction Introduction
------------ ------------
...@@ -19,13 +23,14 @@ To Run: ...@@ -19,13 +23,14 @@ To Run:
* **--fastq** * **--fastq**
* path to the fastq location * path to the fastq location
* R1 and R2 only necessary but can include I2 * R1 and R2 only necessary but can include I2
* only fastq's in designFile (see below) are used, not present will be ignored
* eg: **--fastq '/project/shared/bicf_workflow_ref/workflow_testdata/cellranger/cellranger_count/v3s2r100k/\*.fastq.gz'** * eg: **--fastq '/project/shared/bicf_workflow_ref/workflow_testdata/cellranger/cellranger_count/v3s2r100k/\*.fastq.gz'**
* **--designFile** * **--designFile**
* path to design file (csv format) location * path to design file (csv format) location
* column 1 = "Sample" * column 1 = "Sample"
* column 2 = "fastq_R1" * column 2 = "fastq_R1"
* column 3 = "fastq_R2" * column 3 = "fastq_R2"
* can have repeated "Sample" if there are multiole fastq R1/R2 pairs for the samples * can have repeated "Sample" if there are multiple fastq R1/R2 pairs for the samples
* eg: **--designFile '/project/shared/bicf_workflow_ref/workflow_testdata/cellranger/cellranger_count/v3s2r100k/design.csv'** * eg: **--designFile '/project/shared/bicf_workflow_ref/workflow_testdata/cellranger/cellranger_count/v3s2r100k/design.csv'**
* **--genome** * **--genome**
* reference genome * reference genome
...@@ -44,7 +49,7 @@ To Run: ...@@ -44,7 +49,7 @@ To Run:
* eg: **--genome 'GRCh38-3.0.0'** * eg: **--genome 'GRCh38-3.0.0'**
* **--genomeLocationFull** * **--genomeLocationFull**
* path to a custom genome * path to a custom genome
* if --genomeLocationFull is used --genome is not necessary and is overwritten * if --genomeLocationFull is used --genome is not necessary and is ignored
* eg. **--genomeLocationFull '/project/apps_database/cellranger/refdata-cellranger-GRCh38-3.0.0'** * eg. **--genomeLocationFull '/project/apps_database/cellranger/refdata-cellranger-GRCh38-3.0.0'**
* **--expectCells** * **--expectCells**
* expected number of cells to be detected * expected number of cells to be detected
...@@ -57,11 +62,11 @@ To Run: ...@@ -57,11 +62,11 @@ To Run:
* **--forceCells** * **--forceCells**
* forces filtering of the top number of cells matching this parameter * forces filtering of the top number of cells matching this parameter
* 0-10000 * 0-10000
* if --forceCells is used then --expectedCells is not necessary and is overwritten * if --forceCells is used then --expectedCells is not necessary and is ignored
* eg: **--forceCells 10000** * eg: **--forceCells 10000**
* **--kitVersion** * **--kitVersion**
* the library chemistry version number for the 10x Genomics Gene Expression kit * the library chemistry version number for the 10x Genomics Gene Expression kit
* setting to auto will attempt to autodetect from the detected cycle strategy in the fastq's * setting to auto will attempt to autodetect from the detected sequencing strategy in the fastq's
* version numbers are spelled out * version numbers are spelled out
* --kitversion is only used if --version (cellranger version) is > 2 * --kitversion is only used if --version (cellranger version) is > 2
* --version (cellranger version) 2.1.1 can only read --kitVersion of two (2) * --version (cellranger version) 2.1.1 can only read --kitVersion of two (2)
...@@ -69,7 +74,7 @@ To Run: ...@@ -69,7 +74,7 @@ To Run:
* *'auto'* * *'auto'*
* *'three'* * *'three'*
* *'two'* * *'two'*
* eg: **--kitVersion 'three'**' * eg: **--kitVersion 'three'**
* **--version** * **--version**
* cellranger version * cellranger version
* --version (cellranger version) 2.1.1 can only read --kitVersion of two (2) * --version (cellranger version) 2.1.1 can only read --kitVersion of two (2)
...@@ -91,4 +96,4 @@ To Run: ...@@ -91,4 +96,4 @@ To Run:
|---------|------------------------------------|------------------------------------| |---------|------------------------------------|------------------------------------|
| sample1 | pbmc_1k_v2_S1_L001_R1_001.fastq.gz | pbmc_1k_v2_S1_L001_R2_001.fastq.gz | | sample1 | pbmc_1k_v2_S1_L001_R1_001.fastq.gz | pbmc_1k_v2_S1_L001_R2_001.fastq.gz |
| sample2 | pbmc_1k_v2_S2_L001_R1_001.fastq.gz | pbmc_1k_v2_S2_L001_R2_001.fastq.gz | | sample2 | pbmc_1k_v2_S2_L001_R1_001.fastq.gz | pbmc_1k_v2_S2_L001_R2_001.fastq.gz |
| sample2 | pbmc_1k_v2_S2_L002_R1_001.fastq.gz | pbmc_1k_v2_S2_L002_R2_001.fastq.gz | | sample2 | pbmc_1k_v2_S2_L002_R1_001.fastq.gz | pbmc_1k_v2_S2_L002_R2_001.fastq.gz |
\ No newline at end of file
10x Genomics scRNA-Seq (cellranger) count Pipeline 10x Genomics scRNA-Seq (cellranger) count Pipeline
======================================== ==================================================
Introduction Introduction
------------ ------------
...@@ -24,7 +24,7 @@ To Run: ...@@ -24,7 +24,7 @@ To Run:
* column 3 = "fastq_R2" * column 3 = "fastq_R2"
* can have repeated "Sample" if there are multiole fastq R1/R2 pairs for the samples * can have repeated "Sample" if there are multiole fastq R1/R2 pairs for the samples
* eg: can be downloaded [HERE](https://git.biohpc.swmed.edu/BICF/Astrocyte/cellranger_count/blob/8db3e25c13cb1463c2a50e510159c72380ae5826/docs/design.csv) * eg: can be downloaded [HERE](https://git.biohpc.swmed.edu/BICF/Astrocyte/cellranger_count/blob/8db3e25c13cb1463c2a50e510159c72380ae5826/docs/design.csv)
* **genome** * **genome**
* Reference species and genome used for alignment and subsequent analysis. * Reference species and genome used for alignment and subsequent analysis.
* name of available 10x Gemomics premade reference genomes: * name of available 10x Gemomics premade reference genomes:
* *'GRCh38-3.0.0'* = Human GRCh38 release 93 * *'GRCh38-3.0.0'* = Human GRCh38 release 93
...@@ -36,30 +36,39 @@ To Run: ...@@ -36,30 +36,39 @@ To Run:
* *'hg19_and_mm10-3.0.0'* = Human GRCh37 (hg19) + Mouse GRCm38 (mm19) release 93 * *'hg19_and_mm10-3.0.0'* = Human GRCh37 (hg19) + Mouse GRCm38 (mm19) release 93
* *'hg19_and_mm10-1.2.0'* = Human GRCh37 (hg19) + Mouse GRCm38 (mm19) release 84 * *'hg19_and_mm10-1.2.0'* = Human GRCh37 (hg19) + Mouse GRCm38 (mm19) release 84
* *'ercc92-1.2.0'* = ERCC.92 Spike-In * *'ercc92-1.2.0'* = ERCC.92 Spike-In
* **expect cells** * **expect cells**
* Expected number of recovered cells. * Expected number of recovered cells.
* guides cellranger in it's cutoff for background/low quality cells * guides cellranger in it's cutoff for background/low quality cells
* as a guide it doesn't have to be exact * as a guide it doesn't have to be exact
* 0-10000 * 0-10000
* if --expextedCells is used then --forceCells is not necessary * if --expextedCells is used then --forceCells is not necessary
* only used if force cells is not entered or set to 0 * only used if force cells is not entered or set to 0
* **force cells** * **force cells**
* Force pipeline to use this number of cells, bypassing the cell detection algorithm. Use this if the number of cells estimated by Cell Ranger is not consistent with the barcode rank plot. A value of 0 ignores this option. Any value other than 0 overrides expect-cells. * Force pipeline to use this number of cells, bypassing the cell detection algorithm. Use this if the number of cells estimated by Cell Ranger is not consistent with the barcode rank plot. A value of 0 ignores this option. Any value other than 0 overrides expect-cells.
* 0-10000 * 0-10000
* if force cells is used then expected cells is not necessary and is ignored * if force cells is used then expected cells is not necessary and is ignored
* **chemistry version** * **chemistry version**
* 10x single cell gene expression chemistry version (only used in cellranger version 3.x). * 10x single cell gene expression chemistry version (only used in cellranger version 3.x).
* setting to auto will attempt to autodetect from the detected cycle strategy in the fastq's * setting to auto will attempt to autodetect from the detected cycle strategy in the fastq's
* chemistry version is only used if cellranger version is > 2.x * chemistry version is only used if cellranger version is > 2.x
* cellranger version 2.1.1 can only read chemistry version less than or equal to two (2) * cellranger version 2.1.1 can only read chemistry version less than or equal to two (2)
* **cellranger version** * **cellranger version**
* 10x cellranger version. * 10x cellranger version.
* cellranger version 2.1.1 can only read chemistry version less than or equal to two (2) * cellranger version 2.1.1 can only read chemistry version less than or equal to two (2)
* Design example: * Design example:
| Sample | fastq_R1 | fastq_R2 | | Sample | fastq_R1 | fastq_R2 |
|---------|------------------------------------|------------------------------------| |---------|------------------------------------|------------------------------------|
| sample1 | pbmc_1k_v2_S1_L001_R1_001.fastq.gz | pbmc_1k_v2_S1_L001_R2_001.fastq.gz | | sample1 | pbmc_1k_v2_S1_L001_R1_001.fastq.gz | pbmc_1k_v2_S1_L001_R2_001.fastq.gz |
| sample2 | pbmc_1k_v2_S2_L001_R1_001.fastq.gz | pbmc_1k_v2_S2_L001_R2_001.fastq.gz | | sample2 | pbmc_1k_v2_S2_L001_R1_001.fastq.gz | pbmc_1k_v2_S2_L001_R2_001.fastq.gz |
| sample2 | pbmc_1k_v2_S2_L002_R1_001.fastq.gz | pbmc_1k_v2_S2_L002_R2_001.fastq.gz | | sample2 | pbmc_1k_v2_S2_L002_R1_001.fastq.gz | pbmc_1k_v2_S2_L002_R2_001.fastq.gz |
Credits
-------
This worklow is was developed jointly with the [Bioinformatic Core Facility (BICF), Department of Bioinformatics](http://www.utsouthwestern.edu/labs/bioinformatics/)
Please cite in publications: Pipeline was developed by BICF from funding provided by **Cancer Prevention and Research Institute of Texas (RP150596)**.
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment