process_getData
Purpose:
- Download input data from the consortium
Input:
- bdbag.zip (see example at /archive/BICF/shared/GUDMAP.RBK/RNA/data/Study_Q-Y4H0.zip)
Outputs for later processes:
- file.csv
- Experiment Settings.csv
- Experiment.csv
- *.fastq.gz
Process:
- Remove all lines in fetch.txt (within bdbag.zip) that the filename doesn't end in .fastq.gz
- Run BDBAG to fetch all files
Tools:
- Python v3.7 or lower depending on compatibility
- Pandas v0.25.1 (if python will be used for fetch.txt filtering)
- BDBAG v1.5.5