Skip to content
Snippets Groups Projects
CHANGELOG.md 2.98 KiB
Newer Older
# v0.1.0 (in development)
Gervaise Henry's avatar
Gervaise Henry committed
* Add option to pull references from datahub
* Add option to send email on workflow error, with pipeline error message
* Add versions and paper references of software used to report
Gervaise Henry's avatar
Gervaise Henry committed
* Upload input bag
Gervaise Henry's avatar
Gervaise Henry committed
* Upload execution run
* Upload mRNA QC
Gervaise Henry's avatar
Gervaise Henry committed
* Create and upload output bag
Gervaise Henry's avatar
Gervaise Henry committed
* Add optional to not upload
Gervaise Henry's avatar
Gervaise Henry committed
* Update references to use bags
* Update to newer references (GRCh38.p13.v36 and GRCm38.p6.vM25)
* Use production server for data-hub reference call
* Stop pipeline if submitted does not match infered
* Update execution run with "Success" or "Error"
Gervaise Henry's avatar
Gervaise Henry committed
* Remove (comment out) option to pull references from S3
* Make pull references from BioHPC default (including in biohpc.config)
Gervaise Henry's avatar
Gervaise Henry committed
* Start using new gudmaprbk dockerhub (images autobuilt)
Venkat Malladi's avatar
Venkat Malladi committed
* Moved consistency checks to be fully python
* Changed order of steps so that fastqc is done after the trim step
Gervaise Henry's avatar
Gervaise Henry committed
* Change docker images to production
Gervaise Henry's avatar
Gervaise Henry committed
* Add automated version badges
Gervaise Henry's avatar
Gervaise Henry committed
* Only calculate/report tin values on regular chromosomes (from gtf)
Gervaise Henry's avatar
Gervaise Henry committed
* Datahub reference pull uses dev.gudmap.org as source until referencencs are placed on production
* Override params (inputBag, fastq, species) aren't checked for integrity
Gervaise Henry's avatar
Gervaise Henry committed
* 
**User Facing**
* TPM table:
  * Add Ensembl Gene ID
  * Rename columns: *GENCODE_Gene_Symbol*, *Ensembl_GeneID*, *NCBI_GeneID*
Gervaise Henry's avatar
Gervaise Henry committed
* MultiQC output custom tables (html+JSON):
  * Run table: *Session ID* and *Pipeline Version*
Gervaise Henry's avatar
Gervaise Henry committed
  * Reference Table: *Species*, *Genome Reference Consortium Build*, *Genome Reference Consortium Patch*, *GENCODE Annotation Release* (outputs both human and mouse versions)
Gervaise Henry's avatar
Gervaise Henry committed
* Add inputBag override param (`inputBagForce`) [`*.zip`]
Gervaise Henry's avatar
Gervaise Henry committed
  * Uses provided inputBag instead of downloading from data-hub
  * Still requires matching repRID input param
* Add fastq override param (`fastqsForce`) [`R1`,`R2`]
  * Uses provided fastq instead of downloading from data-hub
  * Still requires matching repRID input param and will pull inputBag from data-hub to access submitted metadata for reporting
Gervaise Henry's avatar
Gervaise Henry committed
* Add species override param (`speciesForce`) [`Mus musculus` or `Homo sapiens`]
  * forces the use of the provided species
Gervaise Henry's avatar
Gervaise Henry committed
  * ignores inferred ambiguous species

**Background**
* Add GeneSymbol/EnsemblID/EntrezID translation files to references

*Known Bugs*
* outputBag does not contain fetch for processed data
* Does not include automatic data upload
Gervaise Henry's avatar
Gervaise Henry committed
* Override params (inputBag, fastq, species) aren't checked for integrity
**User Facing**
Gervaise Henry's avatar
Gervaise Henry committed
* Output:
  * inputBag
  * outputBag
Gervaise Henry's avatar
Gervaise Henry committed
* Remove gene details from tpm table
* Add EntrezID translation to tpm table (from version specific reference)
Gervaise Henry's avatar
Gervaise Henry committed
* Add GeneSymbol/EnsemblID/EntrezID translation files to references
Gervaise Henry's avatar
Gervaise Henry committed
* outputBag does not contain fetch for processed data
* Does not include automatic data upload
Gervaise Henry's avatar
Gervaise Henry committed

<hr>

# v0.0.1
**INITIAL BETA VERSION**\
Does not include automatic data upload\
This version is for initial upload of test data to GUDMAP/RBK data-hub for internal integration