Pipeline Execution
Once all the configuration dependencies are met and paramaters are set in the config.yaml
, sc-VirusScan can be initiated as described below:
Activate the conda environment:
conda activate sc-VirusScan
Once conda environment is activated, trigger the pipeline using following command:
snakemake --cores 16 --configfile config.yaml --latency-wait 60 --profile <Slurm_Profile_Name>
–cores: Cores to be specified for the pipeline (Minimum: 16)
–configfile: Path to the config.yaml file
–profile: If slurm profile available, specify the slurm profile name
Pipeline Outputs
On successfully completion of sc-VirusScan, following output file are generated in the respective directories as representated below.
├── results
│ ├── kraken2 #Kraken Classification Reports
│ │ ├── Sample1.kraken
│ │ └── Sample1.report.txt
│ │
│ ├── cellranger #CellRanger scRNAseq Analysis Intermediate Files
│ │ └── Sample1
│ │ ├── filtered_feature_bc_matrix
│ │ ├── raw_feature_bc_matrix
│ │ ├── possorted_genome_bam
│ │ ├── possorted_genome_bam.bai
│ │ ├── filtered_feature_bc_matrix.h5
│ │ └── raw_feature_bc_matrix.h5
│ │
│ ├── count_matrix #Final Count matrix result
│ │ └── Sample1/
│ │ └── count_matrix.tsv
│ │
│ └── kraken_reports #QC Reports and Plots based on Kraken2 Reports
│ ├── Familywise_tax_readcounts.tsv
│ ├── Specieswise_tax_readcounts.tsv
│ ├── Clustermap_Familywise_log10.png
│ └── Clustermap_Specieswise_log10.png
└── logs
The users can use the count_matrix.tsv
file from the results directory along with the Cellranger barcodes in the sample wise directories under cellranger directory for further downstream single cell analysis using Seurat and ScanPy.