Pipeline Execution

Once all the configuration dependencies are met and paramaters are set in the config.yaml, sc-VirusScan can be initiated as described below:

Activate the conda environment: conda activate sc-VirusScan
Once conda environment is activated, trigger the pipeline using following command:

snakemake --cores 16 --configfile config.yaml --latency-wait 60 --profile <Slurm_Profile_Name>

–cores: Cores to be specified for the pipeline (Minimum: 16)

–configfile: Path to the config.yaml file

–profile: If slurm profile available, specify the slurm profile name

Pipeline Outputs

On successfully completion of sc-VirusScan, following output file are generated in the respective directories as representated below.

├── results
│    ├── kraken2                                   #Kraken Classification Reports
│    │    ├── Sample1.kraken
│    │    └── Sample1.report.txt
│    │
│    ├── cellranger                                #CellRanger scRNAseq Analysis Intermediate Files
│    │  └── Sample1
│    │     ├── filtered_feature_bc_matrix
│    │     ├── raw_feature_bc_matrix
│    │     ├── possorted_genome_bam
│    │     ├── possorted_genome_bam.bai
│    │     ├── filtered_feature_bc_matrix.h5
│    │     └── raw_feature_bc_matrix.h5
│    │
│    ├── count_matrix                               #Final Count matrix result
│    │  └── Sample1/
│    │     └── count_matrix.tsv
│    │
│    └── kraken_reports                             #QC Reports and Plots based on Kraken2 Reports
│       ├── Familywise_tax_readcounts.tsv
│       ├── Specieswise_tax_readcounts.tsv
│       ├── Clustermap_Familywise_log10.png
│       └── Clustermap_Specieswise_log10.png
└── logs

The users can use the count_matrix.tsv file from the results directory along with the Cellranger barcodes in the sample wise directories under cellranger directory for further downstream single cell analysis using Seurat and ScanPy.