Parameters
Input/output options
Define where the pipeline should find input data and save output data.
| Parameter | Description | Default |
|---|---|---|
dataset_name | The name of this dataset, used in output files and within visualisations | |
input | Path to comma-separated file containing information about the samples in the experiment. TODO: finish samplesheet | |
outdir | The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure. | results |
Pipeline parameters
| Parameter | Description | Default |
|---|---|---|
mod_method | The method used to call the RNA modification (accepted: dorado|m6anet) | dorado |
diff_method | The method used to call differentially expressed sites (accepted: lmer|modkit) | lmer |
genome | The reference genome, in .fasta format | |
transcriptome | The reference transcriptome, in .fasta format | |
min_reads | Minimum reads needed at a site (across all samples) to be called | 20 |
prob_threshold | Minimum probability detected at a site to be called | 0 |
Samplesheet
The samplesheet is a CSV file which contains information about the samples to be analysed in the pipeline. It should have the following columns. A header is required. Optional columns should be left empty.
name: a unique name for each samplegroup: the experimental group or condition for each samplepath_dorado: (optional) path to pre-basecalled Dorado modification data for each samplepath_m6anet: (optional) path to pre-basecalled m6Anet modification data for each samplepath_pod5: (optional) path to raw POD5
If you call modifications using Dorado, you must provide either path_dorado or path_pod5 for each sample. If you call modifications using m6Anet, you must provide either path_m6anet or path_pod5 for each sample.
Two groups should be provided to call differential modifications between conditions. Group names should be alphanumeric and without spaces. The underlying model take the first group alphabetically as the reference level, and the second group alphabetically as the treatment level.
An example samplesheet is shown below:
name,group,path_dorado,path_m6anet,path_pod5
sample1,group1,,,pod5_path
| name | group | path_dorado | path_m6anet | path_pod5 |
|---|---|---|---|---|
| sample1 | group1 | pod5_path |