Skip to content

NGS pipeline for somatic variant calling from amplicon datasets

Notifications You must be signed in to change notification settings

AWGL/SomaticAmplicon

 
 

Repository files navigation

SomaticAmplicon

NGS pipeline for somatic variant calling from amplicon datasets

Supports Illumina paired-end only and must have resonable overlap (>10bp)

Uses BWA for Alignment, Pisces for Variant Calling and VEP for annotation.

Executables are stored as singularity containers (see envs/)

Run

Directory structure required for pipeline

/data
└── results
     └── seqId
         ├── panel1
         │   ├── sample1
         │   ├── sample2
         │   ├── sample3
         |   └── Gathered_Results
         |       └── Database
         |           └── upload_files
         └── panel2
             ├── sample1
             ├── sample2
             ├── sample3
         |   └── Gathered_Results
         |       └── Database
         |           └── upload_files

Script 1 runs in sample folder, requires fastq files split by lane

sbatch 1_SomaticAmplicon.sh

Assays

Validated for TruSightMyeloid, NGHS-101X (CRM), NGHS-102X (BRCA)

Configuration

The variables files within each panel folder e.g. NGHS-101X should contain variables stating whether or not to execute optional section of the pipeline. For example:

custom_coverage=false
custom_variants=true
generate_worksheets=false
merge_reports=false

Resources

Reference Genome: human_g1k_v37.fasta

SNPs/Indels for Realignment: 1000G_phase1.indels.b37.vcf, Mills_and_1000G_gold_standard.indels.b37.vcf and cosmic_78.indels.b37.vcf

SNPS/Indels for Evaluation: 1000G_omni2.5.b37.vcf, hapmap_3.3.b37.vcf cosmic_78.b37.vcf

VEP Cache: refseq37_v97

Issue with NGHS-102X

If NGHS-102X crashes as per #44 you can now navigate to the folder of the sample which has failed and run the following:

. <sample_id>.variables

echo -e "Sample\tBRCA1_500X\tBRCA2_500X\tBRCA1_100X\tBRCA2_100X" > /data/output/results/"$seqId"/"$panel"/"$seqId"_merged_coverage_report.txt

sbatch /data/diagnostics/pipelines/SomaticAmplicon/SomaticAmplicon-master/brca_rerun.sh

References

[1] https://github.com/Illumina/Pisces

About

NGS pipeline for somatic variant calling from amplicon datasets

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Shell 72.0%
  • Python 28.0%