Automatic NGS Pipeline Monitoring and Quality Control

Introduction

A software system to monitor the status of in house pipeline at AWGS and view and approve quality control data from NGS pipelines.

Project Workflow

The software consists of several modules and configuration files:

Pipelines (pipelines/). This module consists of different classes which each represent a different pipeline run at AWMGL. These classes have methods for detecting whether a pipeline has completed ina valid manner and for collecting relevant QC metrics. The module also contains parsers.py which contains functions for parsing common QC files used in NGS. These classes can be configured for example changing the expected files each pipeline produces using the confi/config.yaml files.
config/*.yaml. Specific pipeline configuration. Each pipeline gets a key in the config. The key is a concatanation of the pipeline name, pipeline version and panel. For example GermlineEnrichment-2.5.3-IlluminaTruSightCancer. This way multiple pipelines can use the same class in the pipelines module.
update_database.py. Script for updating the database.

Install

Works on Linux/Mac OS

The software is a Django application using Python 3. It is recommended that the software be deployed in a conda virtual environment.

First install Conda/Miniconda from [1]. Then type the following commands in your terminal to install and setup the application.

git clone https://github.com/AWGL/auto_qc.git

cd auto_qc

conda env create -f env/main.yaml

source activate auto_qc

python manage.py migrate

# For Auto QC database 
python manage.py makemigrations qc_database

#For SampleSheet Generator
python manage.py makemigrations sample_sheet

python manage.py migrate

Configure

There are several files which need to be configured to get the application to run:

mysite/settings.py - Update the variable CONFIG_PATH to point to the config yaml below.
mysite/settings.py - Uncomment the first DATABASES variable surronded by """ """, and comment out the second DATABASES variable.
config/config.yaml - Pipeline specific variables - see example for how to set this up.

Update

To update the database the following script will need to be run:

raw_data_dir = raw data from sequencer e.g. bcl, Interop, SampleSheet.csv etc

fastq_data_dir = directory with fastqs

results_dir = directory with pipeline output

config = YAML config file specifiying pipeline specific variables

python manage.py update_database --raw_data_dir /media/joseph/Storage/data/archive/nextseq \
								--config config/config_local.yaml

It is recommended you set up a cronjob to automate the update of the database.

Test

python manage.py test

Run Webapp

python manage.py runserver

Login Locally

python manage.py createsuperuser

Adding fixtures to the SampleSheet Generator

python manage.py loaddata sample_sheet/fixtures/referraltype.json 
python manage.py loaddata sample_sheet/fixtures/assay.json

# Dump the assay content added in the django /admin app into the json file
python manage.py dumpdata sample_sheet.assay > sample_sheet/fixtures/assay.json 
python manage.py dumpdata sample_sheet.referraltype > sample_sheet/fixtures/referraltype.json

Schema

Database schema are available for:

the AutoQC database
the Samplesheet Generator database

API

A REST API is provided to query runs and samples e.g.

http GET http://127.0.0.1:8000/api/sample-analyses/pipelines/DragenWGS-master/runs/200327_A00748_0019_AHL3JHDRXX/ 'Accept: application/json' 'Authorization: $your_key'

Create an API key in the admin panel

References

[1] https://conda.io/miniconda.html

Name		Name	Last commit message	Last commit date
Latest commit History 730 Commits
.github/PULL_REQUEST_TEMPLATE		.github/PULL_REQUEST_TEMPLATE
auto_qc_queries		auto_qc_queries
config		config
deploy		deploy
env		env
mysite		mysite
pipelines		pipelines
qc_database		qc_database
sample_sheet		sample_sheet
schema		schema
static		static
test_data		test_data
.gitignore		.gitignore
changelog.md		changelog.md
manage.py		manage.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic NGS Pipeline Monitoring and Quality Control

Introduction

Project Workflow

Install

Configure

Update

Test

Run Webapp

Login Locally

Adding fixtures to the SampleSheet Generator

Schema

API

References

About

Releases 10

Packages

Contributors 14

Languages

AWGL/auto_qc

Folders and files

Latest commit

History

Repository files navigation

Automatic NGS Pipeline Monitoring and Quality Control

Introduction

Project Workflow

Install

Configure

Update

Test

Run Webapp

Login Locally

Adding fixtures to the SampleSheet Generator

Schema

API

References

About

Resources

Stars

Watchers

Forks

Releases 10

Packages 0

Contributors 14

Languages

Packages