This project focuses on developing a pipeline to classify respiratory sounds into diagnostic categories using the ICBHI 2017 dataset. The pipeline involves data preprocessing, feature extraction, model training, and evaluation. It supports binary and multi-class classification tasks. Please check out the instructions (PDF) or the deployed web app (Hugging Face) for more detailed information.
Given the limited time available, I tried to mirror how I would approach such a classification problem in practice. Therefore, I created a workflow that tests different options, from data processing to model architecture, to determine the best-performing model.
You can also check out (GitHub Actions) to view a limited implementation of a CI/CD pipeline.
- Create a Virtual Environment

```shell
conda create -n myenv
conda activate myenv
```

- Install Dependencies

```shell
pip install -r requirements.txt
```
Download the ICBHI 2017 dataset and put it inside the `data` folder in the main directory. If you are using Kaggle, you can easily add this dataset to your workspace.
|-data/
|-----demographic_info.txt
|-----Respiratory_Sound_Database/
|---------patient_diagnosis.csv
|---------filename_format.txt
|---------audio_and_txt_files/
|-------------*.wav
|-------------*.txt
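Before running anything, it can help to verify that the layout above is in place. A minimal sanity-check sketch (the `missing_entries` helper and the `data` default are my own, not part of the repository):

```python
# Hypothetical sanity check for the expected ICBHI dataset layout.
from pathlib import Path

EXPECTED = [
    "demographic_info.txt",
    "Respiratory_Sound_Database/patient_diagnosis.csv",
    "Respiratory_Sound_Database/filename_format.txt",
    "Respiratory_Sound_Database/audio_and_txt_files",
]

def missing_entries(data_dir="data"):
    """Return the expected files/folders that are absent under data_dir."""
    root = Path(data_dir)
    return [rel for rel in EXPECTED if not (root / rel).exists()]

if __name__ == "__main__":
    missing = missing_entries()
    print("Dataset layout OK" if not missing else f"Missing: {missing}")
```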
Based on my data exploration, the dataset is highly imbalanced. Some categories (e.g., Asthma) have only a handful of samples. Therefore, I aimed to do binary classification (Normal vs Abnormal) as well as multiclass classification over broader groups (Normal, Chronic Respiratory Diseases, and Respiratory Infections).
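As an illustration, the regrouping could look like the sketch below. The exact diagnosis-to-group mapping used by `Train.py` may differ; the group sets here are my assumption based on the ICBHI label names:

```python
# Hypothetical mapping from ICBHI diagnoses to binary / multiclass targets.
CHRONIC = {"COPD", "Asthma", "Bronchiectasis"}               # assumed grouping
INFECTIONS = {"Pneumonia", "Bronchiolitis", "URTI", "LRTI"}  # assumed grouping

def binary_label(diagnosis: str) -> str:
    """Normal vs Abnormal."""
    return "Normal" if diagnosis == "Healthy" else "Abnormal"

def multiclass_label(diagnosis: str) -> str:
    """Normal / Chronic Respiratory Disease / Respiratory Infection."""
    if diagnosis == "Healthy":
        return "Normal"
    if diagnosis in CHRONIC:
        return "Chronic Respiratory Disease"
    return "Respiratory Infection"
```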
- Running dataset exploration code:
python Exploration/inference.py --diagnosis_file ./data/Respiratory_Sound_Database/patient_diagnosis.csv --audio_path ./data/Respiratory_Sound_Database/testsample
As mentioned earlier, I tried to explore different options in the model/system design of such a project. Therefore, the Train.py script in the main directory contains different options that you can choose from. For example, you can run binary or multiclass classification using 3 different types of input: 1. MFCC, 2. Log-Mel Spectrum, 3. MFCC with augmented features.
- Note: you can skip this step, since trained models are already in the ./models/ directory.
The training process involves:
- Data preprocessing (filtering, sampling, and feature extraction).
- Splitting the dataset into training, validation, and test subsets.
- Oversampling using SMOTE to balance class distribution.
- Hyperparameter optimization using Optuna.
- Experiment tracking with MLflow.
- Model saving and evaluation.
Using the command line:
python Train.py --metadata_path data/Respiratory_Sound_Database/audio_and_txt_files --audio_files_path data/Respiratory_Sound_Database/audio_and_txt_files --demographic_path data/demographic_info.txt --diagnosis_path data/Respiratory_Sound_Database/patient_diagnosis.csv --classification_modes binary --feature_types mfcc
Or, for example, if you want to run all combinations of configurations:
python Train.py --metadata_path data/Respiratory_Sound_Database/audio_and_txt_files --audio_files_path data/Respiratory_Sound_Database/audio_and_txt_files --demographic_path data/demographic_info.txt --diagnosis_path data/Respiratory_Sound_Database/patient_diagnosis.csv --classification_modes binary multi --feature_types mfcc log_mel augmented
Alternatively, you can run in debug mode, which uses randomly generated data to check the functionality of different parts of the code.
python Train.py --debug
- Note: ./LegacyTraining/train.py incorporates everything in a single file and is suitable for use in Kaggle/Google notebooks.
- Models are saved in .h5 format inside ./models/
- Unseen testing samples (out-of-bag testing) are also stored as .npy files inside ./processed_datasets/. These files are already preprocessed; you only need to feed them to the model for testing.
- MLflow logs are available at ./mlruns/
- Test samples via .npy files:
python TestModels.py
This script automatically matches .npy files with .h5 models and performs the final evaluation of each model.
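The matching presumably relies on the shared `<mode>_<feature>` suffix in the file names; a sketch of that idea (this helper is hypothetical, not the actual `TestModels.py` logic):

```python
# Hypothetical pairing of out-of-bag .npy test files with trained .h5 models,
# based on the shared "<mode>_<feature>" suffix in the file names.
from pathlib import Path

def match_model(npy_name, models_dir="models"):
    """e.g. X_test_binary_mfcc.npy -> models/final_model_binary_mfcc.h5"""
    suffix = Path(npy_name).stem.replace("X_test_", "")
    return str(Path(models_dir) / f"final_model_{suffix}.h5")
```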
- I selected audio files inside the data/Respiratory_Sound_Database/testsample
directory. The patient_diagnosis.csv
file inside that folder shows the ground-truth labels. a) You can run python Model_Inference.py
, for example:
python Model_Inference.py ./data/Respiratory_Sound_Database/testsample/157_1b1_Al_sc_Meditron.wav
This script will iterate over all models and give you a general overview of each model's performance.
- Use the Hugging Face app to upload your audio file and see results!
To run the UI locally:
streamlit run app.py
Now, if you want to access the different metrics via Prometheus:
prometheus --config.file=prometheus.yml
- After running successfully, you can start your Grafana UI available at
http://localhost:3000
- From the sidebar, go to Data Sources (it should be under Connections)
- Here you can add Prometheus as a data source and add panels with queries to visualize metrics from Prometheus.
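For reference, a minimal `prometheus.yml` could look like the fragment below; the job name and target port are assumptions, so use the file shipped in the repository:

```yaml
global:
  scrape_interval: 15s        # how often Prometheus scrapes targets

scrape_configs:
  - job_name: "streamlit_app"              # hypothetical job name
    static_configs:
      - targets: ["localhost:8000"]        # assumed metrics endpoint of the app
```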
Main Directory
│ .gitignore # ignores some directories when pushing to git
│ app.py # main Streamlit UI for deployment
│ Model_Inference.py # testing the model
│ prometheus.yml # settings to run Prometheus
│ README.md # readme
│ requirements.txt # requirements to run the code
│ TestModels.py # testing models
│ Train.py # running the workflow and training models
│
├───.github
│ └───workflows
│ cicd.yaml # for github actions
├───data
│ │ demographic_info.txt
│ ├───Respiratory_Sound_Database
│ │ │ filename_differences.txt
│ │ │ filename_format.txt
│ │ │ patient_diagnosis.csv
│ │ │
│ │ └───testsample #sample audio files
│ │ 215_1b3_Tc_sc_Meditron.txt
│ │ 215_1b3_Tc_sc_Meditron.wav
│ │ patient_diagnosis.csv
│ │ ....
├───Exploration
│ │ inference.py # data exploration
│ │
│ ├───LegacyTraining #codes for training in single file --> for Kaggle
│ │ train.py
│ │
├───models # final trained models
│ │ final_model_binary_augmented.h5
│ │ final_model_binary_log_mel.h5
│ │ ....
│ │
├───processed_datasets # unseen test samples to evaluate models
│ │ X_test_binary_augmented.npy
│ │ X_test_binary_log_mel.npy
│ │ X_test_binary_mfcc.npy
│ │ ....
│ │
├───streamlit_ui # different components of the UI
│ │ │ data_exploration.py #for second page, to explore data
│ │ │ model_deployment.py
│ │ │ model_evaluation_summary.csv
│ │ │ model_performance.py
│ │ │ readme.py # first landing page
│ │ │ style.css
│ │ │
│ │ ├───img
│ │ │ deployment.png
│ │ │ logo.png
│ │ │ training.png
│ │
├───utils # utility functions for training
│ │ │ audioprocessing.py
│ │ │ augmentation.py
│ │ │ data_loader.py
│ │ │ evaluation.py
│ │ │ model_utils.py
│ │
└───src # extra images and a PDF file