ddbourgin / numpy-ml

Machine learning, in numpy

machine-learning reinforcement-learning word2vec lstm neural-networks gaussian-mixture-models vae topic-modeling attention resnet bayesian-inference wavenet mfcc knn gaussian-processes hidden-markov-models gradient-boosting wgan-gp good-turing-smoothing

Updated Oct 29, 2023
Python

x4nth055 / emotion-recognition-using-speech

Building and training a speech emotion recognizer that predicts human emotions, using Python, scikit-learn, and Keras

machine-learning deep-learning sklearn keras recurrent-neural-networks feature-extraction neural-networks support-vector-machine mfcc librosa emotion-detection gradient-boosting emotion-recognition kneighborsclassifier random-forest-classifier mlp-classifier speech-emotion-recognition emotion-recognizer

Updated Nov 3, 2023
Python

SuperKogito / spafe

🔉 spafe: Simplified Python Audio Features Extraction

Updated Mar 20, 2025
Python

gionanide / Speech_Signal_Processing_and_Classification

Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interested in voice disorder classification. That is, to develop two-class classifiers, which can…

nlp classifier natural-language-processing feature-extraction nltk gaussian-mixture-models support-vector-machines mfcc principal-component-analysis speech-processing linear-discriminant-analysis isomap spectral-clustering long-short-term-memory kernel-pca spectral-embedding locally-linear-embedding linear-prediction-coefficients speech-utterance

Updated Mar 3, 2023
Python

jsingh811 / pyAudioProcessing

Audio feature extraction and classification

classifier audio-files feature-extraction audio-data mfcc hyperparameter-tuning wav-files classify mfcc-features mfcc-extractor classify-audio gfcc gfcc-features gfcc-extractor spectral-features chroma-features classifier-options classify-audio-samples pyaudioprocessing

Updated Jul 6, 2023
Python

SuperKogito / Voice-based-gender-recognition

🔉 👦 👧 Voice-based gender recognition using Mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM)

data-science machine-learning scikit-learn voice speech gaussian-mixture-models signal gender-recognition gender gmm mfcc speaker gender-classification vocal gender-recognition-by-voice gender-detection mel-frequencies scikit-learn-python

Updated Jul 6, 2023
Python

sp-nitech / diffsptk

A differentiable version of SPTK

Updated Apr 24, 2025
Python

tympanix / subsync

Synchronize your subtitles using machine learning

machine-learning neural-network delay subtitles subtitle fix mfcc shift subsync speech-detection shift-subtitle

Updated Sep 18, 2023
Python

ZhuoZhuoCrayon / AcousticKeyBoard-Web

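❓ Acoustic keyboard | A wild idea: build a "toy" that can recognize which key was pressed from the sound of the keystroke, as a way to learn signal processing / deep learning / Android / Django.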

django deep-learning tensorflow lstm mfcc

Updated Jan 26, 2025
Python

GauravWaghmare / Speaker-Identification

A program for automatic speaker identification using deep learning techniques.

keras mfcc speaker-recognition speaker-verification

Updated Feb 28, 2017
Python

MycroftAI / sonopy

A simple audio feature extraction library

library sound spectrogram mfcc audio-processing mel-spectrogram

Updated Jul 3, 2019
Python

ZitengWang / python_kaldi_features

Python code to extract MFCC and FBANK speech features for Kaldi

kaldi mfcc

Updated Nov 28, 2018
Python

k-farruh / speech-accent-detection

Everyone speaks a language with an accent, and a particular accent reflects the speaker's linguistic background. The model identifies the accent from an audio recording. Its output could be used to detect accents and to help English learners reduce or improve their accents through training.

mfcc accent accent-detection native-speakers english-languages

Updated Nov 9, 2021
Python

georgid / AlignmentDuration

Lyrics-to-audio alignment system based on machine learning algorithms: hidden Markov models with Viterbi forced alignment. The alignment is explicitly aware of the durations of musical notes. The phonetic models are classified with an MLP deep neural network.

python music duration synchronization research deep-learning signal-processing lyrics decoding music-information-retrieval neural-networks alignment hidden-markov-model gmm mfcc upf htk

Updated Mar 9, 2020
Python

SuperKogito / Voice-based-speaker-identification

🔉 👦 👧 👩 👨 Speaker identification using voice MFCCs and GMM

machine-learning scikit-learn voice speech gaussian-mixture-models signal gmm mfcc speaker-recognition vocal mel-frequencies speaker-identification mel-frequency-cepstral-coefficients scikit-learn-python

Updated Dec 13, 2020
Python

stefantaubert / mel-cepstral-distance

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".