Skip to content
#

mfcc

Here are 114 public repositories matching this topic...

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can…

  • Updated Mar 3, 2023
  • Python

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

  • Updated Mar 9, 2020
  • Python

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".

  • Updated Apr 29, 2025
  • Python

Improve this page

Add a description, image, and links to the mfcc topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mfcc topic, visit your repo's landing page and select "manage topics."

Learn more