A machine learning approach to building a robust speaker recognition model using MFCC features and a GMM universal background model.
☆15May 30, 2020Updated 5 years ago
Alternatives and similar repositories for Speaker-Recognition
Users that are interested in Speaker-Recognition are comparing it to the libraries listed below.
- Automatic Speaker Recognition algorithms in Python☆96Sep 25, 2021Updated 4 years ago
- brainless concatenative text to speech☆14May 11, 2021Updated 4 years ago
- Source code for paper "Breaking Security-Critical Voice Authentication".☆13Jul 10, 2023Updated 2 years ago
- Speaker identification using voice MFCCs and GMM☆54Dec 13, 2020Updated 5 years ago
- Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…☆21Mar 1, 2018Updated 8 years ago
- Create reliability diagrams to quantify ML calibration.☆10Feb 1, 2022Updated 4 years ago
- Respiratory Disorder Classification Based on Lung Auscultation sounds☆13Oct 22, 2024Updated last year
- System for identifying the speaker of a given speech signal using MFCC and LPC features and Gaussian Mixture Models☆21Nov 5, 2017Updated 8 years ago
- Human age estimation using deep neural networks (Keras)☆14Aug 10, 2023Updated 2 years ago
- [CVPR2024] Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation☆19Sep 3, 2024Updated last year
- Python interface to Optotune focus-tunable lenses☆13Feb 4, 2020Updated 6 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆13Feb 22, 2025Updated last year
- Utility to mass-download a Twitch streamer's clips. Supports both local storage and direct upload to Google Drive☆12Dec 20, 2023Updated 2 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Mar 22, 2017Updated 9 years ago
- Interoperability for Grasshopper and Revit☆22Aug 18, 2017Updated 8 years ago
- This model is built using GMM and MFCC and was tested on Hindi/English audio samples with good resulting accuracy.☆15Jun 10, 2020Updated 5 years ago
- Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"☆14Nov 13, 2023Updated 2 years ago
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆18Sep 25, 2022Updated 3 years ago
- Tic Tac Toe game with socket programming and pygame☆10Jan 6, 2024Updated 2 years ago
- A rough and ready Python utility which splits audio files based on silence and desired min/max chunk duration.☆16Jun 22, 2022Updated 3 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Feb 22, 2025Updated last year
- Official PyTorch Implementation of RA-TTA (ICLR25)☆26Apr 19, 2025Updated last year
- Adafruit CircuitPython module for the MPR121 capacitive touch breakout board.☆19Updated this week
- Course materials and discussion forum☆10Sep 6, 2016Updated 9 years ago
- Download twitch vods, clips, and render videos with chat.☆26Feb 7, 2026Updated 2 months ago
- This repo summarizes the courses and materials for speech signal processing. Pull requests are welcome.☆100Jul 20, 2020Updated 5 years ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆25Jan 9, 2024Updated 2 years ago
- Knock your images before you get stressed.☆11Jan 9, 2022Updated 4 years ago
- MultiSV: scripts for data preparation☆30Jan 18, 2025Updated last year
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 7 months ago
- [ICML 2025] DPCore: Dynamic Prompt Coreset for Continual Test-Time Adaptation☆29Feb 27, 2026Updated 2 months ago
- A framework for implementing equivariant DL☆10May 25, 2021Updated 4 years ago
- Dummy project to test your Open3D build☆10May 6, 2021Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- Find ground breaking 3D point cloud analysis papers☆13Jul 28, 2020Updated 5 years ago
- CS231n Spring 2022 assignment code implementations☆20Aug 20, 2023Updated 2 years ago
- MummyIsland is a 3D game written in Python using PyOpenGL and Pygame.☆32Feb 22, 2023Updated 3 years ago
- This repository contains the code associated with the paper: "Who Are You (I Really Wanna Know)? Detecting Audio DeepFakes Through Vocal …☆25Dec 14, 2023Updated 2 years ago
- 🚀 Implementation of SO-SLAM [unofficial]☆32May 10, 2023Updated 2 years ago