A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11Mar 13, 2021Updated 5 years ago
Alternatives and similar repositories for LibriStutter
Users who are interested in LibriStutter are comparing it to the repositories listed below.
- Final semester project on Stuttered Speech recognition☆17Sep 29, 2017Updated 8 years ago
- StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disflue…☆19Feb 10, 2023Updated 3 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆10Jun 8, 2022Updated 3 years ago
- A list of compilers with some metadata.☆12May 31, 2024Updated last year
- ☆10Jun 23, 2023Updated 2 years ago
- ☆33Aug 22, 2024Updated last year
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P…☆14Feb 2, 2020Updated 6 years ago
- Numpydocs -> mkdocs friendly markdown☆12Jun 10, 2022Updated 3 years ago
- Lightweight Bayesian deep learning library for fast prototyping based on PyTorch☆14Feb 24, 2023Updated 3 years ago
- Autonomous Driving W/ Deep Reinforcement Learning in Lane Keeping - DDQN and SAC with kinematics/birdview-images☆13Mar 24, 2026Updated 3 weeks ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- Implementation of CTC alignment-based single step non-autoregressive transformer☆13Jun 2, 2023Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- Disfluency Detection using Auto-Correlational Neural Networks☆47Dec 23, 2020Updated 5 years ago
- ☆17Mar 1, 2024Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Jan 8, 2021Updated 5 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 5 months ago
- Automatic Detection of Potentially Idiomatic Expressions☆12Feb 19, 2021Updated 5 years ago
- a collection of random scripts that I have written and put on my path to save time or do cool stuff.☆25Jan 10, 2015Updated 11 years ago
- ☆11Oct 20, 2022Updated 3 years ago
- Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what …☆10Aug 19, 2014Updated 11 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR)☆13Oct 14, 2023Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- ☆11Mar 31, 2023Updated 3 years ago
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 10 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from…☆12Updated this week
- Reinforcement learning for self-driving in a 3D simulation☆20Dec 6, 2021Updated 4 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago