egorsmkv / optimized-whisper
Use quantized versions of Whisper to speed up inference
☆12Oct 16, 2024Updated last year
Alternatives and similar repositories for optimized-whisper
Users that are interested in optimized-whisper are comparing it to the libraries listed below
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Agent toolkit for 100 hours of speech and 10 GiB of text☆14Jul 15, 2025Updated 6 months ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Jul 25, 2024Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆24Oct 9, 2024Updated last year
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 7 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Sep 27, 2025Updated 4 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 3 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- ☆32Jan 9, 2024Updated 2 years ago
- A Semantically Consistent Dataset for Data-Efficient Query-Based Universal Sound Separation☆101Feb 5, 2026Updated last week
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆37Feb 24, 2025Updated 11 months ago
- Principal Component Analysis (PCA) in PyTorch.☆39Jul 10, 2025Updated 7 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- ☆35Sep 24, 2024Updated last year
- [ICASSP 2024] Official code for FreGrad☆35May 13, 2024Updated last year
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 3 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆43Oct 28, 2024Updated last year
- ☆40Jan 24, 2023Updated 3 years ago
- PASE: Phonologically Anchored Speech Enhancer☆37Dec 10, 2025Updated 2 months ago
- Top 3% in Kaggle housing competition☆10Feb 6, 2021Updated 5 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- A Windows based containerized Adobe After Effects renderer☆12Jan 21, 2024Updated 2 years ago
- Where is the "main theme" in an orchestral score?☆12Oct 25, 2025Updated 3 months ago
- Try to replicate the architecture of MiniMaxTTS mentioned in its technical report☆49Sep 2, 2025Updated 5 months ago
- Machine learning for control systems engineers☆11Jul 19, 2023Updated 2 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 9 months ago
- ☆12Jul 26, 2024Updated last year
- SocksSharp provides support for Socks4/4a/5 proxy servers to HttpClient☆12Feb 3, 2021Updated 5 years ago
- This is a fork of tortoise tts fast to easily create audio books locally on your computer☆12Apr 24, 2024Updated last year
- Amazon S3 tokenizer☆10Updated this week
- Punch Out Model Synthesis - a program for constraint based tiling generation☆18Feb 1, 2026Updated last week
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆49Nov 11, 2025Updated 3 months ago