lucasnewman / vocos-mlxView external linksLinks
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆23Oct 30, 2024Updated last year
Alternatives and similar repositories for vocos-mlx
Users that are interested in vocos-mlx are comparing it to the libraries listed below
Sorting:
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Oct 8, 2024Updated last year
- Core ML Demos is an experimental Core ML app. It visualizes the inference results of ML models and can be used to benchmark ML models and…☆12Jan 8, 2026Updated last month
- ☆11Nov 7, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- ☆15Feb 6, 2026Updated last week
- ☆17Sep 22, 2025Updated 4 months ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 6 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Jun 28, 2024Updated last year
- python library with a set of tools for simple debugging of python programs☆19Apr 17, 2023Updated 2 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Audio transcription using mlx whisper and vad silence processing☆17Oct 14, 2024Updated last year
- Vocal Tract Area Estimation by Gradient Descent☆38Jul 16, 2023Updated 2 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆26May 29, 2024Updated last year
- ☆16Apr 24, 2025Updated 9 months ago
- MLX Tree Chat Power by MLX and Phi4bit for iOS LLM☆15Aug 15, 2024Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Sep 13, 2024Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Feb 9, 2026Updated last week
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆20Nov 19, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Nov 1, 2024Updated last year
- ☆14Jul 11, 2022Updated 3 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆20Apr 27, 2024Updated last year
- Speech-to-text transcription VST3/ARA plugin☆53Feb 2, 2026Updated 2 weeks ago
- source code of EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Test server code for Phi-2 model. support OpenAI API spec☆18Dec 15, 2023Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆42Feb 13, 2024Updated 2 years ago
- ☆17Jul 22, 2024Updated last year
- CMake modules to support compiling Apple Metal shaders as part of a CMake build system.☆22May 22, 2025Updated 8 months ago