talker93 / oneMinTTSLinks
Launch your speech synthesis within one minute.
☆12Updated last year
Alternatives and similar repositories for oneMinTTS
Users that are interested in oneMinTTS are comparing it to the libraries listed below
Sorting:
- Project of Singing Voice Conversion.☆15Updated 2 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
- a lightweight voice conversion☆86Updated last year
- A fork of sinsy: HMM/DNN-based singing voice synthesis system☆72Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated last week
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Updated last year
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆37Updated 3 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 8 months ago
- ☆11Updated last year
- Real-time end-to-end singing voice convertion☆23Updated last year
- A minimum inference engine for DiffSinger☆37Updated last year
- ☆15Updated 11 months ago
- ☆69Updated last year
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 months ago
- singing voice conversion without f0☆23Updated 2 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated last year
- Export the STFT or ISTFT process in ONNX format.☆40Updated 2 months ago
- AudioSR-Upsampling (any -> 48kHz)☆42Updated last year
- Streaming Audio Models Examples in JS☆19Updated last year
- Multispeaker Community Vocoder Model for DiffSinger☆39Updated 5 months ago
- Perform the forced decoding with target transcription☆11Updated 7 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Updated 3 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 9 months ago
- ☆53Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Updated 9 months ago