cgisky1980 / rwkv-tts-rsLinks
RWKV-based Text-to-Speech implementation in Rust
☆22Updated this week
Alternatives and similar repositories for rwkv-tts-rs
Users that are interested in rwkv-tts-rs are comparing it to the libraries listed below
Sorting:
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆51Updated 9 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆32Updated 2 weeks ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆84Updated last week
- Official implementation of the TTS model Lina-Speech☆169Updated 8 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆176Updated 3 months ago
- ONNX Inference of Pyannote Segmentation☆93Updated 9 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆108Updated 6 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆101Updated last year
- Colab notebooks for Next-gen Kaldi☆28Updated 3 weeks ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆14Updated 9 months ago
- Port of Funasr's Paraformer model in C/C++☆35Updated last year
- noise reduction☆17Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆113Updated 2 years ago
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆83Updated 2 months ago
- ☆22Updated 10 months ago
- Python的音频工具☆16Updated 10 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆87Updated last year
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆156Updated last month
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆144Updated this week
- ☆98Updated 2 months ago
- A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation☆159Updated last week
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆15Updated 2 weeks ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆117Updated 3 months ago
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆27Updated 8 months ago
- flow mirror models from JZX AI Labs☆44Updated 11 months ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆76Updated 11 months ago
- Python Wrapper of Silero VAD☆60Updated 4 months ago
- ☆123Updated 3 weeks ago