cgisky1980 / rwkv-tts-rsLinks
RWKV-based Text-to-Speech implementation in Rust
☆25Updated this week
Alternatives and similar repositories for rwkv-tts-rs
Users that are interested in rwkv-tts-rs are comparing it to the libraries listed below
Sorting:
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆50Updated 9 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆87Updated last week
- Python runtime for WeTextProcessing (does not depend on Pynini)☆33Updated 2 weeks ago
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 4 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆14Updated 9 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆103Updated 2 weeks ago
- Colab notebooks for Next-gen Kaldi☆29Updated last week
- Official implementation of the TTS model Lina-Speech☆170Updated 9 months ago
- ☆22Updated 11 months ago
- noise reduction☆17Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆110Updated 7 months ago
- ONNX Inference of Pyannote Segmentation☆93Updated 9 months ago
- ☆23Updated last year
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆27Updated 9 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆113Updated 2 years ago
- ☆100Updated 2 weeks ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- 单独维护的中文TTS☆35Updated 2 years ago
- A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation☆173Updated 2 weeks ago
- flow mirror models from JZX AI Labs☆44Updated last year
- Utilizes ONNX Runtime to transcribe audio into text.☆56Updated last month
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆78Updated 3 months ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆76Updated 11 months ago
- Python Wrapper of Silero VAD☆60Updated 5 months ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆84Updated 3 months ago
- CTC decoder with hotwords for ASR.☆27Updated 6 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆118Updated 4 months ago
- Streaming Text to Speech Web UI☆22Updated last year
- [NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆164Updated last week
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆150Updated last week