cgisky1980 / rwkv-tts-rs
RWKV-based Text-to-Speech implementation in Rust
☆26 Updated last month
Alternatives and similar repositories for rwkv-tts-rs
Users who are interested in rwkv-tts-rs are comparing it to the libraries listed below.
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the … ☆53 Updated 11 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts). ☆90 Updated last month
- Python runtime for WeTextProcessing (does not depend on Pynini) ☆36 Updated last week
- We Speech Transcript based on LLM, in 300 lines of code. ☆181 Updated 5 months ago
- Colab notebooks for Next-gen Kaldi ☆30 Updated last month
- Official implementation of the TTS model Lina-Speech ☆175 Updated 10 months ago
- noise reduction ☆17 Updated last year
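- A high-performance C++ speech inference framework built on the ONNXRuntime and LLama.cpp inference engines; it achieves real-time conversation with RTF<0.7 even on very low-performance edge devices. ☆29 Updated 2 weeks ago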
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English. ☆110 Updated 8 months ago
- ☆23 Updated last year
- ONNX Inference of Pyannote Segmentation ☆97 Updated 11 months ago
- Compute WER and SER for speech recognition evaluation ☆15 Updated this week
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice. ☆215 Updated 3 weeks ago
- ☆48 Updated 2 weeks ago
- Utilizes ONNX Runtime to transcribe audio into text. ☆59 Updated last week
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX. ☆15 Updated 11 months ago
- ☆102 Updated 2 months ago
- ☆124 Updated last month
- Text-audio foundation model from Boson AI ☆112 Updated 3 months ago
- CTC decoder with hotwords for ASR. ☆34 Updated 7 months ago
- SenseVoice-python: An enterprise-grade, open-source, multi-language ASR system from funasr, with onnxruntime ☆106 Updated 2 months ago
- Chinese and English Bilingual G2P ☆22 Updated 2 years ago
- Streaming Text to Speech Web UI ☆22 Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr. ☆120 Updated 2 years ago
- Python Wrapper of Silero VAD ☆62 Updated 6 months ago
- flow mirror models from JZX AI Labs ☆43 Updated last year
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances … ☆82 Updated 5 months ago
- ☆204 Updated last year
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization" ☆133 Updated 6 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆102 Updated last year