cgisky1980 / rwkv-tts-rsLinks
RWKV-based Text-to-Speech implementation in Rust
☆26Updated 3 months ago
Alternatives and similar repositories for rwkv-tts-rs
Users that are interested in rwkv-tts-rs are comparing it to the libraries listed below
Sorting:
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Updated last year
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆92Updated 3 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆46Updated 2 months ago
- Official implementation of the TTS model Lina-Speech☆176Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Updated last year
- MOSS-Speech is a true speech-to-speech large language model without text guidance.☆120Updated last month
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Updated last month
- Compute WER and SER for speech recognition evaluation☆23Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆183Updated 7 months ago
- ☆105Updated 3 months ago
- Colab notebooks for Next-gen Kaldi☆29Updated 3 months ago
- ☆23Updated last year
- ☆23Updated last year
- Streaming Text to Speech Web UI☆22Updated last year
- g2p for english tts☆19Updated 3 years ago
- CTC decoder with hotwords for ASR.☆34Updated 9 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆240Updated 2 months ago
- Python Wrapper of Silero VAD☆64Updated 8 months ago
- Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.☆62Updated 4 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 3 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆124Updated 2 years ago
- flow mirror models from JZX AI Labs☆43Updated last year
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆85Updated 7 months ago
- poorman's ar-dit tts☆43Updated last month
- noise reduction☆17Updated last year
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆28Updated last year
- 单独维护的中文TTS☆34Updated 3 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Updated last year
- Python的音频工具☆16Updated last month
- LEMAS‑TTS is a multilingual zero‑shot text‑to‑speech system, supporting 10 languages: Chinese English Spanish Russian French German Ital…☆85Updated 2 weeks ago