cgisky1980 / rwkv-tts-rsLinks
RWKV-based Text-to-Speech implementation in Rust
☆26Updated 2 months ago
Alternatives and similar repositories for rwkv-tts-rs
Users that are interested in rwkv-tts-rs are comparing it to the libraries listed below
Sorting:
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆181Updated 6 months ago
- Colab notebooks for Next-gen Kaldi☆29Updated 2 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 2 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆91Updated 2 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆112Updated 3 weeks ago
- Official implementation of the TTS model Lina-Speech☆175Updated 11 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆231Updated last month
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Updated last year
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆120Updated 2 years ago
- RAG SYSTEM FOR RWKV☆51Updated last year
- MOSS-Speech is a true speech-to-speech large language model without text guidance.☆112Updated 3 weeks ago
- ☆23Updated last year
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆215Updated 10 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆220Updated 11 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆39Updated last month
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆28Updated 11 months ago
- flow mirror models from JZX AI Labs☆43Updated last year
- Utilizes ONNX Runtime to transcribe audio into text.☆63Updated last week
- CTC decoder with hotwords for ASR.☆34Updated 8 months ago
- LongCat Audio Tokenizer and Detokenizer☆264Updated 2 weeks ago
- Utilizes ONNX Runtime for speech activity detection.☆38Updated 2 weeks ago
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆42Updated 8 months ago
- Compute WER and SER for speech recognition evaluation☆17Updated 2 weeks ago
- 基于ONNXRuntime以及LLama.cpp推理引擎实现的高性能C++语音推理框架,在性能极差的边缘设备上都能做到RTF<0.7实时对话。☆31Updated this week
- ☆204Updated last year
- Running the F5-TTS by ONNX Runtime☆186Updated last month
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆503Updated this week