antirez / voxtral.cLinks
Pure C inference of Mistral Voxtral Realtime 4B speech to text model
☆290Updated this week
Alternatives and similar repositories for voxtral.c
Users that are interested in voxtral.c are comparing it to the libraries listed below
Sorting:
- Flux 2 image generation model pure C inference☆1,632Updated this week
- Very fast, accurate speaker diarization☆223Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆388Updated 2 weeks ago
- A high quality and fast TTS repository☆498Updated last month
- A simple, hackable text-to-speech system in PyTorch and MLX☆187Updated 6 months ago
- ☆63Updated last year
- ☆346Updated 5 months ago
- Open Audio Watermarking Tool☆465Updated last month
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- Heirarchical Navigable Small Worlds☆101Updated 6 months ago
- Fast audio super resolution from 16khz to 48khz.☆192Updated last month
- Train your own speech AI model from scratch☆137Updated last week
- TTS support with GGML☆218Updated 4 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆110Updated 2 months ago
- A character-level language diffusion model trained on Tiny Shakespeare☆849Updated 3 weeks ago
- Soprano-Factory: Train your own 2000x realtime text-to-speech model☆203Updated 3 weeks ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆203Updated 4 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆348Updated 9 months ago
- On-device streaming text-to-speech engine powered by deep learning☆128Updated 2 weeks ago
- Official Rust Implementation of Model2Vec☆152Updated this week
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 9 months ago
- Live-bending a foundation model’s output at neural network level.☆273Updated 10 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆200Updated 11 months ago
- DACVAE☆191Updated last month
- A highly compressive and high-quality neural audio codec for speech models.☆250Updated 2 weeks ago
- High-Performance Implementation of OpenAI's TikToken.☆467Updated 7 months ago
- I publish my weekly research here☆20Updated 7 months ago
- A random walk voice style cloning application for Kokoro text to speech☆205Updated 7 months ago
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,164Updated 3 weeks ago
- ☆50Updated 3 months ago