SesameAILabs / wavtoolsLinks
Record and stream WAV audio data in the browser across all platforms
☆37Updated 8 months ago
Alternatives and similar repositories for wavtools
Users that are interested in wavtools are comparing it to the libraries listed below
Sorting:
- Faster Whisper with additional features☆47Updated 7 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆21Updated 5 months ago
- Record and stream WAV audio data in the browser across all platforms☆91Updated 11 months ago
- List of curated use cases built using Sesame's CSM 1B☆73Updated 4 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆59Updated 11 months ago
- kokoro text to speech using javascript☆62Updated 8 months ago
- The JavaScript client for the Cartesia API.☆112Updated last month
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆415Updated 3 weeks ago
- Play.ht's Text to Speech API☆92Updated 2 months ago
- ☆272Updated last month
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆217Updated last year
- Real-Time Voice Inference Web SDK☆288Updated last week
- Streaming Avatar SDK☆95Updated last month
- Simulates talk with an AI that can express emotions☆80Updated 4 months ago
- A cog implementation of Meta's MusicGen models☆102Updated last year
- ML-powered speech synthesis directly in your browser☆166Updated 8 months ago
- Joint speech-language model - respond directly to audio!☆371Updated last year
- Groq MCP server☆31Updated last month
- Web client SDK for Ultravox.☆26Updated last week
- Voice Agent Framework for Conversational AI☆63Updated 5 months ago
- ☆91Updated 5 months ago
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆217Updated last year
- Sesame CSM 1B Voice Cloning☆323Updated 7 months ago
- ☆634Updated 2 months ago
- ☆42Updated 6 months ago
- ☆41Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆12Updated 2 years ago
- ☆49Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆138Updated last year
- Realtime demo, Streaming and Finetuning code for CSM☆405Updated last month