SesameAILabs / wavtools
Record and stream WAV audio data in the browser across all platforms
☆37Updated 7 months ago
Alternatives and similar repositories for wavtools
Users who are interested in wavtools are comparing it to the libraries listed below.
- Faster Whisper with additional features☆46Updated 5 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆59Updated 10 months ago
- Voice Agent Framework for Conversational AI☆62Updated 3 months ago
- Record and stream WAV audio data in the browser across all platforms☆88Updated 9 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆20Updated 3 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆390Updated 3 weeks ago
- List of curated use cases built using Sesame's CSM 1B☆70Updated 3 months ago
- ML-powered speech synthesis directly in your browser☆165Updated 6 months ago
- kokoro text to speech using javascript☆60Updated 7 months ago
- 💻 An MCP for Claude to control your computer (probably a bad idea)☆70Updated 3 weeks ago
- Real-Time Voice Inference Web SDK☆280Updated this week
- ☆631Updated last month
- The JavaScript client for the Cartesia API.☆107Updated 2 months ago
- ☆83Updated last week
- ☆91Updated 3 months ago
- ☆48Updated 3 weeks ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆214Updated 10 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- Streaming and Fine-tuning for Chatterbox TTS☆164Updated 2 months ago
- Web client SDK for Ultravox.☆26Updated last week
- Example applications and code snippets for LiveKit Agents☆141Updated this week
- ☆69Updated 2 months ago
- ☆41Updated 11 months ago
- Simulates talk with an AI that can express emotions☆78Updated 2 months ago
- Model Context Protocol server for Replicate's API☆83Updated 3 months ago
- Model Context Protocol Servers (Browserbase Version)☆49Updated 9 months ago
- Replicate Flux LoRA image editor.☆52Updated last year
- ☆14Updated 7 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆114Updated 3 weeks ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆63Updated 5 months ago