SesameAILabs / wavtools
Record and stream WAV audio data in the browser across all platforms
☆36 Updated 7 months ago
Alternatives and similar repositories for wavtools
Users who are interested in wavtools are comparing it to the libraries listed below.
- Faster Whisper with additional features ☆46 Updated 6 months ago
- SGLang is a fast serving framework for large language models and vision language models. ☆21 Updated 4 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆58 Updated 10 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT ☆405 Updated last month
- Voice Agent Framework for Conversational AI ☆63 Updated 4 months ago
- List of curated use cases built using Sesame's CSM 1B ☆73 Updated 3 months ago
- Record and stream WAV audio data in the browser across all platforms ☆88 Updated 10 months ago
- The JavaScript client for the Cartesia API. ☆112 Updated 2 weeks ago
- Real-Time Voice Inference Web SDK ☆287 Updated this week
- Exemplar uses of hyper-realistic rime voices as livekit agents with fine-tuned prompts ☆33 Updated last week
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆118 Updated 2 weeks ago
- ☆632 Updated last month
- kokoro text to speech using javascript ☆62 Updated 7 months ago
- Sesame CSM 1B Voice Cloning ☆323 Updated 6 months ago
- Realtime demo, Streaming and Finetuning code for CSM ☆394 Updated last week
- ☆250 Updated 3 weeks ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks ☆215 Updated 11 months ago
- Demonstrates how to protect your OpenAI API Key using a Cloudflare Worker to serve your ephemeral token and then do client side tool call… ☆318 Updated 7 months ago
- ☆14 Updated 8 months ago
- The official Cartesia client for Python. ☆104 Updated this week
- Build Agents That Recall What Matters. Systematically engineer relevant context from chat history & business data. (TypeScript Client) ☆63 Updated last month
- Template for creating Ultravox demo that gets deployed to Vercel. ☆18 Updated 6 months ago
- An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX. ☆377 Updated last month
- MCP server for macOS text-to-speech functionality ☆17 Updated 8 months ago
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser ☆108 Updated 2 months ago
- A simple voice assistant example built with Next.js and LiveKit React Components ☆286 Updated this week
- Groq MCP server ☆30 Updated 2 weeks ago
- Example projects built with the Hume AI APIs ☆222 Updated this week
- Real-Time Transcription Using OpenAI Whisper ☆296 Updated 6 months ago
- The ElevenLabs Agents SDK for TypeScript. ☆52 Updated this week