SesameAILabs / wavtools
Record and stream WAV audio data in the browser across all platforms
☆31 · Updated 3 months ago
Alternatives and similar repositories for wavtools
Users who are interested in wavtools are comparing it to the libraries listed below.
- Faster Whisper with additional features ☆44 · Updated 2 months ago
- Record and stream WAV audio data in the browser across all platforms ☆81 · Updated 6 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆52 · Updated 6 months ago
- ☆38 · Updated 7 months ago
- SGLang is a fast serving framework for large language models and vision language models. ☆19 · Updated this week
- List of curated use cases built using Sesame's CSM 1B ☆67 · Updated last month
- Model Context Protocol Servers (Browserbase Version) ☆47 · Updated 5 months ago
- Choose a topic, a music genre and wait for the agents to generate a song ☆55 · Updated 10 months ago
- ☆19 · Updated last week
- Collection of templates, guides, and best practices to help you get the most out of Browserbase. ☆73 · Updated 2 months ago
- Node.js SDK for Browserbase ☆46 · Updated last week
- Replicate Flux LoRA image editor. ☆51 · Updated 8 months ago
- ☆29 · Updated 8 months ago
- Developer showcase of projects built on Cartesia ☆17 · Updated 8 months ago
- ☆34 · Updated last month
- ☆36 · Updated last week
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio ☆67 · Updated last week
- kokoro text to speech using javascript ☆56 · Updated 3 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API… ☆29 · Updated 8 months ago
- React app for inspecting, building and debugging with the Realtime API ☆37 · Updated 7 months ago
- A Framework for Narrative Agents ☆34 · Updated 7 months ago
- The JavaScript client for the Cartesia API. ☆97 · Updated last week
- An AI-powered Snake game where Claude, an advanced language model, controls the serpent in real-time, showcasing intelligent decision-mak… ☆44 · Updated 6 months ago
- Code generator using LlamaIndexTS workflows with OpenAI o1 model ☆52 · Updated 3 months ago
- Voice Agent Framework for Conversational AI ☆47 · Updated last week
- ☆22 · Updated 10 months ago
- ☆23 · Updated 2 months ago
- Voice data <= 10 mins can also be used to train a good VC model! ☆12 · Updated last year
- make your own NotebookLM clone with OpenAI + ElevenLabs + Cartesia ☆32 · Updated 6 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆47 · Updated 2 months ago