SesameAILabs / wavtools
Record and stream WAV audio data in the browser across all platforms
☆30Updated 2 months ago
Alternatives and similar repositories for wavtools:
Users that are interested in wavtools are comparing it to the libraries listed below
- Faster Whisper with additional features☆40Updated last month
- Record and stream WAV audio data in the browser across all platforms☆80Updated 5 months ago
- Choose a topic, a music genre and wait for the agents to generate a song☆54Updated 9 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆49Updated 5 months ago
- ☆25Updated 3 months ago
- ☆38Updated 6 months ago
- Real-Time Voice Inference Web SDK☆214Updated last week
- SGLang is a fast serving framework for large language models and vision language models.☆17Updated 3 weeks ago
- ☆14Updated last month
- List of curated use cases built using Sesame's CSM 1B☆62Updated last month
- A Framework for Narrative Agents☆33Updated 6 months ago
- Example Optimizely clone created with GPT Pilot☆23Updated 5 months ago
- Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionalit…☆34Updated 4 months ago
- Model Context Protocol Servers (Browserbase Version)☆47Updated 4 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆129Updated 10 months ago
- kokoro text to speech using javascript☆55Updated 2 months ago
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated last year
- Collection of templates, guides, and best practices to help you get the most out of Browserbase.☆65Updated last month
- Model Context Protocol (MCP) Server for Langfuse Prompt Management. This server allows you to access and manage your Langfuse prompts thr…☆53Updated 2 months ago
- The JavaScript client for the Cartesia API.☆95Updated last week
- Replicate Flux LoRA image editor.☆50Updated 7 months ago
- ☆19Updated 2 months ago
- Code Interpreter Replica☆22Updated last year
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 4 months ago
- A library for generating structured JSON using GPT-4o.☆13Updated 8 months ago
- AI Tube (website)☆13Updated 7 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆28Updated 7 months ago
- Chat interface that searches the web for you real-time☆91Updated 6 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated last month
- A general AI Agent. Inspired by Manus☆42Updated 3 weeks ago