SesameAILabs / wavtoolsLinks
Record and stream WAV audio data in the browser across all platforms
☆32Updated 4 months ago
Alternatives and similar repositories for wavtools
Users that are interested in wavtools are comparing it to the libraries listed below
Sorting:
- Faster Whisper with additional features☆44Updated 2 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆20Updated 2 weeks ago
- Record and stream WAV audio data in the browser across all platforms☆81Updated 6 months ago
- List of curated use cases built using Sesame's CSM 1B☆66Updated this week
- ☆15Updated 3 weeks ago
- Replicate Flux LoRA image editor.☆51Updated 9 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆53Updated 7 months ago
- ☆28Updated last month
- ☆38Updated 8 months ago
- Text-to-Music Generation with Rectified Flow Transformer☆8Updated 9 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆132Updated 11 months ago
- MCP server implementation for Telegram☆14Updated 2 weeks ago
- ☆37Updated this week
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated last year
- Voice Agent Framework for Conversational AI☆50Updated 3 weeks ago
- A Framework for Narrative Agents☆35Updated 8 months ago
- AI Testing Agent: Open Source AI Agent for Software Testing☆22Updated 5 months ago
- StoryDiffusion serverless worker☆17Updated last year
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆69Updated last year
- Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionalit…☆40Updated 6 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆23Updated this week
- ☆25Updated last year
- Simple text to phones converter using eSpeak NG.☆29Updated 4 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆344Updated last month
- Finetune Sesame's CSM 1B model, for fun and profit☆16Updated 2 months ago
- API server for Instant voice cloning by MyShell.☆93Updated 8 months ago
- kokoro text to speech using javascript☆57Updated 4 months ago
- Collection of templates, guides, and best practices to help you get the most out of Browserbase.☆75Updated 2 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆12Updated last year
- Real-Time Voice Inference Web SDK☆241Updated this week