deepgram / deepgram-js-captions
This package is the JavaScript implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
☆14Updated 11 months ago
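To illustrate the kind of output described above, here is a minimal sketch of SRT formatting in plain JavaScript. It does not use the package itself; the function names `srtTimestamp` and `toSrt` and the `{ start, end, text }` cue shape are hypothetical, chosen only for this example, and the package's real API and input format may differ.

```javascript
// Hypothetical helper (not part of the package): format a time in seconds
// as an SRT timestamp of the form HH:MM:SS,mmm.
function srtTimestamp(seconds) {
  const ms = Math.round(seconds * 1000);
  const pad = (n, width = 2) => String(n).padStart(width, "0");
  const h = Math.floor(ms / 3600000);
  const m = Math.floor((ms % 3600000) / 60000);
  const s = Math.floor((ms % 60000) / 1000);
  return `${pad(h)}:${pad(m)}:${pad(s)},${pad(ms % 1000, 3)}`;
}

// Hypothetical cue shape: { start, end, text }, with times in seconds.
// Each SRT cue is a 1-based index, a timing line, and the caption text;
// cues are separated by a blank line.
function toSrt(cues) {
  return cues
    .map(
      (cue, i) =>
        `${i + 1}\n${srtTimestamp(cue.start)} --> ${srtTimestamp(cue.end)}\n${cue.text}\n`
    )
    .join("\n");
}
```

WebVTT output follows the same idea, but the file starts with a `WEBVTT` header line and timestamps use a period rather than a comma before the milliseconds.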
Alternatives and similar repositories for deepgram-js-captions
Users who are interested in deepgram-js-captions are comparing it to the libraries listed below.
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆28Updated this week
- ☆18Updated 3 years ago
- A basic voice agent built with Node.js agents framework☆33Updated this week
- A realtime drawing game showcasing the use of LiveKit data capabilities in an Agents-based app.☆31Updated this week
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆33Updated 4 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆18Updated 2 months ago
- Web api for using PiperTTS based models in the browser!☆20Updated last year
- ☆10Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Cog wrapper for Coqui / xtts-v2☆76Updated 8 months ago
- Play.ht's Text to Speech API☆90Updated last year
- ☆16Updated last year
- Cloud Video Renderer SDK using layerhub components☆17Updated 2 years ago
- Building blocks for voice-enabled applications in the browser☆37Updated 3 months ago
- CLI for Replicate☆81Updated 10 months ago
- ☆94Updated 4 months ago
- A JS web client for connecting to Pipecat bots with voice and vision☆45Updated 7 months ago
- Remotion Mapbox example☆29Updated 4 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
- Developer showcase of projects built on Cartesia☆17Updated 11 months ago
- An example Voice Pipeline Agent with Cartesia☆26Updated 4 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- StoryTeller is an experimental web application that creates short audio stories for pre-school kids.☆90Updated last year
- ASR + diarization model server with speculative decoding☆62Updated last year
- A basic voice agent built with Python agents framework☆49Updated 3 weeks ago
- A web GUI built with Nuxt.js for outpainting with Stable Diffusion using the Replicate API.☆52Updated 2 years ago
- Code for training & inference with FLAN family of models☆17Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆223Updated last week
- The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-ti…☆57Updated this week