deepgram / deepgram-js-captions
This package is the JavaScript implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
☆16 Updated last year
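To illustrate the kind of output described above, here is a minimal sketch of turning word-level timings into an SRT string. It does not use this package's API; the `Word` shape and the `srtTimestamp` and `toSrt` helpers are hypothetical names introduced only for this example, and the fixed words-per-cue grouping is an assumption rather than the package's actual behavior.

```typescript
// Hypothetical word shape (start/end in seconds); not the package's actual types.
interface Word {
  word: string;
  start: number;
  end: number;
}

// Left-pad a non-negative integer with zeros to at least `width` digits.
function pad(n: number, width: number): string {
  let s = String(n);
  while (s.length < width) {
    s = "0" + s;
  }
  return s;
}

// Format seconds as an SRT timestamp: HH:MM:SS,mmm
function srtTimestamp(seconds: number): string {
  const totalMs = Math.round(seconds * 1000);
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)},${pad(ms, 3)}`;
}

// Group words into cues of at most `wordsPerCue` words (an assumed, simplistic
// grouping rule) and render them as numbered SRT cues separated by blank lines.
function toSrt(words: Word[], wordsPerCue: number = 8): string {
  const cues: string[] = [];
  for (let i = 0; i < words.length; i += wordsPerCue) {
    const chunk = words.slice(i, i + wordsPerCue);
    const start = srtTimestamp(chunk[0].start);
    const end = srtTimestamp(chunk[chunk.length - 1].end);
    const text = chunk.map((w) => w.word).join(" ");
    cues.push(`${cues.length + 1}\n${start} --> ${end}\n${text}`);
  }
  return cues.join("\n\n") + (cues.length > 0 ? "\n" : "");
}
```

A WebVTT variant would differ mainly in starting the file with a `WEBVTT` header line and using a period instead of a comma before the milliseconds.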
Alternatives and similar repositories for deepgram-js-captions
Users interested in deepgram-js-captions are comparing it to the libraries listed below.
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆29 Updated last week
- This little utility library allows you to ask the most common question when working with video content - does the video contain something… ☆62 Updated 6 months ago
- ☆33 Updated 7 months ago
- ☆10 Updated 2 years ago
- Building blocks for voice-enabled applications in the browser ☆37 Updated 6 months ago
- Code for training & inference with FLAN family of models ☆17 Updated 2 years ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆17 Updated this week
- ☆36 Updated last year
- Cog wrapper for collabora/WhisperSpeech ☆24 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 Updated last year
- Play.ht's Text to Speech API ☆92 Updated 2 months ago
- ☆18 Updated 3 years ago
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning ☆13 Updated 7 months ago
- Tutorial for using Twilio Media Streams ☆25 Updated 10 months ago
- ☆16 Updated 2 years ago
- Generate music videos starring yourself. ☆10 Updated 6 months ago
- Voice data <= 10 mins can also be used to train a good VC model! ☆13 Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ☆28 Updated last year
- ASR + diarization model server with speculative decoding ☆63 Updated last year
- Backend for https://github.com/pinokiocomputer/pinokio ☆58 Updated last month
- Cog wrapper for Coqui / xtts-v2 ☆78 Updated 11 months ago
- Convert an audio file to a waveform video ☆11 Updated last year
- ☆99 Updated 7 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆14 Updated last year
- Build Agents That Recall What Matters. Systematically engineer relevant context from chat history & business data. (TypeScript Client) ☆63 Updated last week
- CLI for Replicate ☆79 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆232 Updated last month
- LiveKit + Next.js AI voice agent interface ☆15 Updated 8 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ☆45 Updated 10 months ago
- A realtime drawing game showcasing the use of LiveKit data capabilities in an Agents-based app. ☆37 Updated this week