deepgram / deepgram-js-captionsLinks
This package is the JavaScript implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
☆15Updated last year
Alternatives and similar repositories for deepgram-js-captions
Users that are interested in deepgram-js-captions are comparing it to the libraries listed below
Sorting:
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆28Updated last week
- ☆32Updated 5 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated 2 months ago
- Tutorial for using Twilio Media Streams☆24Updated 8 months ago
- ☆10Updated last year
- ☆18Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- CLI for Replicate☆80Updated 11 months ago
- StoryDiffusion serverless worker☆17Updated last year
- Easily create video datasets with auto-captioning for Hunyuan-Video LoRA finetuning☆12Updated 5 months ago
- Play.ht's Text to Speech API☆92Updated 3 weeks ago
- A basic voice agent built with Node.js agents framework☆34Updated this week
- Official JavaScript SDK for Deepgram.☆218Updated 2 weeks ago
- Talk to GPT-4 and create a story together.☆91Updated last year
- Using headless Chrome on server side environments for true client side browser emulation with NVIDIA T4 GPUs for Web AI model testing or …☆81Updated last year
- Example code on how to generate viseme json☆14Updated 2 years ago
- ☆75Updated last year
- Integrate AI-powered voice translation into a Twilio Flex contact center using our prebuilt starter app, enabling live conversations betw…☆103Updated 11 months ago
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆25Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice.☆79Updated last year
- An example Voice Pipeline Agent with Cartesia☆26Updated 5 months ago
- Cog implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆12Updated 4 months ago
- ☆33Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Cloud Video Renderer SDK using layerhub components☆17Updated 2 years ago
- Cog template for Stable Diffusion 3 (ComfyUI implementation)☆17Updated last year
- Web api for using PiperTTS based models in the browser!☆20Updated last year
- A web GUI built with Nuxt.js for outpainting with Stable Diffusion using the Replicate API.☆52Updated 2 years ago
- Buildings block for voice-enabled applications in the browser☆37Updated 4 months ago
- Convert an audio file to a waveform video☆11Updated last year