deepgram / deepgram-python-captionsLinks
This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
☆21Updated last year
Alternatives and similar repositories for deepgram-python-captions
Users that are interested in deepgram-python-captions are comparing it to the libraries listed below
Sorting:
- Get started using Deepgram's Live Transcription with this Flask demo app☆40Updated last week
- Python client for Hume AI☆148Updated last week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆139Updated last year
- Simli WebRTC AI Agent demo☆23Updated 11 months ago
- AI-powered tool for automatic podcast script and audio generation.☆77Updated 3 months ago
- VideoDB Python SDK☆84Updated last month
- POC Port of the openai-realtime-console to streamlit.☆53Updated last year
- Talk to GPT-4 and create a story together.☆91Updated last year
- faster-whisper as serverless endpoint☆125Updated 6 months ago
- Data Questionnaire Agent Chatbot☆69Updated last month
- The official Cartesia client for Python.☆116Updated last week
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated 3 weeks ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆61Updated 2 years ago
- An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 11 months ago
- Transcribe a phone call in real-time using Python, AssemblyAI, and Twilio☆18Updated 3 months ago
- This Repo focuses on defending against 'adversarial prompts,' detecting and attempting to mitigate objectionable content in real time.☆13Updated 2 years ago
- A basic voice agent built with Python agents framework☆50Updated last month
- ☆19Updated 2 months ago
- ☆42Updated last year
- ☆17Updated 2 years ago
- 😎 Awesome list of tools and projects with the awesome LangChain framework☆19Updated 2 years ago
- Multimodal Chat with Gemini API☆46Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆34Updated last year
- ☆36Updated 2 years ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆79Updated last year
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain☆62Updated last year
- A collection of apps powered by the LlamaIndex LLM framework.☆55Updated last month
- ☆42Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- Build reliable, secure, and production-ready AI apps easily.☆90Updated last week