oconnoob / realtime-stt-livekit-assemblyaiLinks
Add real-time Speech-to-Text to your LiveKit application with AssemblyAI
☆17Updated 5 months ago
Alternatives and similar repositories for realtime-stt-livekit-assemblyai
Users that are interested in realtime-stt-livekit-assemblyai are comparing it to the libraries listed below
Sorting:
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated 2 weeks ago
- LiveKit + Next.js AI voice agent interface☆15Updated 8 months ago
- AI Search engine☆12Updated last month
- Open-source clone of OpenAI's Deep Research. Works with any transformer, gpt4free, & runs in browser. No Firecrawl needed.☆12Updated 5 months ago
- A tool to extend camelai's plans and thoughts to browser-use web automation☆13Updated 8 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated last month
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆79Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated 3 weeks ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- ☆29Updated 4 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆15Updated last week
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- ☆20Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 3 months ago
- Streamlit Web UI for AGiXT☆28Updated 4 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆50Updated 10 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆26Updated 5 months ago
- an auto coder which automatically fixes errors and improves the code from simple user prompt☆36Updated 10 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 6 months ago
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆22Updated 6 months ago
- An FFMPEG powered MCP server for basic Video and Audio editing☆40Updated 5 months ago
- The Swarm Ecosystem☆26Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated last year
- Use this code to access pipeline to Gemini from inside notebookLM☆32Updated last year
- ☆21Updated last year
- Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It uses a local Perplexica inst…☆22Updated 9 months ago
- An LLM playground similar to the OpenAI API playground☆20Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆34Updated last year