Yuan-ManX / ai-voice-agentsLinks
AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! ποΈπ€π§
β10Updated 10 months ago
Alternatives and similar repositories for ai-voice-agents
Users that are interested in ai-voice-agents are comparing it to the libraries listed below
Sorting:
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.β34Updated last month
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.β59Updated this week
- a simple system for 2-way interruptible voice interactions between human and LLMβ30Updated last year
- β19Updated last week
- A list of language models with permissive licenses such as MIT or Apache 2.0β24Updated 4 months ago
- A lightweight Python library for running TTS models with a unified API.β20Updated 4 months ago
- Open TTS models, built for streaming on the edgeβ43Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β63Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the creβ¦β20Updated 9 months ago
- Best ever Agentive Retrieval Augmented Generationβ7Updated last year
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web β¦β51Updated 7 months ago
- Explore the latest AI Agent Framework!β65Updated 11 months ago
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1β21Updated 2 weeks ago
- A minimalistic streamlit chatbot UI to combine and customize tools for langchain llm agentsβ13Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference anβ¦β22Updated 2 months ago
- A basic voice agent built with Python agents frameworkβ50Updated 2 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, anβ¦β13Updated 2 weeks ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jaxβ14Updated last year
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systemsβ82Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale β¦β19Updated 2 weeks ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)β18Updated last month
- β13Updated 2 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.β19Updated 5 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.β17Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speechβ22Updated 10 months ago
- A tool to extend camelai's plans and thoughts to browser-use web automationβ12Updated 4 months ago
- Audio tokenization, in the fastest way possible!β52Updated 10 months ago
- LiveKit + Next.js AI voice agent interfaceβ11Updated 4 months ago
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)β21Updated 2 months ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.β33Updated 2 years ago