TEN-framework / TEN-Agent
TEN Agent is a conversational voice AI agent powered by TEN, integrating DeepSeek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables real-time AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.
☆5,493Updated this week
Alternatives and similar repositories for TEN-Agent:
Users who are interested in TEN-Agent are comparing it to the libraries listed below:
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆2,916Updated this week
- TEN, an AI agent framework for creating various AI agents that support real-time conversation.☆609Updated this week
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆3,658Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,112Updated 2 weeks ago
- 🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifac…☆3,417Updated this week
- The python library for real-time communication☆3,414Updated this week
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,256Updated 5 months ago
- Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural l…☆2,984Updated last week
- A powerful framework for building realtime voice AI agents 🤖🎙️📹☆5,480Updated this week
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆8,001Updated this week
- ☆3,819Updated last month
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.☆1,704Updated 2 months ago
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki…☆945Updated 3 weeks ago
- A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.☆4,994Updated 2 months ago
- A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.☆10,624Updated this week
- A fast multimodal LLM for real-time voice☆3,804Updated last month
- Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)☆3,860Updated last week
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,483Updated last month
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation☆1,651Updated this week
- Build multimodal language agents for fast prototype and production☆2,456Updated 3 weeks ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,168Updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,434Updated last week
- ☆2,754Updated 2 weeks ago
- ☆5,879Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆5,334Updated this week
- The open source platform for AI-native application development.☆5,095Updated 4 months ago
- "Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research"☆888Updated last week
- 5ire is a cross-platform desktop AI assistant and MCP client. It is compatible with major service providers, supports local knowledge base and…☆2,489Updated this week
- Profile-Based Long-Term Memory for AI Applications☆1,011Updated this week
- Open Source framework for voice and multimodal conversational AI☆5,455Updated this week