TEN-framework / TEN-Agent
TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, and is fully compatible with popular workflow platforms like Dify and Coze.
☆4,034Updated this week
Alternatives and similar repositories for TEN-Agent:
Users who are interested in TEN-Agent are comparing it to the libraries listed below:
- An open-source multimodal large language model that can hear and talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,098Updated 2 months ago
- TEN, a voice agent framework to create conversational AI.☆501Updated this week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.☆1,579Updated last week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆3,530Updated this week
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆1,521Updated this week
- A fast multimodal LLM for real-time voice☆3,245Updated last week
- Build multimodal language agents for fast prototyping and production☆1,400Updated this week
- The open source platform for AI-native application development.☆5,026Updated last month
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆17,826Updated this week
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,290Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆2,430Updated 2 weeks ago
- Build real-time multimodal AI applications 🤖🎙️📹☆4,891Updated this week
- "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆11,354Updated this week
- A UI-Focused Agent for Windows OS Interaction.☆6,478Updated last week
- GLM-4-Voice | End-to-end Chinese-English spoken dialogue model☆2,593Updated last month
- Let AI be your browser operator.☆5,229Updated this week
- BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI…☆7,542Updated this week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆3,691Updated last month
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆3,917Updated last week
- Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,448Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆4,307Updated this week
- The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Join our Community: https://discord.gg/jM3Z6M9uMq☆4,258Updated this week
- ☆1,435Updated this week
- Local realtime voice AI☆2,181Updated last week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆5,345Updated 3 weeks ago
- Riona 🌸 is built with Node.js and TypeScript 🛠️. Designed to run jobs 📸 effortlessly. Lightweight, efficient, and a work in progress…☆2,290Updated last week
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆2,116Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆5,587Updated 3 weeks ago