IDEA-CCNL / Real-GeminiLinks
Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本、语音、图像和视频和这是世界进行问答和交流。
☆24Updated last year
Alternatives and similar repositories for Real-Gemini
Users that are interested in Real-Gemini are comparing it to the libraries listed below
Sorting:
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated 2 years ago
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆21Updated 3 months ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆50Updated this week
- ☆13Updated last year
- AGM阿格姆:AI基因图谱模型,从token-weight权重微粒角度,探索AI模型,GPT\LLM大模型的内在运作机制。☆28Updated 2 years ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Updated 8 months ago
- ☆19Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆14Updated 2 weeks ago
- Official implementation for "OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities" (keep updating)☆59Updated last year
- AGI模块库架构图☆76Updated last year
- 大模型智能体Agent中文教程,博客代码仓库☆26Updated last month
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 10 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆12Updated this week
- LinChance Fine-tuning System 采用 Streamlit 结合 LLaMA-Factory 打造的模型微调 Web UI☆14Updated last year
- aigc evals☆11Updated last year
- ☆94Updated 8 months ago
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: A Two-Level Agent System for Efficient Mobile Task Automati…☆25Updated 3 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重 要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 6 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆28Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆54Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Updated last year
- pre-training llama3 using chinese☆13Updated last year
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated last year
- ☆74Updated last year
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆21Updated 8 months ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- PresentAgent: Multimodal Agent for Presentation Video Generation☆93Updated last week
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆19Updated 11 months ago
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆24Updated 9 months ago