IDEA-CCNL / Real-GeminiLinks
Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本、语音、图像和视频和这是世界进行问答和交流。
☆25Updated last year
Alternatives and similar repositories for Real-Gemini
Users that are interested in Real-Gemini are comparing it to the libraries listed below
Sorting:
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- Luann (fka TypeAgent) allows you to create many LLM based agent(Various types of agent,scale up)☆22Updated 6 months ago
- Official implementation for "OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities" (keep updating)☆60Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated 11 months ago
- aigc evals☆10Updated last year
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- share data, prompt data , pretraining data☆36Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆25Updated 3 weeks ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated 2 weeks ago
- ☆13Updated 2 years ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆27Updated 5 months ago
- LinChance Fine-tuning System 采用 Streamlit 结合 LLaMA-Factory 打造的模型微调 Web UI☆14Updated last year
- pre-training llama3 using chinese☆13Updated last year
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆19Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆11Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 8 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated 11 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆56Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆36Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆67Updated last year
- ☆20Updated 2 years ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆73Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple☆18Updated 11 months ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆27Updated last year
- Taking advantage of LlamaIndex's in-context learning paradigm, LlamaDoc empowers users to input PDF documents and pose any questions rela…☆14Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆55Updated last week
- 基于youtube、bilibili等视频平台、webpage网页等,利用零一万物大模型或ollama本地小模型构建大语言模型高质量训练数据集(计划支持可自定义输出的训练数据格式)☆19Updated last year