om-ai-lab / OmAgent
Build multimodal language agents for fast prototype and production
☆2,451Updated last week
Alternatives and similar repositories for OmAgent:
Users that are interested in OmAgent are comparing it to the libraries listed below
- Align Anything: Training All-modality Model with Feedback☆3,063Updated last week
- "VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos"☆510Updated this week
- "Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research"☆838Updated last month
- Pioneering Multimodal Reasoning with CoT☆1,157Updated this week
- Real-time and accurate open-vocabulary end-to-end object detection☆1,307Updated 3 months ago
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆705Updated 2 months ago
- Easiest and laziest way for building multi-agent LLMs applications.☆1,417Updated last week
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki…☆938Updated last week
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,472Updated 3 weeks ago
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,689Updated 2 months ago
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆2,268Updated this week
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,158Updated this week
- "MiniRAG: Making RAG Simpler with Small and Free Language Models"☆901Updated this week
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基…☆1,177Updated 10 months ago
- An Innovative Agent Framework Driven by KG Engine☆755Updated 2 months ago
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models☆1,735Updated last week
- Your Automatic Prompt Engineering Assistant for GenAI Applications☆2,091Updated 11 months ago
- Next-Generation Interactive Intelligent Programming Assistant☆805Updated 5 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆3,315Updated this week
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆2,904Updated this week
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆908Updated 2 weeks ago
- Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,518Updated last month
- DeepRetrieval - Hacking 🔥Real Search Engines and Text/Data Retrievers with LLM + RL☆201Updated this week
- "GraphAgent: Agentic Graph Language Assistant"☆292Updated last month
- In-depth study of the graphrag☆674Updated this week
- SDG is a specialized framework designed to generate high-quality structured tabular data.☆2,338Updated 3 weeks ago
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,050Updated 5 months ago
- Parsing-free RAG supported by VLMs☆644Updated last month
- Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and opensource projects) on graph-based retrieval-augmented g…☆900Updated last week
- An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolk…☆1,162Updated 9 months ago