om-ai-lab / OmAgent
Build multimodal language agents for fast prototype and production
☆1,777Updated this week
Alternatives and similar repositories for OmAgent:
Users that are interested in OmAgent are comparing it to the libraries listed below
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆689Updated 3 weeks ago
- Align Anything: Training All-modality Model with Feedback☆2,154Updated this week
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,367Updated this week
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆618Updated this week
- Medical o1, Towards medical complex reasoning with LLMs☆868Updated last month
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆2,116Updated this week
- Easiest and laziest way for building multi-agent LLMs applications.☆1,122Updated this week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,626Updated last month
- Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the maki…☆875Updated last week
- "MiniRAG: Making RAG Simpler with Small and Free Language Models"☆714Updated last week
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆885Updated 3 weeks ago
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基…☆1,122Updated 9 months ago
- Real-time and accurate open-vocabulary end-to-end object detection☆1,167Updated 2 months ago
- Your Automatic Prompt Engineering Assistant for GenAI Applications☆2,082Updated 9 months ago
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models☆1,708Updated this week
- Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,481Updated 3 weeks ago
- An Innovative Agent Framework Driven by KG Engine☆703Updated last month
- SDG is a specialized framework designed to generate high-quality structured tabular data.☆2,308Updated last week
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆2,572Updated this week
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆323Updated last week
- "GraphAgent: Agentic Graph Language Assistant"☆258Updated last week
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,039Updated 4 months ago
- GraphRAG-survey: A curated list of resources on graph-based retrieval-augmented generation for customized large language models.☆558Updated this week
- ☆1,381Updated 4 months ago
- An MBTI Exploration of Large Language Models☆457Updated last year
- The open source platform for AI-native application development.☆5,052Updated 2 months ago
- csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and App…☆583Updated this week