om-ai-lab / OmAgent
Build multimodal language agents for very fast prototype and production
☆1,198Updated this week
Alternatives and similar repositories for OmAgent:
Users that are interested in OmAgent are comparing it to the libraries listed below
- Easiest and laziest way for building multi-agent LLMs applications.☆937Updated this week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,550Updated this week
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,012Updated 3 months ago
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆796Updated last week
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆545Updated this week
- Real-time and accurate open-vocabulary end-to-end object detection☆1,135Updated last month
- 【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models☆1,683Updated 2 weeks ago
- An MBTI Exploration of Large Language Models☆447Updated 11 months ago
- Medical o1, Towards medical complex reasoning with LLMs☆656Updated last week
- "GraphAgent: Agentic Graph Language Assistant"☆230Updated 2 weeks ago
- An Innovative Agent Framework Driven by KG Engine☆623Updated this week
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆602Updated 3 months ago
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆1,987Updated this week
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基…☆1,092Updated 8 months ago
- improve Llama-2's proficiency in comprehension, generation, and translation of Chinese.☆449Updated 9 months ago
- Next-Generation Interactive Intelligent Programming Assistant☆730Updated 3 months ago
- LLM-And-More is a professional, plug-and-play, llm trainer and application builder that guides you through the complete LLM workflow from…☆364Updated 6 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆262Updated this week
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆127Updated this week
- Your Automatic Prompt Engineering Assistant for GenAI Applications☆2,082Updated 8 months ago
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,247Updated last week
- The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long…☆533Updated 6 months ago
- ☆369Updated last month
- Awesome LLMs on Device: A Comprehensive Survey☆910Updated this week
- The official implementation of Self-Play Preference Optimization (SPPO)☆452Updated last month
- "MiniRAG: Making RAG Simpler with Small and Free Language Models"☆215Updated this week
- An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolk…☆1,087Updated 6 months ago
- [AAAI 2025] Official repository of Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection☆168Updated last week
- EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders☆528Updated 3 months ago
- The open source platform for AI-native application development.☆5,063Updated last month