OmniGAIA: Towards Native Omni-Modal AI Agents
☆82Mar 16, 2026Updated this week
Alternatives and similar repositories for OmniGAIA
Users that are interested in OmniGAIA are comparing it to the libraries listed below
Sorting:
- [AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆26Sep 24, 2025Updated 5 months ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆82Dec 20, 2024Updated last year
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 5 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆29Nov 4, 2025Updated 4 months ago
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆16Dec 8, 2024Updated last year
- ☆16Sep 17, 2024Updated last year
- ☆28May 23, 2024Updated last year
- ☆18Jun 10, 2025Updated 9 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆55Aug 28, 2025Updated 6 months ago
- EMNLP 2025 | TongSearch-QR☆41Dec 4, 2025Updated 3 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 5 months ago
- The demo, code and data of FollowRAG☆76Jun 30, 2025Updated 8 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆64May 21, 2025Updated 10 months ago
- ☆80Jan 22, 2026Updated last month
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆47Jul 1, 2025Updated 8 months ago
- ☆58Feb 27, 2025Updated last year
- ☆65Jan 26, 2026Updated last month
- ☆14Dec 18, 2024Updated last year
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆117Jan 23, 2025Updated last year
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 3 months ago
- ☆175Feb 24, 2026Updated 3 weeks ago
- ☆58Updated this week
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆68Feb 18, 2026Updated last month
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.☆111Jan 14, 2026Updated 2 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Dec 8, 2024Updated last year
- [ACL 2025] Can MLLMs Understand the Deep Implication Behind Chinese Images?☆21Oct 20, 2025Updated 5 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Neuro-Symbolic Hierarchical Rule Induction☆14Dec 31, 2022Updated 3 years ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆63Jan 23, 2026Updated last month
- ☆14Apr 1, 2024Updated last year
- A minimal example of Abductive Learning☆18Dec 6, 2023Updated 2 years ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 5 months ago
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆41Jan 29, 2026Updated last month
- A Comprehensive Library for Memory of LLM-based Agents.☆107May 13, 2025Updated 10 months ago