OmniGAIA: Towards Native Omni-Modal AI Agents
☆46Updated this week
Alternatives and similar repositories for OmniGAIA
Users that are interested in OmniGAIA are comparing it to the libraries listed below
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- [AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆25Sep 24, 2025Updated 5 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 4 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆22Dec 11, 2024Updated last year
- ☆67Aug 14, 2025Updated 6 months ago
- ☆16Sep 17, 2024Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆81Dec 20, 2024Updated last year
- ☆14Dec 18, 2024Updated last year
- ☆27May 23, 2024Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 4 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research☆16Dec 8, 2024Updated last year
- Example code for drawing figures in research papers☆35Mar 3, 2022Updated 3 years ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆28Nov 4, 2025Updated 3 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆54Aug 28, 2025Updated 6 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 4 months ago
- EMNLP 2025 | TongSearch-QR☆41Dec 4, 2025Updated 2 months ago
- ☆138Nov 17, 2025Updated 3 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- ☆58Feb 27, 2025Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆42Aug 25, 2025Updated 6 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Oct 3, 2025Updated 4 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 5 months ago
- ☆78Jan 22, 2026Updated last month
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆64May 21, 2025Updated 9 months ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆39Sep 8, 2025Updated 5 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 2 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- MAKGED is the first multi-agent framework for collaborative error detection in knowledge graphs.☆30Jul 20, 2025Updated 7 months ago
- ☆34Feb 6, 2026Updated 3 weeks ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆319Jan 3, 2026Updated last month
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆35Dec 6, 2025Updated 2 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- The demo, code and data of FollowRAG☆75Jun 30, 2025Updated 8 months ago
- This repository aims to reproduce R1-Zero in the medical domain.☆32Jun 11, 2025Updated 8 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆32Jan 22, 2025Updated last year
- The official implementation of the paper "DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents"☆29Oct 23, 2025Updated 4 months ago