LHRLAB / KBQA-o1View external linksLinks
[ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".
☆34Dec 6, 2025Updated 2 months ago
Alternatives and similar repositories for KBQA-o1
Users that are interested in KBQA-o1 are comparing it to the libraries listed below
Sorting:
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- ☆25Apr 15, 2025Updated 9 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Feb 13, 2025Updated last year
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆55Dec 27, 2025Updated last month
- ☆17Aug 1, 2025Updated 6 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆49Sep 4, 2025Updated 5 months ago
- ☆14Dec 18, 2024Updated last year
- A holistic framework for advancing LLMs as data science agents☆30Feb 3, 2026Updated last week
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆25Aug 24, 2025Updated 5 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 8 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 7 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- ☆19Jun 4, 2025Updated 8 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆24Oct 7, 2025Updated 4 months ago
- [EMNLP Findings 2022] ReaRev: Adaptive Reasoning for Question Answering over Knowledge Graphs☆37Mar 12, 2023Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 8 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Oct 15, 2025Updated 3 months ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 10 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 3 months ago
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 7 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 8 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆34Jan 23, 2026Updated 3 weeks ago
- Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?". (ACL 2025 Main)☆20Jun 18, 2025Updated 7 months ago
- ☆18Oct 28, 2025Updated 3 months ago
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 7 months ago
- content-neutral dataset of logical reasoning☆19Mar 21, 2025Updated 10 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 8 months ago
- ☆42Sep 19, 2024Updated last year
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆32Jul 25, 2025Updated 6 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆44Jan 25, 2026Updated 2 weeks ago
- [NeurIPS 2024] Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs☆88Jan 13, 2025Updated last year
- ☆19Nov 25, 2024Updated last year
- ☆19Mar 10, 2025Updated 11 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆53Jun 24, 2024Updated last year
- ☆18Sep 5, 2024Updated last year
- PyTorch implementation of "Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials."☆19Jan 14, 2025Updated last year