HKUSTDial / LEADLinks
🔥[VLDB'26] Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".
☆80Updated 6 months ago
Alternatives and similar repositories for LEAD
Users that are interested in LEAD are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆20Updated 4 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆26Updated 2 months ago
- Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)☆32Updated last year
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 5 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆297Updated 2 months ago
- 🔥 [NeurIPS'25] nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning☆22Updated 3 weeks ago
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Updated 7 months ago
- [ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows☆671Updated 3 weeks ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆129Updated last month
- Officical repository for the paper“ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering”(EMN…☆22Updated last year
- In-depth study of the graphrag☆1,458Updated 5 months ago
- Train your Agent model via our easy and efficient framework☆1,635Updated 2 weeks ago
- Codebase for Iterative DPO Using Rule-based Rewards☆263Updated 7 months ago
- ☆329Updated 3 months ago
- adds Sequence Parallelism into LLaMA-Factory☆596Updated last month
- Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions☆171Updated this week
- [COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning☆675Updated last month
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆112Updated 2 weeks ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆91Updated last week
- Official Repository of "LLM × DATA" Survey Paper☆572Updated last month
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges☆2,178Updated 3 weeks ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,230Updated last month
- ☆516Updated 2 months ago
- Awesome List for Agentic RL☆571Updated last week
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆775Updated 2 months ago
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models☆97Updated 3 weeks ago
- A scalable, end-to-end training pipeline for general-purpose agents☆361Updated 5 months ago
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆305Updated 2 weeks ago
- ☆22Updated 3 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆233Updated 5 months ago