HKUSTDial / LEADLinks
🐂 🔥Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".
☆67Updated 4 months ago
Alternatives and similar repositories for LEAD
Users that are interested in LEAD are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆15Updated 2 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆279Updated last month
- Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)☆31Updated 11 months ago
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 3 months ago
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆25Updated 3 weeks ago
- In-depth study of the graphrag☆1,432Updated 3 months ago
- [COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning☆650Updated last week
- Train your Agent model via our easy and efficient framework☆1,554Updated last week
- adds Sequence Parallelism into LLaMA-Factory☆574Updated last week
- ☆319Updated last month
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges☆1,828Updated last week
- Codebase for Iterative DPO Using Rule-based Rewards☆258Updated 6 months ago
- 🔥 [NeurIPS'25] nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning☆19Updated 3 weeks ago
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,256Updated 9 months ago
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆30Updated 6 months ago
- Continuously updated handbook and official repository for our survey on Data Agents.☆18Updated last month
- [NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,217Updated last month
- minimal-cost for training 0.5B R1-Zero☆777Updated 5 months ago
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆771Updated 2 weeks ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆96Updated 4 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆60Updated 8 months ago
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆97Updated last week
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,005Updated last week
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆224Updated 4 months ago
- ☆475Updated last month
- Officical repository for the paper“ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering”(EMN…☆21Updated 11 months ago
- Official Repository of "LLM × DATA" Survey Paper☆474Updated 2 weeks ago
- A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基…☆1,299Updated last year
- ☆122Updated 3 weeks ago
- An Awesome List of Agentic Model trained with Reinforcement Learning☆502Updated this week