HKUSTDial / LEADLinks
🔥[VLDB'26] Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".
☆82Updated 5 months ago
Alternatives and similar repositories for LEAD
Users that are interested in LEAD are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing".☆18Updated 3 months ago
- Official repository for the paper “HAIChart: Human and AI Paired Visualization System” (VLDB'24)☆31Updated last year
- 🔥[SIGKDD'25] NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation.☆26Updated last month
- 🔥[NeurIPS'24] Official repository for the paper “Are Large Language Models Good Statisticians?”☆32Updated 6 months ago
- DataMosaic: Explainable and Verifiable Document-Based Data Analytics☆20Updated 4 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆289Updated 2 months ago
- 🔥 [NeurIPS'25] nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning☆21Updated last month
- In-depth study of the graphrag☆1,448Updated 4 months ago
- Codebase for Iterative DPO Using Rule-based Rewards☆260Updated 7 months ago
- 🔥[ICML'25] Official repository for the paper "Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search"☆117Updated 2 weeks ago
- Officical repository for the paper“ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering”(EMN…☆22Updated 11 months ago
- ☆322Updated 2 months ago
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges☆2,025Updated this week
- adds Sequence Parallelism into LLaMA-Factory☆588Updated 3 weeks ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆82Updated 3 weeks ago
- ☆224Updated last month
- [COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning☆666Updated 3 weeks ago
- Train your Agent model via our easy and efficient framework☆1,613Updated last week
- [NeurIPS'25] Official Repository for the Paper "SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning"☆104Updated 2 weeks ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆233Updated 4 months ago
- ☆20Updated 2 months ago
- [ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows☆627Updated last week
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆772Updated last month
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆16Updated 5 months ago
- MarkLLM: An Open-Source Toolkit for LLM Watermarking.(EMNLP 2024 System Demonstration)☆656Updated 3 weeks ago
- A scalable, end-to-end training pipeline for general-purpose agents☆361Updated 4 months ago
- [NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,223Updated last month
- Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm o…☆223Updated last week
- ☆125Updated last month
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,275Updated 10 months ago