HKUSTDial / LEADLinks
🔥[VLDB'26] Official repository for the paper "LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning".
☆108Updated 7 months ago
Alternatives and similar repositories for LEAD
Users that are interested in LEAD are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆1,082Updated 2 weeks ago
- ☆357Updated 6 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆244Updated 3 months ago
- ☆13Updated 11 months ago
- Multilingual Translations of "Foundations of Large Language Models" and NLPBook.☆232Updated 4 months ago
- MATEval is the first multi-agent framework simulating human collaborative discussion for open-ended text evaluation.☆28Updated 7 months ago
- My implementations on the 5 assignments of cs336☆209Updated last month
- [TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchma…☆1,242Updated last month
- ☆128Updated 3 months ago
- Marco Search Agent for Realistic and Challenging Agentic Search☆240Updated 2 months ago
- ☆206Updated 3 weeks ago
- The code of AMoPO: Adaptive Multi-objective Preference Optimization without Rewards and References.☆45Updated 4 months ago
- Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.☆739Updated 3 months ago
- A tool for translating the content of LaTeX documents into various other natural languages (e.g., translating an arXiv paper from English…☆429Updated 2 months ago
- 从0è®ç»ƒç±» o1 大è¯è¨€æ¨¡åž‹ã€‚☆132Updated last week
- Selective Prompt Anchoring☆97Updated last month
- The Python implementation of some deep text hashing (also called deep semantic hashing) Models☆80Updated last month
- A reading list for trustworthy audio large language models.☆112Updated last week
- ☆49Updated last week
- ☆33Updated 2 months ago
- The codes for the paper One-bit Deep Hashing: Towards a Resource-Efficient Hashing Model with Binary Neural Networks (ACMMM24)☆45Updated 10 months ago
- ☆250Updated 3 weeks ago
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆176Updated 6 months ago
- MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models☆297Updated 3 weeks ago
- [BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.☆455Updated 3 weeks ago
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM☆56Updated 4 months ago
- [TMLR'25] The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆53Updated 9 months ago
- ☆104Updated 7 months ago
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆59Updated 2 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆164Updated 2 months ago