xiatingyu / SFT-DataSelection-at-scaleView external linksLinks
☆31Feb 9, 2025Updated last year
Alternatives and similar repositories for SFT-DataSelection-at-scale
Users that are interested in SFT-DataSelection-at-scale are comparing it to the libraries listed below
Sorting:
- ☆12Dec 13, 2023Updated 2 years ago
- ☆18Feb 20, 2024Updated last year
- ☆24Oct 14, 2024Updated last year
- ☆16Sep 4, 2025Updated 5 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Sep 22, 2025Updated 4 months ago
- ☆30Nov 5, 2024Updated last year
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 9 months ago
- ☆14Apr 16, 2024Updated last year
- ☆21Jul 18, 2024Updated last year
- ☆19Feb 25, 2023Updated 2 years ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 10 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated 11 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆21Oct 10, 2024Updated last year
- ☆21Dec 6, 2025Updated 2 months ago
- Collection of works for evaluating (and analyzing) large audio-language models (LALMs)☆40Aug 11, 2025Updated 6 months ago
- Fantastic Data Engineering for Large Language Models☆93Dec 29, 2024Updated last year
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆25May 10, 2024Updated last year
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- ☆28May 24, 2025Updated 8 months ago
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆34Jul 24, 2025Updated 6 months ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated last week
- Official code for the publication "Large Language Models as Zero-shot Dialogue State Tracker through Function Calling" https//arxiv.org/a…☆67Aug 14, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆35Jun 13, 2025Updated 8 months ago
- ☆39Aug 4, 2025Updated 6 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆188Jun 25, 2025Updated 7 months ago
- 通义点金:中文金融行业大模型 Resources☆85Jan 7, 2025Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆63Oct 23, 2025Updated 3 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year
- ☆34Jul 23, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- ☆35May 16, 2025Updated 8 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆146Nov 2, 2024Updated last year
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆38Jun 4, 2025Updated 8 months ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆36Nov 17, 2024Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated 2 weeks ago