[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
☆20Sep 27, 2025Updated 5 months ago
Alternatives and similar repositories for Mosaic-IT
Users that are interested in Mosaic-IT are comparing it to the libraries listed below
Sorting:
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 5 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆189Jun 25, 2025Updated 8 months ago
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements☆24Sep 14, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆76Jun 25, 2025Updated 8 months ago
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Jul 24, 2022Updated 3 years ago
- Findings of ACL 2021☆24May 8, 2021Updated 4 years ago
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆29Feb 6, 2026Updated last month
- AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers☆48Oct 21, 2022Updated 3 years ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆416Jun 25, 2025Updated 8 months ago
- AutoHallusion Codebase (EMNLP 2024)☆22Dec 6, 2024Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆367Sep 6, 2024Updated last year
- OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML 25]☆35Apr 7, 2025Updated 11 months ago
- Build a medical knowledge graph based on Unified Language Medical System (UMLS)☆26Dec 25, 2021Updated 4 years ago
- ☆30Nov 5, 2024Updated last year
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆29Sep 25, 2021Updated 4 years ago
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆63Aug 10, 2025Updated 6 months ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆34Oct 25, 2024Updated last year
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".☆34Sep 16, 2023Updated 2 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Jun 10, 2024Updated last year
- (ICLR 2026) Optimas: Optimizing Compound AI Systems☆75Feb 6, 2026Updated last month
- ☆12Sep 21, 2023Updated 2 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- Diverse Client Selection for Federated Learning via Submodular Maximization☆35Aug 3, 2022Updated 3 years ago
- ☆14May 1, 2023Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 9 months ago
- ☆44Feb 9, 2026Updated last month
- 小鸡词典🐤的Alfred🎩插件 咯咯咯☆11Apr 19, 2023Updated 2 years ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- DNH Werewolf Discord bot☆13Dec 19, 2024Updated last year