TsinghuaC3I / Intuitive-Fine-TuningView external linksLinks
[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30Aug 2, 2024Updated last year
Alternatives and similar repositories for Intuitive-Fine-Tuning
Users that are interested in Intuitive-Fine-Tuning are comparing it to the libraries listed below
Sorting:
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆51May 12, 2025Updated 9 months ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation☆15Jul 15, 2024Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- trying to reproduce suno v3☆35Jan 29, 2025Updated last year
- ☆29Jan 23, 2024Updated 2 years ago
- ☆36Sep 26, 2024Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- An ambient noise detector☆10Aug 23, 2020Updated 5 years ago
- Branch Metrics Win32/C++ SDK☆10Jun 10, 2025Updated 8 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆905Sep 30, 2025Updated 4 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆151Feb 14, 2025Updated 11 months ago
- Official repository for ORPO☆471May 31, 2024Updated last year
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- 实现一个自己的小语言模型☆11Jun 15, 2024Updated last year
- Strawberry architecture analysis and reconstruction☆16Dec 16, 2025Updated last month
- ☆41Jun 19, 2024Updated last year
- 競馬予想プログラム☆12May 6, 2023Updated 2 years ago
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- ☆12Jun 15, 2023Updated 2 years ago
- A python tool help to interact with chatgpt.☆10Dec 11, 2022Updated 3 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆10Aug 9, 2018Updated 7 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- ☆11Sep 27, 2022Updated 3 years ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- 青空文庫からテキストをいい感じに取り出します☆11Jun 6, 2021Updated 4 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- Ilya Sutskever 推荐的30篇Deep learning 必读论文 (中英文对照翻译版)☆13Dec 18, 2024Updated last year
- 刹那是永恒☆13Feb 26, 2020Updated 5 years ago
- Robot simulator using web technologies, just JavaScript☆10Feb 13, 2020Updated 6 years ago
- a Video Quality Analysis Toolkit☆13May 16, 2025Updated 8 months ago
- ☆10May 16, 2024Updated last year
- Accepted to MLSys 2026☆70Jan 29, 2026Updated 2 weeks ago
- https://avocado-captioner.github.io/☆29Oct 16, 2025Updated 3 months ago