[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30 · Aug 2, 2024 · Updated last year
Alternatives and similar repositories for Intuitive-Fine-Tuning
Users that are interested in Intuitive-Fine-Tuning are comparing it to the libraries listed below
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆52 · May 12, 2025 · Updated 9 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023). ☆17 · Jan 8, 2025 · Updated last year
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization" ☆41 · Sep 24, 2024 · Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- Self-Supervised Alignment with Mutual Information ☆20 · May 24, 2024 · Updated last year
- Latest Evaluation Toolkit (LatestEval). Assessing the language models with latest, uncontaminated materials. ☆28 · Feb 17, 2025 · Updated last year
- trying to reproduce suno v3 ☆35 · Jan 29, 2025 · Updated last year
- ☆29 · Jan 23, 2024 · Updated 2 years ago
- ☆37 · Sep 26, 2024 · Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆75 · May 20, 2025 · Updated 9 months ago
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models ☆36 · Jul 19, 2024 · Updated last year
- Branch Metrics Win32/C++ SDK ☆10 · Jun 10, 2025 · Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆906 · Sep 30, 2025 · Updated 5 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO) ☆152 · Feb 14, 2025 · Updated last year
- Official repository for ORPO ☆472 · May 31, 2024 · Updated last year
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024) ☆40 · Sep 22, 2024 · Updated last year
- Multi-Candidate Speculative Decoding ☆39 · Apr 22, 2024 · Updated last year
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols ☆17 · Nov 19, 2025 · Updated 3 months ago
- Strawberry architecture analysis and reconstruction ☆16 · Dec 16, 2025 · Updated 2 months ago
- Neural Homomorphic Vocoder optimized for singing voice synthesis ☆18 · Updated this week
- Topological-LSTM for Information Cascade Modeling ☆12 · Nov 2, 2017 · Updated 8 years ago
- LipSync AI is your ultimate solution for flawless lip-syncing in videos. Our AI model precisely synchronizes audio and video, creating li… ☆15 · Jul 18, 2023 · Updated 2 years ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues ☆13 · Feb 7, 2024 · Updated 2 years ago
- ☆12 · Dec 12, 2019 · Updated 6 years ago
- ☆10 · Dec 24, 2023 · Updated 2 years ago
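- Re-identification of campus bicycles in surveillance-quality footage, including ReID model recognition, vector database retrieval, and a UI display ☆10 · Feb 13, 2024 · Updated 2 years ago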
- ☆41 · Jun 19, 2024 · Updated last year
- Ming-omni-tts: Simple and Efficient Unified Generation of Speech, Music, and Sound with Precise Control ☆160 · Feb 26, 2026 · Updated last week
- ☆11 · Dec 15, 2025 · Updated 2 months ago
- Neural Network Image Compression ☆13 · Jan 12, 2018 · Updated 8 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training" ☆11 · Oct 27, 2025 · Updated 4 months ago
- ☆13 · Jan 26, 2024 · Updated 2 years ago
- A Python tool that helps you interact with ChatGPT. ☆10 · Dec 11, 2022 · Updated 3 years ago
- ☆11 · Oct 29, 2022 · Updated 3 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆14 · Jun 28, 2025 · Updated 8 months ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated last year
- CVMHT: Complementary-View Multiple Human Tracking (AAAI 2020). ☆10 · Dec 9, 2021 · Updated 4 years ago