[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30Aug 2, 2024Updated last year
Alternatives and similar repositories for Intuitive-Fine-Tuning
Users that are interested in Intuitive-Fine-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- trying to reproduce suno v3☆34Jan 29, 2025Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- ☆14Dec 13, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆16Mar 18, 2025Updated last year
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆41Sep 22, 2024Updated last year
- [CVPR-2024] NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation☆16Oct 19, 2024Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- Official repository for ORPO☆473May 31, 2024Updated last year
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 8 months ago
- ☆29Jan 23, 2024Updated 2 years ago
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆16Feb 15, 2024Updated 2 years ago
- [CVPR-2024] Text-Enhanced Data-free Approach for Federated Class-Incremental Learning☆18Dec 26, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆907Sep 30, 2025Updated 5 months ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- A collections of audio codecs with a standardized API☆36May 27, 2025Updated 10 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆152Feb 14, 2025Updated last year
- WavSpA: Wavelet Space Attention for Enhancing Transformer's Long Sequence Learning☆12Feb 24, 2024Updated 2 years ago
- Repository for Giuseppe Russo's master thesis code.☆13Oct 2, 2020Updated 5 years ago
- ☆11Oct 29, 2022Updated 3 years ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- Topological-LSTM for Information Cascade Modeling☆12Nov 2, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 10 months ago
- Multi-Candidate Speculative Decoding☆40Apr 22, 2024Updated last year
- 基于Qt GraphicsView绘制图形(长方形、角度、圆等)☆12Jan 26, 2018Updated 8 years ago
- 机器学习部分算法实现,分类、聚类、回归(LR、Kmeans、GMM、PCA)☆11Mar 12, 2019Updated 7 years ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- ☆16Jun 14, 2023Updated 2 years ago
- An ambient noise detector☆10Aug 23, 2020Updated 5 years ago
- ☆15Aug 6, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…☆15Aug 12, 2024Updated last year
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- Code for "Hierarchical Diffusion Attention Network" (IJCAI 2019)☆14Apr 23, 2020Updated 5 years ago
- ☆20Sep 5, 2024Updated last year
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- 论文Low-Shot Learning with Imprinted Weights 的keras 版简要实现;☆15Dec 28, 2018Updated 7 years ago