GeneZC / MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆100 Updated 8 months ago
Alternatives and similar repositories for MiniMA:
Users who are interested in MiniMA are comparing it to the repositories listed below:
- Unofficial implementation of AlpaGasus ☆90 Updated last year
- ☆98 Updated 5 months ago
- An Experiment on Dynamic NTK Scaling RoPE ☆62 Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆130 Updated 4 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆75 Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆76 Updated last year
- FuseAI Project ☆83 Updated last month
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆89 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆136 Updated 4 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024) ☆205 Updated 9 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆140 Updated 6 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models” ☆121 Updated 2 months ago
- Reformatted Alignment ☆114 Updated 5 months ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment ☆74 Updated 9 months ago
- ☆76 Updated 2 months ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods. ☆51 Updated 7 months ago
- Self-Alignment with Principle-Following Reward Models ☆156 Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al… ☆111 Updated last year
- Experiments on speculative sampling with Llama models ☆125 Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆146 Updated 6 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline" ☆115 Updated 9 months ago
- This is the official repository for Inheritune. ☆109 Updated last month
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆108 Updated 7 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆137 Updated 8 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆117 Updated last year
- evol augment any dataset online ☆59 Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆129 Updated 9 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆153 Updated 9 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆52 Updated 5 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs ☆241 Updated 3 months ago