Leeroo-AI / mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆472Updated 8 months ago
Alternatives and similar repositories for mergoo:
Users that are interested in mergoo are comparing it to the libraries listed below
- A bagel, with everything.☆320Updated last year
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆652Updated 11 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆236Updated last year
- ☆515Updated 5 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆237Updated 11 months ago
- An Open Source Toolkit For LLM Distillation☆586Updated this week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆215Updated 6 months ago
- Official repository for ORPO☆450Updated 11 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆719Updated 7 months ago
- Automatically evaluate your LLMs in Google Colab☆620Updated 11 months ago
- awesome synthetic (text) datasets☆278Updated 6 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆150Updated last year
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆354Updated 7 months ago
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction☆386Updated 9 months ago
- A compact LLM pretrained in 9 days by using high quality data☆312Updated 3 weeks ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆546Updated last year
- FuseAI Project☆563Updated 3 months ago
- The official evaluation suite and dynamic data release for MixEval.☆238Updated 5 months ago
- Chat Templates for 🤗 HuggingFace Large Language Models☆655Updated 4 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆235Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated 9 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆273Updated 9 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆121Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆459Updated last year
- A simple unified framework for evaluating LLMs☆209Updated 3 weeks ago
- ☆115Updated 3 weeks ago
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆693Updated last month
- Official repo for "Make Your LLM Fully Utilize the Context"☆251Updated 11 months ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆452Updated last year