jondurbin / bagel
A bagel, with everything.
☆326 · Apr 11, 2024 · Updated last year
Alternatives and similar repositories for bagel
Users who are interested in bagel are comparing it to the libraries listed below.
- Customizable implementation of the self-instruct paper. ☆1,050 · Mar 7, 2024 · Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction' ☆240 · May 26, 2024 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆79 · Apr 10, 2024 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Mar 12, 2024 · Updated last year
- Tools for merging pretrained large language models. ☆6,783 · Jan 26, 2026 · Updated 2 weeks ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Go ahead and axolotl questions ☆11,289 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,084 · Jan 26, 2026 · Updated 2 weeks ago
- Generate textbook-quality synthetic LLM pretraining data ☆509 · Oct 19, 2023 · Updated 2 years ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 · Apr 29, 2024 · Updated last year
- A benchmark for emotional intelligence in large language models ☆400 · Jul 26, 2024 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Jan 7, 2024 · Updated 2 years ago
- Automatically evaluate your LLMs in Google Colab ☆685 · May 7, 2024 · Updated last year
- Multipack distributed sampler for fast padding-free training of LLMs ☆204 · Aug 10, 2024 · Updated last year
- Training LLMs with QLoRA + FSDP ☆1,537 · Nov 9, 2024 · Updated last year
- Merge Transformers language models by use of gradient parameters. ☆214 · Aug 8, 2024 · Updated last year
- Official repository for ORPO ☆471 · May 31, 2024 · Updated last year
- Create Custom LLMs ☆1,806 · Nov 8, 2025 · Updated 3 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Jun 21, 2023 · Updated 2 years ago
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,234 · May 8, 2024 · Updated last year
- Large-scale LLM inference engine ☆1,651 · Jan 21, 2026 · Updated 3 weeks ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆905 · Sep 30, 2025 · Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 · Jul 28, 2024 · Updated last year
- AllenAI's post-training codebase ☆3,573 · Updated this week
- an auto-sleeping and -waking framework around llama.cpp ☆12 · Feb 8, 2025 · Updated last year
- Tools for formatting large language model prompts. ☆13 · Dec 19, 2023 · Updated 2 years ago
- ☆74 · Sep 5, 2023 · Updated 2 years ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,669 · Apr 17, 2024 · Updated last year
- Chat language model that can use tools and interpret the results ☆1,590 · Dec 3, 2025 · Updated 2 months ago
- Simple Model Similarities Analysis ☆21 · Feb 3, 2024 · Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,885 · Updated this week
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,630 · Sep 15, 2023 · Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆108 · Apr 29, 2024 · Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆976 · Oct 22, 2024 · Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,445 · Dec 9, 2025 · Updated 2 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,879 · Jan 28, 2024 · Updated 2 years ago
- Robust recipes to align language models with human and AI preferences ☆5,495 · Sep 8, 2025 · Updated 5 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆2,911 · Sep 30, 2023 · Updated 2 years ago