A bagel, with everything.
☆326 · Apr 11, 2024 · Updated last year
Alternatives and similar repositories for bagel
Users that are interested in bagel are comparing it to the libraries listed below.
- Customizable implementation of the self-instruct paper. ☆1,052 · Mar 7, 2024 · Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆241 · May 26, 2024 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆79 · Apr 10, 2024 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Mar 12, 2024 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Tools for merging pretrained large language models. ☆6,895 · Mar 15, 2026 · Updated 2 weeks ago
- Go ahead and axolotl questions ☆11,508 · Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models ☆181 · May 2, 2024 · Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,143 · Updated this week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 · Apr 29, 2024 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Jan 7, 2024 · Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data ☆509 · Oct 19, 2023 · Updated 2 years ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,407 · Apr 11, 2024 · Updated last year
- ☆74 · Sep 5, 2023 · Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆206 · Aug 10, 2024 · Updated last year
- Automatically evaluate your LLMs in Google Colab ☆687 · May 7, 2024 · Updated last year
- evol augment any dataset online ☆61 · Aug 3, 2023 · Updated 2 years ago
- Training LLMs with QLoRA + FSDP ☆1,539 · Nov 9, 2024 · Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,234 · May 8, 2024 · Updated last year
- Official repository for ORPO ☆473 · May 31, 2024 · Updated last year
- Merge Transformers language models by use of gradient parameters. ☆214 · Aug 8, 2024 · Updated last year
- A benchmark for emotional intelligence in large language models ☆419 · Jul 26, 2024 · Updated last year
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,686 · Apr 17, 2024 · Updated last year
- AllenAI's post-training codebase ☆3,643 · Mar 23, 2026 · Updated last week
- Create Custom LLMs ☆1,820 · Nov 8, 2025 · Updated 4 months ago
- Collection of autoregressive model implementation ☆85 · Feb 23, 2026 · Updated last month
- Just a bunch of benchmark logs for different LLMs ☆120 · Jul 28, 2024 · Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆284 · Jul 11, 2024 · Updated last year
- Tools for formatting large language model prompts. ☆13 · Dec 19, 2023 · Updated 2 years ago
- Large-scale LLM inference engine ☆1,681 · Mar 12, 2026 · Updated 2 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 · Feb 29, 2024 · Updated 2 years ago
- Robust recipes to align language models with human and AI preferences ☆5,535 · Sep 8, 2025 · Updated 6 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,476 · Mar 4, 2026 · Updated 3 weeks ago
- Using multiple LLMs for ensemble Forecasting ☆16 · Jan 17, 2024 · Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,965 · Mar 16, 2026 · Updated 2 weeks ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 · Feb 4, 2024 · Updated 2 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆45 · Jan 11, 2024 · Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,631 · Sep 15, 2023 · Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆263 · Apr 23, 2024 · Updated last year