A bagel, with everything.
☆326Apr 11, 2024Updated 2 years ago
Alternatives and similar repositories for bagel
Users that are interested in bagel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Customizable implementation of the self-instruct paper.☆1,053Mar 7, 2024Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆79Apr 10, 2024Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Tools for merging pretrained large language models.☆7,052Mar 15, 2026Updated last month
- Go ahead and axolotl questions☆11,842May 1, 2026Updated last week
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Apr 29, 2024Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,209Apr 27, 2026Updated last week
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Jan 7, 2024Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,408Apr 11, 2024Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multipack distributed sampler for fast padding-free training of LLMs☆208Aug 10, 2024Updated last year
- evol augment any dataset online☆61Aug 3, 2023Updated 2 years ago
- Automatically evaluate your LLMs in Google Colab☆688May 7, 2024Updated 2 years ago
- Training LLMs with QLoRA + FSDP☆1,542Nov 9, 2024Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,239May 8, 2024Updated 2 years ago
- Official repository for ORPO☆483May 31, 2024Updated last year
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- YaRN: Efficient Context Window Extension of Large Language Models☆1,710Apr 17, 2024Updated 2 years ago
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AllenAI's post-training codebase☆3,708May 3, 2026Updated last week
- Just a bunch of benchmark logs for different LLMs☆124Jul 28, 2024Updated last year
- A benchmark for emotional intelligence in large language models☆424Jul 26, 2024Updated last year
- Create Custom LLMs☆1,835Apr 24, 2026Updated 2 weeks ago
- Tools for formatting large language model prompts.☆13Dec 19, 2023Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆287Jul 11, 2024Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Feb 29, 2024Updated 2 years ago
- Large-scale LLM inference engine☆1,719Updated this week
- Robust recipes to align language models with human and AI preferences☆5,593Apr 8, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,514Mar 4, 2026Updated 2 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Feb 4, 2024Updated 2 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆46Jan 11, 2024Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,648Sep 15, 2023Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆268Apr 23, 2024Updated 2 years ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,033Apr 20, 2026Updated 2 weeks ago