Tune MPTs
☆84 · Jun 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for mpttune
Users who are interested in mpttune are comparing it to the libraries listed below.
- A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted … ☆18 · Jun 12, 2023 · Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆99 · Jul 6, 2023 · Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆105 · Updated this week
- Tune any FALCON in 4-bit ☆462 · Sep 1, 2023 · Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 · May 20, 2023 · Updated 2 years ago
- ☆553 · Feb 8, 2026 · Updated 2 months ago
- ☆12 · Jun 18, 2019 · Updated 6 years ago
- ☆17 · Jun 19, 2023 · Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · May 22, 2024 · Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 · May 6, 2023 · Updated 3 years ago
- A finetuning pipeline for instruct tuning Raven 14bn using QLORA 4bit and the Ditty finetuning library ☆28 · Jun 5, 2024 · Updated last year
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆22 · May 18, 2025 · Updated 11 months ago
- tinygrad port of the RWKV large language model. ☆44 · Mar 9, 2025 · Updated last year
- Wasm based bindings for cue in javascript ☆13 · Dec 8, 2022 · Updated 3 years ago
- ☆15 · Sep 8, 2023 · Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation. ☆71 · Mar 30, 2023 · Updated 3 years ago
- Large Language Model Hosting Container ☆92 · Apr 13, 2026 · Updated 3 weeks ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits ☆732 · May 25, 2024 · Updated last year
- data prep utilities for LLMs, using LLMs ☆16 · Nov 7, 2023 · Updated 2 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization ☆718 · Aug 13, 2024 · Updated last year
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123 ☆12 · Jul 13, 2021 · Updated 4 years ago
- The Sinclair ZX Spectrum BASIC compiler! ☆12 · Jan 27, 2026 · Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆60 · Oct 18, 2025 · Updated 6 months ago
- An ONNX converter script focused on embedding models ☆33 · Jan 14, 2025 · Updated last year
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- ☆37 · May 31, 2023 · Updated 2 years ago
- ☆12 · Oct 3, 2024 · Updated last year
- A repository for a Java implementation of AutoGPT. It is heavily inspired by AutoGPT if not a clone of some of its functionality, though … ☆27 · May 9, 2023 · Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem. ☆16 · Apr 27, 2026 · Updated last week
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆38 · Jun 6, 2023 · Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Jun 21, 2023 · Updated 2 years ago
- ☆27 · Jan 8, 2024 · Updated 2 years ago
- ☆78 · Dec 26, 2023 · Updated 2 years ago
- Using elm to make a simple chatroom ☆15 · Nov 23, 2016 · Updated 9 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆304 · Jun 13, 2023 · Updated 2 years ago
- Experimental sampler to make LLMs more creative ☆31 · Aug 2, 2023 · Updated 2 years ago
- ☆19 · May 6, 2023 · Updated 3 years ago
- Pydantic-based HTTP forms ☆19 · Jun 2, 2025 · Updated 11 months ago
- ☆16 · Dec 11, 2023 · Updated 2 years ago