Tune MPTs
☆84Jun 17, 2023Updated 2 years ago
Alternatives and similar repositories for mpttune
Users that are interested in mpttune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A repo for finetuning MPT using LoRA. It is currently configured to work with the Alpaca dataset from Stanford but can easily be adapted …☆18Jun 12, 2023Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Jul 6, 2023Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104May 20, 2025Updated 10 months ago
- Tune any FALCON in 4-bit☆463Sep 1, 2023Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆553Feb 8, 2026Updated last month
- ☆17Jun 19, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21May 18, 2025Updated 10 months ago
- tinygrad port of the RWKV large language model.☆45Mar 9, 2025Updated last year
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Mar 30, 2023Updated 2 years ago
- Large Language Model Hosting Container☆91Mar 11, 2026Updated 2 weeks ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆734May 25, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- data prep utilities for LLMs, using LLMs☆16Nov 7, 2023Updated 2 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆713Aug 13, 2024Updated last year
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Jul 13, 2021Updated 4 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago
- Less-wrong single-file Numba-accelerated Python implementation of Gotoh affine gap penalty extensions for the Needleman–Wunsch, Smith-Wat…☆12Oct 30, 2025Updated 4 months ago
- ☆37May 31, 2023Updated 2 years ago
- ☆12Oct 3, 2024Updated last year
- Paper Implementation of Self-Rewarding Language Models☆13Feb 1, 2024Updated 2 years ago
- Information and artifacts for "LoRA Learns Less and Forgets Less" (TMLR, 2024)☆20Sep 27, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Port of Facebook's LLaMA model in C/C++☆21Nov 6, 2023Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Mar 16, 2026Updated last week
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆14Jul 9, 2023Updated 2 years ago
- ☆56Jun 26, 2025Updated 9 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure C☆14Jul 24, 2023Updated 2 years ago
- ☆78Dec 26, 2023Updated 2 years ago
- A simple C++ documentation generator☆12Jun 4, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆304Jun 13, 2023Updated 2 years ago
- implement llava using candle☆15Jun 9, 2024Updated last year
- ☆19May 6, 2023Updated 2 years ago
- Generate usage documentation in Markdown format from Python scripts using argparse☆17Mar 12, 2026Updated 2 weeks ago
- ☆16Dec 11, 2023Updated 2 years ago
- Local Startup Advisor Chatbot☆32Dec 30, 2023Updated 2 years ago
- An open-source project building a customizable ChatGPT-like clone. Built with Django and Next.js, it features chat history, streaming res…☆16Mar 5, 2024Updated 2 years ago