Tune MPTs
☆84Jun 17, 2023Updated 2 years ago
Alternatives and similar repositories for mpttune
Users that are interested in mpttune are comparing it to the libraries listed below
Sorting:
- Tune any FALCON in 4-bit☆463Sep 1, 2023Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104May 20, 2025Updated 9 months ago
- ☆17Jun 19, 2023Updated 2 years ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- Extism Plug-in Development Kit (PDK) for C☆14Oct 22, 2024Updated last year
- ☆553Feb 8, 2026Updated 3 weeks ago
- Wasm based bindings for cue in javascript☆13Dec 8, 2022Updated 3 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58May 20, 2023Updated 2 years ago
- ☆13Jan 27, 2019Updated 7 years ago
- An open-source project building a customizable ChatGPT-like clone. Built with Django and Next.js, it features chat history, streaming res…☆16Mar 5, 2024Updated 2 years ago
- ☆15Sep 8, 2023Updated 2 years ago
- ☆16Jul 20, 2023Updated 2 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆712Aug 13, 2024Updated last year
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21May 18, 2025Updated 9 months ago
- Generate usage documentation in Markdown format from Python scripts using argparse☆17Jan 14, 2026Updated last month
- Large Language Model Hosting Container☆91Oct 9, 2025Updated 4 months ago
- ☆21May 27, 2023Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++☆21Nov 6, 2023Updated 2 years ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits☆734May 25, 2024Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- ☆56Jun 26, 2025Updated 8 months ago
- An automation platform for graphically modeled workflows. Focus on network automation. Open Source under Apache License.☆11Nov 13, 2025Updated 3 months ago
- Convert all of libgen to high quality markdown☆255Dec 13, 2023Updated 2 years ago
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆346Dec 16, 2024Updated last year
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Apr 4, 2023Updated 2 years ago
- Example ML projects that use the Determined library.☆33Sep 11, 2024Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆629Feb 4, 2024Updated 2 years ago
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆34Dec 5, 2025Updated 3 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Jul 4, 2022Updated 3 years ago
- ☆37May 31, 2023Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆304Jun 13, 2023Updated 2 years ago
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration☆3,443Jul 17, 2025Updated 7 months ago
- Official repository for LongChat and LongEval☆533May 24, 2024Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago