geronimi73 / qlora-minimal
☆84Updated last year
Alternatives and similar repositories for qlora-minimal:
Users that are interested in qlora-minimal are comparing it to the libraries listed below
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- ☆48Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆87Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆106Updated 7 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆119Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- ☆28Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆160Updated last year
- Data preparation code for Amber 7B LLM☆88Updated 11 months ago
- ☆129Updated 8 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 5 months ago
- ☆92Updated last year
- ☆153Updated 9 months ago
- ☆112Updated 4 months ago
- Collection of autoregressive model implementation☆85Updated this week
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆167Updated last year
- run embeddings in MLX☆86Updated 7 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated 11 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 4 months ago
- Merge Transformers language models by use of gradient parameters.☆206Updated 8 months ago