georgesung / llm_qloraLinks
Fine-tuning LLMs using QLoRA
☆260Updated last year
Alternatives and similar repositories for llm_qlora
Users that are interested in llm_qlora are comparing it to the libraries listed below
Sorting:
- A bagel, with everything.☆323Updated last year
- Tune any FALCON in 4-bit☆465Updated last year
- ☆168Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆206Updated 11 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- ☆416Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆162Updated last year
- ☆122Updated 2 years ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated last year
- Customizable implementation of the self-instruct paper.☆1,048Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆502Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆717Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆489Updated 11 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 9 months ago
- Small finetuned LLMs for a diverse set of useful tasks☆127Updated 2 years ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆702Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- ☆535Updated last year
- Automatically evaluate your LLMs in Google Colab☆649Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- A joint community effort to create one central leaderboard for LLMs.☆304Updated 11 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆185Updated last year
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆240Updated last year
- TheBloke's Dockerfiles☆305Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆423Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆175Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆325Updated 8 months ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago