AblateIt / finetune-studyView external linksLinks
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆83Sep 10, 2023Updated 2 years ago
Alternatives and similar repositories for finetune-study
Users that are interested in finetune-study are comparing it to the libraries listed below
Sorting:
- ☆22Aug 27, 2023Updated 2 years ago
- clean up your LLM datasets☆114May 30, 2023Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆37Aug 8, 2023Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Jan 4, 2025Updated last year
- ☆95Jul 26, 2023Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark☆427Sep 12, 2023Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- Knowledge Graph Generator app☆34Apr 18, 2024Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- Multi-Domain Expert Learning☆67Jan 23, 2024Updated 2 years ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Sep 6, 2023Updated 2 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- batched loras☆349Sep 6, 2023Updated 2 years ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated last year
- Content, code, and resources for the book How to Train Your Robot.☆25Dec 2, 2023Updated 2 years ago
- ☆74Sep 5, 2023Updated 2 years ago
- Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops☆30Mar 16, 2024Updated last year
- data cleaning and curation for unstructured text☆328Aug 6, 2024Updated last year
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Aug 22, 2022Updated 3 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Feb 27, 2024Updated last year
- ☆63Sep 23, 2024Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆72Feb 6, 2026Updated last week
- ☀️☆12Aug 24, 2022Updated 3 years ago
- ☆15Oct 31, 2023Updated 2 years ago
- Highly commented implementations of Transformers in PyTorch☆138Aug 2, 2023Updated 2 years ago
- ☆13May 7, 2023Updated 2 years ago
- ☆415Nov 2, 2023Updated 2 years ago
- [CVPR-2023] Towards Any Structural Pruning☆17Apr 27, 2023Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- ☆15Sep 8, 2023Updated 2 years ago
- ☆14Jul 21, 2023Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆204Aug 10, 2024Updated last year
- ☆14Jul 26, 2023Updated 2 years ago
- Website for Applied-LLMs work☆27Jan 13, 2026Updated last month
- An abstract P2E engine, allowing developers an interface to develop minigames and turn them over to their communities☆15Apr 10, 2022Updated 3 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17May 3, 2024Updated last year