Set of scripts to finetune LLMs
☆38Mar 30, 2024Updated last year
Alternatives and similar repositories for Various-Finetuning
Users that are interested in Various-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 6 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated last year
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 5 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.☆23Sep 3, 2025Updated 6 months ago
- ☆27Mar 13, 2024Updated 2 years ago
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Feb 12, 2026Updated last month
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆263Apr 23, 2024Updated last year
- ☆32Jan 1, 2024Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- Luber : A ridesharing App☆14Dec 13, 2017Updated 8 years ago
- A list of graph datasets for machine learning projects including Graph Neural Networks.☆29Aug 5, 2022Updated 3 years ago
- Interact with ChatGPT and GPT-4 in alternative ways☆13Mar 17, 2024Updated 2 years ago
- An automated data pipeline scaling RL to pretraining levels☆74Oct 11, 2025Updated 5 months ago
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆15Jul 19, 2025Updated 8 months ago
- ☆19Jul 25, 2025Updated 8 months ago
- ☆67Mar 4, 2024Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M cont…☆88May 9, 2024Updated last year
- An Ash extension to implement archival (soft deletion) for resources.☆26Mar 1, 2026Updated 3 weeks ago
- Genetics for Language Models☆17Jul 1, 2024Updated last year
- Feel the Vibes☆13Feb 26, 2025Updated last year
- Source code for Activated LoRA☆24Nov 22, 2025Updated 4 months ago
- ☆12Apr 17, 2024Updated last year
- Fine-tune copilot based on your codebase☆12Mar 26, 2024Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- A simple web server written in Lua☆16Sep 24, 2022Updated 3 years ago
- a small demo repo to show how I got neuralbeagle14-7b running locally on my 8GB GPU☆14Jan 29, 2024Updated 2 years ago
- Simplify Google Gemini 1.5 Pro's authentication☆14Apr 11, 2024Updated last year
- ☆15Oct 31, 2023Updated 2 years ago
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆30Apr 21, 2025Updated 11 months ago
- Simple Graph Memory for AI applications☆91Feb 23, 2026Updated last month