Set of scripts to finetune LLMs
☆38Mar 30, 2024Updated 2 years ago
Alternatives and similar repositories for Various-Finetuning
Users that are interested in Various-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 7 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆25Jul 12, 2025Updated 9 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 6 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Aug 17, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.☆23Sep 3, 2025Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- ☆27Mar 13, 2024Updated 2 years ago
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆297Feb 12, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆268Apr 23, 2024Updated 2 years ago
- ☆32Jan 1, 2024Updated 2 years ago
- Luber : A ridesharing App☆14Dec 13, 2017Updated 8 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- alternative way to calculating self attention☆18May 25, 2024Updated last year
- Interact with ChatGPT and GPT-4 in alternative ways☆13Mar 17, 2024Updated 2 years ago
- R package for online training of regression models using FTRL Proximal☆12Feb 7, 2017Updated 9 years ago
- An automated data pipeline scaling RL to pretraining levels☆76Oct 11, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆824Jul 15, 2025Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆67Mar 4, 2024Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆37Jul 6, 2023Updated 2 years ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆57Aug 25, 2024Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Efficient Infinite Context Transformers with Infini-attention Pytorch Implementation + QwenMoE Implementation + Training Script + 1M cont…☆91May 9, 2024Updated last year
- Feel the Vibes☆13Feb 26, 2025Updated last year
- Opper Python SDK☆19Jan 2, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An Ash extension to implement archival (soft deletion) for resources.☆27Apr 2, 2026Updated last month
- An open source ATS that implements the HR Open Standards data model☆19Nov 4, 2022Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- a simple variational auto encoder with some exploration☆12Nov 22, 2024Updated last year
- Fine-tune copilot based on your codebase☆12Mar 26, 2024Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- An introduction to DSPy☆34Aug 30, 2025Updated 8 months ago