Set of scripts to finetune LLMs
☆38Mar 30, 2024Updated 2 years ago
Alternatives and similar repositories for Various-Finetuning
Users that are interested in Various-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 9 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Oct 9, 2025Updated 6 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- A Flutter plugin for integrating Liquid AI's LEAP SDK, enabling on-device deployment of small language models in Flutter applications.☆23Sep 3, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- ☆26Mar 13, 2024Updated 2 years ago
- Understanding the correlation between different LLM benchmarks☆29Jan 11, 2024Updated 2 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆295Feb 12, 2026Updated 2 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆266Apr 23, 2024Updated last year
- Code repository corresponding to the paper "Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation" (NAACL 2024…☆10May 31, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆32Jan 1, 2024Updated 2 years ago
- Research in compressing convolutional layers of CNN using low-rank Tucker tensor decomposition☆11Nov 1, 2023Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Aug 27, 2023Updated 2 years ago
- Interact with ChatGPT and GPT-4 in alternative ways☆13Mar 17, 2024Updated 2 years ago
- The ISC Anomaly Detection and Classification Framework implemented for Apache Flink.☆13Dec 14, 2016Updated 9 years ago
- An automated data pipeline scaling RL to pretraining levels☆75Oct 11, 2025Updated 6 months ago
- Finetune Sesame's CSM 1B model, for fun and profit☆17Mar 24, 2025Updated last year
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 8 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆824Jul 15, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- ☆67Mar 4, 2024Updated 2 years ago
- ☆166Aug 8, 2025Updated 8 months ago
- Implementation of algorithm S3VDC (Simple, Scalable, and Stable Variational Deep Clustering)☆12Jul 30, 2024Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed