official-elinas/zeus-llm-trainer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/official-elinas/zeus-llm-trainer)

official-elinas / zeus-llm-trainer

Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models

☆69

Alternatives and similar repositories for zeus-llm-trainer

Users that are interested in zeus-llm-trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

teknium1 / stanford_alpaca-replit
View on GitHub
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆47Jun 1, 2023Updated 3 years ago
vibrantlabsai / Funtuner
View on GitHub
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆37Jul 6, 2023Updated 3 years ago
dmahan93 / lm-evaluation-harness
View on GitHub
A framework for few-shot evaluation of autoregressive language models.
☆16Aug 23, 2023Updated 2 years ago
teknium1 / RawTransform
View on GitHub
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆34May 29, 2023Updated 3 years ago
jb-01 / LoRA-TLE
View on GitHub
Token-level adaptation of LoRA matrices for downstream task generalization.
☆15Apr 14, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AblateIt / finetune-study
View on GitHub
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Sep 10, 2023Updated 2 years ago
Aemon-Algiz / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆11Jun 1, 2023Updated 3 years ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
teknium1 / LLM-Logbook
View on GitHub
Public reports detailing responses to sets of prompts by Large Language Models.
☆40Jan 4, 2025Updated last year
mlabonne / tinytuner
View on GitHub
🐜🔧 A minimalistic tool to fine-tune your LLMs
☆19Aug 17, 2023Updated 2 years ago
SebastianBodza / EnsembleForecasting
View on GitHub
Using multiple LLMs for ensemble Forecasting
☆16Jan 17, 2024Updated 2 years ago
gauss5930 / iDUS
View on GitHub
An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.
☆14Mar 20, 2024Updated 2 years ago
Pleias / Various-Finetuning
View on GitHub
Set of scripts to finetune LLMs
☆38Mar 30, 2024Updated 2 years ago
argilla-io / distilabel-spin-dibt
View on GitHub
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Mar 12, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Birch-san / booru-embed
View on GitHub
[WIP] Transformer to embed Danbooru labelsets
☆13Mar 31, 2024Updated 2 years ago
Pervasive-AI-Lab / LuckyMera
View on GitHub
☆16Oct 4, 2024Updated last year
fblgit / model-similarity
View on GitHub
Simple Model Similarities Analysis
☆21Feb 3, 2024Updated 2 years ago
ModelCloud / Evalution
View on GitHub
Evalution: evolve your LLMs with better evals.
☆16Updated this week
imoneoi / multipack
View on GitHub
Multipack distributed sampler for fast padding-free training of LLMs
☆207Aug 10, 2024Updated last year
JoshVarty / SelfSupervisedLearning
View on GitHub
Experiments with self-supervised learning
☆11Mar 9, 2020Updated 6 years ago
imoneoi / bf16_fused_adam
View on GitHub
BFloat16 Fused Adam Operator for PyTorch
☆20Nov 16, 2024Updated last year
teknium1 / LLM-Benchmark-Logs
View on GitHub
Just a bunch of benchmark logs for different LLMs
☆130Jul 28, 2024Updated last year
AlpinDale / sparsegpt-for-LLaMA
View on GitHub
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71Mar 30, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / NeuralMemory
View on GitHub
A Data Source for Reasoning Embodied Agents
☆20Sep 18, 2023Updated 2 years ago
cloneofsimo / fim-llama-deepspeed
View on GitHub
☆32Jan 1, 2024Updated 2 years ago
NERSC / intro-HPC-bootcamp-2023
View on GitHub
☆14Sep 7, 2023Updated 2 years ago
uukuguy / multi_loras
View on GitHub
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆162Feb 9, 2024Updated 2 years ago
FanaHOVA / smol-scheduler
View on GitHub
🐣🕐📅 A simple utility to draft scheduling emails.
☆12Sep 13, 2023Updated 2 years ago
UpstageAI / evalverse-IFEval
View on GitHub
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…
☆15May 4, 2024Updated 2 years ago
para-lost / ReBase
View on GitHub
ReBase: Training Task Experts through Retrieval Based Distillation
☆28Feb 5, 2025Updated last year
ctlllll / understanding_llm_benchmarks
View on GitHub
Understanding the correlation between different LLM benchmarks
☆30Jan 11, 2024Updated 2 years ago
SkunkworksAI / hydra-moe
View on GitHub
☆416Nov 2, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
reactorsh / ambrosia
View on GitHub
clean up your LLM datasets
☆113May 30, 2023Updated 3 years ago
jondurbin / airoboros
View on GitHub
Customizable implementation of the self-instruct paper.
☆1,051Mar 7, 2024Updated 2 years ago
gpt4life / alpagasus
View on GitHub
Unofficial implementation of AlpaGasus
☆94Sep 23, 2023Updated 2 years ago
UmerHA / triton_util
View on GitHub
Make triton easier
☆49Jun 12, 2024Updated 2 years ago
Pleias / marginalia
View on GitHub
☆67Mar 4, 2024Updated 2 years ago
Naman-ntc / FastCode
View on GitHub
Utilities for efficient fine-tuning, inference and evaluation of code generation models
☆20Oct 3, 2023Updated 2 years ago
deep-diver / LLM-Pref-Mark-UI
View on GitHub
☆37May 31, 2023Updated 3 years ago