official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70Aug 27, 2023Updated 2 years ago
Alternatives and similar repositories for zeus-llm-trainer
Users that are interested in zeus-llm-trainer are comparing it to the libraries listed below
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Jul 6, 2023Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆16Aug 23, 2023Updated 2 years ago
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jun 1, 2023Updated 2 years ago
- Tools for the LLaMA language model☆12Apr 4, 2023Updated 2 years ago
- Demonstration that finetuning a RoPE model on sequences longer than those used in pre-training extends the model's context limit☆63Jun 21, 2023Updated 2 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- Comprehensive analysis of the differences in performance between QLoRA, LoRA, and full finetunes.☆83Sep 10, 2023Updated 2 years ago
- Experiments with self-supervised learning☆11Mar 9, 2020Updated 5 years ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Jan 4, 2025Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- CartPole Gym OpenAI solution using QLearning with Keras☆11Jun 21, 2016Updated 9 years ago
- An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS), with implementation and experiment details.☆14Mar 20, 2024Updated last year
- ☆40Mar 25, 2023Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆204Aug 10, 2024Updated last year
- An introduction to DSPy☆33Aug 30, 2025Updated 5 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- Flask version of the model visualization tool Netron☆19Jul 20, 2022Updated 3 years ago
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆713Aug 13, 2024Updated last year
- Kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting☆16Sep 12, 2017Updated 8 years ago
- Developer showcase of projects built on Cartesia☆20Aug 28, 2024Updated last year
- Make triton easier☆50Jun 12, 2024Updated last year
- Collection of various text datasets to assist ML researchers in training or fine-tuning their models☆20Apr 1, 2023Updated 2 years ago
- Simple Model Similarities Analysis☆21Feb 3, 2024Updated 2 years ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- ☆114Oct 28, 2025Updated 3 months ago
- ☆415Nov 2, 2023Updated 2 years ago
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- QLoRA for Masked Language Modeling☆22Sep 11, 2023Updated 2 years ago
- Experiments with BitNet inference on CPU☆55Apr 1, 2024Updated last year
- Unofficial implementation of AlpaGasus☆94Sep 23, 2023Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated last year
- Reproducible Language Agent Research☆33Jun 25, 2025Updated 7 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆31Apr 1, 2025Updated 10 months ago
- Projects developed by Domino's R&D team☆77Apr 14, 2022Updated 3 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 4 months ago