iantbutler01 / ditty

A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem.

☆16

Alternatives and similar repositories for ditty

Users who are interested in ditty are comparing it to the libraries listed below.

ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆35 Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45 Updated last year
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆78 Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that fine-tuning a RoPE model on sequences longer than those used in pre-training adapts the model's context limit
☆63 Updated 2 years ago
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆38 Updated 5 months ago
Zyphra / Zyda_processing
☆39 Updated last year
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆44 Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70 Updated 2 years ago
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" using PyTorch and Zeta
☆13 Updated last year
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24 Updated last week
catid / lllm
Latent Large Language Models
☆19 Updated last year
laramohan / wikillm
LLMs as Collaboratively Edited Knowledge Bases
☆46 Updated last year
EQ-bench / eqbench3
☆36 Updated 3 months ago
arcee-ai / DAM
☆55 Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 9 months ago
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆24 Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58 Updated last month
kyegomez / OpenStrawberry
An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆29 Updated this week
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56 Updated last year
Zyphra / zcookbook
Training hybrid models for dummies.
☆29 Updated 3 weeks ago
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14 Updated 2 years ago
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆45 Updated 10 months ago
ArmelRandy / tree-of-problems
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆19 Updated 8 months ago
VITA-Group / ChainCoder
[ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …
☆43 Updated 2 years ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆58 Updated last year
xjdr-alt / muzero_sketch
☆40 Updated last year
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆19 Updated last week
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45 Updated last year
IST-DASLab / QIGen
Repository for CPU Kernel Generation for LLM Inference
☆27 Updated 2 years ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆40 Updated last year