yk / litter

☆70

Alternatives and similar repositories for litter

Users who are interested in litter compare it to the libraries listed below.

xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87 Updated last year
ml-gde / jflux
JAX Implementation of Black Forest Labs' Flux.1 family of models
☆39 Updated last month
lucidrains / llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
☆167 Updated 8 months ago
lucidrains / TPDNE
Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it.
☆90 Updated 2 years ago
borisdayma / clip-jax
Train vision models using JAX and 🤗 transformers
☆99 Updated last month
NolanoOrg / smol-gpt
Smol but mighty language model
☆61 Updated 2 years ago
sytelus / pcprep
Various handy scripts to quickly set up new Linux and Windows sandboxes, containers and WSL.
☆40 Updated last week
closedai-project / closedai
Drop-in replacement for OpenAI, but with open models.
☆153 Updated 2 years ago
geov-ai / geov
The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121 Updated 2 years ago
alvarobartt / safejax
Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`
☆47 Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filtered 8K sequences from the Pile
☆115 Updated 2 years ago
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆188 Updated 3 years ago
dvruette / barrel-rec-pytorch
☆53 Updated last year
CERC-AAI / Robin
☆63 Updated last year
xrsrke / stable-diffusion-from-scratch
Implementation of Stable Diffusion from scratch [WORK IN PROGRESS]
☆21 Updated 2 years ago
patil-suraj / stable-diffusion-jax
☆89 Updated 3 years ago
yk / llmvm
☆30 Updated last year
crowsonkb / LDLM
Latent Diffusion Language Models
☆68 Updated 2 years ago
rasbt / pytorch-memory-optim
This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…
☆92 Updated 2 years ago
EleutherAI / magiCARP
One stop shop for all things carp
☆59 Updated 3 years ago
joey00072 / ohara
Collection of autoregressive model implementations
☆86 Updated 5 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆102 Updated 9 months ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆68 Updated last year
ConiferLabsWA / flan-ul2-alpaca
☆33 Updated 2 years ago
euclaise / supertrainer2000
☆49 Updated last year
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103 Updated 5 months ago
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training
☆132 Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆67 Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93 Updated 3 weeks ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 Updated last year