ml-gde / jflux
JAX Implementation of Black Forest Labs' Flux.1 family of models
☆14Updated last month
Related projects
Alternatives and complementary repositories for jflux
- ☆22Updated last year
- A miniature AI training framework for PyTorch☆34Updated last year
- Collection of autoregressive model implementations☆67Updated this week
- Utilities for PyTorch distributed☆23Updated last year
- ☆21Updated last week
- ☆73Updated 4 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated 10 months ago
- ☆24Updated last year
- ☆49Updated 8 months ago
- ☆62Updated last month
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆80Updated 11 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆18Updated 3 months ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training☆113Updated 7 months ago
- A port of the Mistral-7B model to JAX☆30Updated 4 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆85Updated 2 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆58Updated 4 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆36Updated 3 weeks ago
- An introduction to LLM Sampling☆64Updated last week
- ML/DL Math and Method notes☆57Updated 11 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆29Updated 2 weeks ago
- Scripts to prep PC for development use after OS installs☆37Updated last week
- A place to store reusable transformer components of my own creation or found on the interwebs☆44Updated 2 weeks ago
- Train vision models using JAX and 🤗 transformers☆95Updated 3 weeks ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU-Training☆46Updated 10 months ago
- Comprehensive analysis of the differences in performance between QLoRA, LoRA, and full finetunes.☆81Updated last year
- A set of Python scripts that makes your experience on TPU better☆40Updated 4 months ago
- QLoRA for Masked Language Modeling☆20Updated last year
- ☆39Updated 10 months ago
- ☆20Updated last year