facebookresearch / schedule_free
Schedule-Free Optimization in PyTorch
☆2,061 Updated last month
Alternatives and similar repositories for schedule_free:
Users who are interested in schedule_free are comparing it to the libraries listed below:
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes… ☆1,774 Updated 2 weeks ago
- PyTorch native quantization and sparsity for training and inference ☆1,753 Updated this week
- Puzzles for learning Triton ☆1,300 Updated last month
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,257 Updated this week
- A PyTorch native library for large model training ☆3,091 Updated this week
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in PyTorch ☆2,080 Updated last month
- 🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton ☆1,669 Updated this week
- Official repository for our work on micro-budget training of large-scale diffusion models. ☆794 Updated this week
- UNet diffusion model in pure CUDA ☆596 Updated 6 months ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆2,516 Updated 3 weeks ago
- Tensors, for human consumption ☆1,178 Updated last month
- 4M: Massively Multimodal Masked Modeling ☆1,666 Updated 3 months ago
- For optimization algorithm research and development. ☆484 Updated this week
- TensorDict is a PyTorch-dedicated tensor container. ☆862 Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,714 Updated last month
- NanoGPT (124M) in 3.4 minutes ☆2,068 Updated last week
- Tile primitives for speedy kernels ☆1,923 Updated this week
- Helpful tools and examples for working with flex-attention ☆583 Updated this week
- Structured state space sequence models ☆2,524 Updated 6 months ago
- Annotated version of the Mamba paper ☆469 Updated 10 months ago
- Code for BLT research paper ☆1,314 Updated this week
- Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/ ☆1,300 Updated last week
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆644 Updated this week
- A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets. ☆696 Updated 8 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆505 Updated 2 months ago
- Implementation of Rotary Embeddings, from the Roformer paper, in PyTorch ☆609 Updated last month
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate" ☆408 Updated last month
- What would you do with 1000 H100s... ☆948 Updated last year
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,880 Updated 7 months ago
- torchview: visualize PyTorch models ☆857 Updated 2 months ago