Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
☆132Aug 6, 2022Updated 3 years ago
Alternatives and similar repositories for revlib
Users that are interested in revlib are comparing it to the libraries listed below
Sorting:
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Jan 20, 2024Updated 2 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆206Apr 24, 2024Updated last year
- Contrastive Language-Image Pretraining☆144Sep 6, 2022Updated 3 years ago
- Code for the paper "Secure Distributed Training at Scale" (ICML 2022)☆16Feb 4, 2025Updated last year
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- Transformers with doubly stochastic attention☆54Sep 14, 2022Updated 3 years ago
- Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)☆27May 29, 2023Updated 2 years ago
- ☆33Mar 1, 2023Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch☆96Aug 28, 2021Updated 4 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆55Sep 18, 2022Updated 3 years ago
- Libp2p bindings for Python☆12Jan 26, 2026Updated last month
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆259Oct 29, 2023Updated 2 years ago
- ☆26May 9, 2022Updated 3 years ago
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP☆253Feb 6, 2023Updated 3 years ago
- ☆15Feb 28, 2022Updated 4 years ago
- Neural Arithmetic Logic Units by Trask et al.☆12Apr 10, 2019Updated 6 years ago
- An open source implementation of CLIP.☆33Nov 7, 2022Updated 3 years ago
- CLASP - Contrastive Language-Aminoacid Sequence Pretraining☆142Sep 17, 2021Updated 4 years ago
- Stores the MHub models dockerfiles and scripts.☆11Jan 28, 2026Updated last month
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Nov 2, 2019Updated 6 years ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆18May 17, 2021Updated 4 years ago
- ☆65Nov 4, 2021Updated 4 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241May 12, 2023Updated 2 years ago
- ☆32Sep 24, 2019Updated 6 years ago
- A fully invertible U-Net for memory efficiency in Pytorch.☆130Jul 9, 2022Updated 3 years ago
- Utilities for working with W&B and PyTorch Lightning in an educational context☆15Aug 4, 2021Updated 4 years ago
- ☆21Sep 24, 2020Updated 5 years ago
- ARC Community Project☆22Aug 2, 2024Updated last year
- Library for 8-bit optimizers and quantization routines.☆780Aug 18, 2022Updated 3 years ago
- Computational Neuroscience stuff☆13Aug 12, 2019Updated 6 years ago
- ☆14Dec 28, 2021Updated 4 years ago
- ☆10Oct 18, 2024Updated last year
- ☆18Oct 3, 2024Updated last year
- ☆39Oct 3, 2022Updated 3 years ago
- Implementation of accurate coresets for known problems from the field of machine learning.☆11Nov 21, 2019Updated 6 years ago