Memory-efficient transformer. Work in progress.
☆19Sep 17, 2022Updated 3 years ago
Alternatives and similar repositories for lean_transformer
Users that are interested in lean_transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Secure Distributed Training at Scale" (ICML 2022)☆16Feb 4, 2025Updated last year
- ☆15Sep 15, 2022Updated 3 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆132Aug 6, 2022Updated 3 years ago
- Code for "Free-Lunch Saliency via Attention in Atari Agents"☆16Dec 18, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆32Sep 24, 2019Updated 6 years ago
- ☆12Feb 28, 2022Updated 4 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- ☆10Nov 28, 2017Updated 8 years ago
- JavaScript Implementation of the IPLD format - Ethereum Block☆12Nov 20, 2017Updated 8 years ago
- a MythX API client wrapper☆17Sep 26, 2024Updated last year
- My dotfiles: i3wm, neovim, zsh, xkb, termite.☆10Aug 6, 2020Updated 5 years ago
- ☆13Jan 11, 2017Updated 9 years ago
- Decentralized Bitcoin Backed Loan Protocol☆12Jan 4, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- PyTorch Tutorial at the LOD2021 conference☆21Oct 7, 2021Updated 4 years ago
- Send ILP payments over Ripple using XRP and payment channels☆10Feb 13, 2019Updated 7 years ago
- ☆12Feb 17, 2021Updated 5 years ago
- Label Studio + Pachyderm☆13Feb 24, 2021Updated 5 years ago
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- "Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts" (NeurIPS 2020), original PyTorch implemen…☆56Nov 5, 2020Updated 5 years ago
- Principal Feature Visualization for convolutional neural networks☆11Jan 28, 2021Updated 5 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Robust estimation of local affine maps and its applications to image matching☆16Mar 24, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models☆39Nov 4, 2025Updated 7 months ago
- A badge for join telegram chat room or channel.☆15Jan 9, 2016Updated 10 years ago
- Notes on learning to use Lua and Torch from Java.☆13Aug 22, 2016Updated 9 years ago
- Pytorch Lightning seed project with hydra☆18Oct 8, 2020Updated 5 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- Artificial intelligence emotion in a conversational UI.☆17Oct 19, 2016Updated 9 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆118Jan 13, 2022Updated 4 years ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 4 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Voice to vector [Russian]☆15Feb 5, 2017Updated 9 years ago
- Attention based aspect extraction via pytorch☆14Jun 8, 2020Updated 6 years ago
- generative models for speech☆20Jul 4, 2016Updated 9 years ago
- A C++ implementation of Network Simplex Algorithm☆11Nov 12, 2018Updated 7 years ago
- ☆16Mar 24, 2023Updated 3 years ago
- ☆21Apr 27, 2026Updated last month
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network☆15Oct 18, 2022Updated 3 years ago