Memory-efficient transformer. Work in progress.
☆19Sep 17, 2022Updated 3 years ago
Alternatives and similar repositories for lean_transformer
Users that are interested in lean_transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Dec 28, 2021Updated 4 years ago
- ☆15Sep 15, 2022Updated 3 years ago
- Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)☆27May 29, 2023Updated 2 years ago
- A peer to peer machine intelligence benchmark☆30Mar 24, 2023Updated 3 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆132Aug 6, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Demo diagnosis tools for Covid-19 Chest Xray. This repo is implemented to our paper "Automatic detection of Covid-19 from chest X-ray and…☆13May 2, 2023Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Aug 4, 2022Updated 3 years ago
- Portable TLauncher Minecraft Launcher☆12Dec 29, 2023Updated 2 years ago
- CLI tool for participating in Cosmos Fundraiser☆12Jun 29, 2020Updated 5 years ago
- ICCV 2019 Tutorial: Global Optimization for Geometric Understanding with Provable Guarantees☆15Oct 20, 2022Updated 3 years ago
- Utilities for Neural Network training☆19Dec 9, 2020Updated 5 years ago
- ☆10Nov 28, 2017Updated 8 years ago
- ☆14Apr 12, 2017Updated 9 years ago
- Simple repository contribution statistics☆15Apr 6, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- A Tetris(TM)-clone for the console written in C, using the ncurses-library.☆16Jan 7, 2025Updated last year
- PyTorch Tutorial at the LOD2021 conference☆21Oct 7, 2021Updated 4 years ago
- Label Studio + Pachyderm☆13Feb 24, 2021Updated 5 years ago
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- Libp2p bindings for Python☆12Mar 21, 2026Updated 3 weeks ago
- [ICML 2024] Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoning☆11Jun 1, 2024Updated last year
- Semantic Segmentation on the LaPa dataset using Pytorch Lightning☆14Nov 4, 2021Updated 4 years ago
- Optimal Transport and Optimization related experiments.☆10Jul 22, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Principal Feature Visualization for convolutional neural networks☆11Jan 28, 2021Updated 5 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Robust estimation of local affine maps and its applications to image matching☆16Mar 24, 2023Updated 3 years ago
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago
- Talking Graphs: Your Data Speaks Up☆17Jan 23, 2024Updated 2 years ago
- A badge for join telegram chat room or channel.☆15Jan 9, 2016Updated 10 years ago
- ☆22Jan 5, 2022Updated 4 years ago
- Notes on learning to use Lua and Torch from Java.☆13Aug 22, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A forest of autonomous agents.☆20Jan 27, 2025Updated last year
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- Temporal Network Noise Contrastive Estimation (teneNCE) for Dynamic Link Prediction☆15Aug 31, 2024Updated last year
- ☆54Nov 3, 2024Updated last year
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 4 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- ☆19Nov 5, 2025Updated 5 months ago