Memory-efficient transformer. Work in progress.
☆19 · Sep 17, 2022 · Updated 3 years ago
Alternatives and similar repositories for lean_transformer
Users who are interested in lean_transformer are comparing it to the libraries listed below.
- ☆15 · Sep 15, 2022 · Updated 3 years ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages ☆11 · Feb 9, 2025 · Updated last year
- Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo) ☆27 · May 29, 2023 · Updated 2 years ago
- A peer to peer machine intelligence benchmark ☆29 · Mar 24, 2023 · Updated 3 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs. ☆47 · Nov 2, 2019 · Updated 6 years ago
- ☆15 · Aug 3, 2021 · Updated 4 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆132 · Aug 6, 2022 · Updated 3 years ago
- Code for "Free-Lunch Saliency via Attention in Atari Agents" ☆16 · Dec 18, 2020 · Updated 5 years ago
- ☆32 · Sep 24, 2019 · Updated 6 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 · Aug 4, 2022 · Updated 3 years ago
- ☆12 · Feb 28, 2022 · Updated 4 years ago
- ☆13 · Aug 7, 2021 · Updated 4 years ago
- CLI tool for participating in Cosmos Fundraiser ☆12 · Jun 29, 2020 · Updated 5 years ago
- ICCV 2019 Tutorial: Global Optimization for Geometric Understanding with Provable Guarantees ☆15 · Oct 20, 2022 · Updated 3 years ago
- Based on SciPy's normalized git stats, adapted for Deep Learning frameworks ☆16 · Feb 15, 2017 · Updated 9 years ago
- ☆10 · Nov 28, 2017 · Updated 8 years ago
- ☆14 · Apr 12, 2017 · Updated 8 years ago
- a MythX API client wrapper ☆17 · Sep 26, 2024 · Updated last year
- A Tetris(TM)-clone for the console written in C, using the ncurses-library. ☆16 · Jan 7, 2025 · Updated last year
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding ☆14 · Jul 22, 2024 · Updated last year
- ☆13 · Jan 11, 2017 · Updated 9 years ago
- ☆12 · Feb 17, 2021 · Updated 5 years ago
- Label Studio + Pachyderm ☆13 · Feb 24, 2021 · Updated 5 years ago
- Receiver operating characteristic curve (ROC) computation code in C++ ☆11 · Jul 17, 2017 · Updated 8 years ago
- [ICML 2024] Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoning ☆11 · Jun 1, 2024 · Updated last year
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research ☆15 · May 30, 2024 · Updated last year
- Semantic Segmentation on the LaPa dataset using Pytorch Lightning ☆14 · Nov 4, 2021 · Updated 4 years ago
- Libp2p bindings for Python ☆12 · Jan 26, 2026 · Updated 2 months ago
- Collaborative inference of latent diffusion via hivemind ☆12 · May 29, 2023 · Updated 2 years ago
- Principal Feature Visualization for convolutional neural networks ☆11 · Jan 28, 2021 · Updated 5 years ago
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP) ☆12 · Aug 11, 2016 · Updated 9 years ago
- Monocular/stereo depth estimation with regression ☆12 · May 16, 2019 · Updated 6 years ago
- ☆22 · Jan 5, 2022 · Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor… ☆47 · May 29, 2023 · Updated 2 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it. ☆12 · Jun 20, 2017 · Updated 8 years ago
- Staged Training for Transformer Language Models ☆33 · Mar 31, 2022 · Updated 3 years ago
- Attention based aspect extraction via pytorch ☆14 · Jun 8, 2020 · Updated 5 years ago
- generative models for speech ☆20 · Jul 4, 2016 · Updated 9 years ago
- A C++ implementation of Network Simplex Algorithm ☆11 · Nov 12, 2018 · Updated 7 years ago