yukw777 / leela-zero-pytorchLinks
A simple PyTorch + PyTorch Lightning training pipeline for Leela Zero
☆54Updated 4 years ago
Alternatives and similar repositories for leela-zero-pytorch
Users that are interested in leela-zero-pytorch are comparing it to the libraries listed below
Sorting:
- A collection of code snippets for my PyTorch Lightning projects☆107Updated 4 years ago
- Implementation of Feedback Transformer in Pytorch☆108Updated 4 years ago
- Pytorch implementation of Compressive Transformers, from Deepmind☆163Updated 4 years ago
- Semantic Segmentation with Pytorch-Lightning☆63Updated 4 years ago
- ☆75Updated 3 years ago
- ☆209Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 4 years ago
- A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.☆67Updated last week
- Trains Transformer model variants. Data isn't shuffled between batches.☆143Updated 3 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends☆77Updated 11 months ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated 2 years ago
- My implementation of DeepMind's Perceiver☆63Updated 4 years ago
- GPT, but made only out of MLPs☆89Updated 4 years ago
- Simple stochastic weight averaging callback for Keras☆63Updated 4 years ago
- Learned Hyperparameter Optimizers☆60Updated 4 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆131Updated 3 years ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆129Updated last year
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- An alternative to convolution in neural networks☆258Updated last year
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 4 years ago
- Standalone Product Key Memory module in Pytorch - for augmenting Transformer models☆86Updated 3 weeks ago
- Configuration classes enabling type-safe PyTorch configuration for Hydra apps☆224Updated 3 years ago
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆82Updated 2 years ago
- Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.☆243Updated 3 years ago
- Configuration classes enabling Hydra to configure and manage Pytorch Lightning projects.☆43Updated 4 years ago
- Collection of the latest, greatest, deep learning optimizers (for Pytorch) - CNN, NLP suitable☆217Updated 4 years ago
- A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop …☆192Updated 5 months ago