TJ-Solergibert / transformers-in-supercomputersLinks
Transformers training in a supercomputer with the π€ Stack and Slurm
β15Updated last year
Alternatives and similar repositories for transformers-in-supercomputers
Users that are interested in transformers-in-supercomputers are comparing it to the libraries listed below
Sorting:
- Lightning HPO & Training Studio Appβ19Updated 2 years ago
- Quickest way to share everything about your research within a single appβ37Updated last year
- A miniture AI training framework for PyTorchβ42Updated last year
- Lightning Bits: Engineering for Researchers repoβ134Updated 3 years ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hβ¦β55Updated 2 years ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.β15Updated 2 weeks ago
- Highly commented implementations of Transformers in PyTorchβ138Updated 2 years ago
- β32Updated 2 years ago
- β26Updated 2 years ago
- Common Python utilities and GitHub Actions in Lightning Ecosystemβ63Updated this week
- β56Updated last year
- Train fastai models faster (and other useful tools)β72Updated 7 months ago
- Kaggling for fast kagglers!β54Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year
- Research repo for code that may or may not end up in fastai3β50Updated 4 years ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training endsβ77Updated last year
- Cyclemoid implementation for PyTorchβ90Updated 3 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.β12Updated 2 years ago
- Introductory lecture on Pytorchβ17Updated 3 years ago
- β133Updated 2 years ago
- β48Updated 2 years ago
- β15Updated 3 years ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated 2 years ago
- β24Updated 3 years ago
- All about the fundamental blocks of TF and JAX!β277Updated 4 years ago
- β64Updated 2 years ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and MLβguided tuning.β46Updated last week
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleβ83Updated 2 years ago
- IceData: Datasets Hub for the *IceVision* Frameworkβ49Updated 3 years ago
- ML Research paper summaries, annotated papers and implementation walkthroughsβ114Updated 3 years ago