TJ-Solergibert / transformers-in-supercomputersLinks
Transformers training in a supercomputer with the π€ Stack and Slurm
β15Updated last year
Alternatives and similar repositories for transformers-in-supercomputers
Users that are interested in transformers-in-supercomputers are comparing it to the libraries listed below
Sorting:
- Lightning HPO & Training Studio Appβ19Updated 2 years ago
- Quickest way to share everything about your research within a single appβ37Updated last year
- β26Updated 2 years ago
- Cyclemoid implementation for PyTorchβ90Updated 3 years ago
- β56Updated last year
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hβ¦β55Updated 2 years ago
- β32Updated 2 years ago
- Highly commented implementations of Transformers in PyTorchβ139Updated 2 years ago
- A miniture AI training framework for PyTorchβ42Updated 11 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year
- β24Updated 3 years ago
- Common Python utilities and GitHub Actions in Lightning Ecosystemβ62Updated last week
- All about the fundamental blocks of TF and JAX!β275Updated 4 years ago
- ML Research paper summaries, annotated papers and implementation walkthroughsβ114Updated 3 years ago
- Lightning Bits: Engineering for Researchers repoβ132Updated 3 years ago
- Interview Questions and Answers for Machine Learning Engineer roleβ116Updated 7 months ago
- β48Updated 2 years ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyβ129Updated 2 years ago
- Fourth place solution to the "OpenVaccine: COVID-19 mRNA Vaccine Degradation Prediction" organized by Stanford University and Kaggleβ21Updated 5 years ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and MLβguided tuning.β46Updated this week
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training endsβ77Updated last year
- β15Updated 3 years ago
- Kaggling for fast kagglers!β54Updated 2 years ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated 2 years ago
- Source notebook code for the course, stripped of all information. Please consider puchasing the course at https://store.walkwithfastai.coβ¦β36Updated last year
- Research repo for code that may or may not end up in fastai3β50Updated 4 years ago
- Train fastai models faster (and other useful tools)β72Updated 6 months ago
- β133Updated 2 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.β10Updated 2 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.β12Updated 2 years ago