TJ-Solergibert / transformers-in-supercomputersLinks
Transformers training in a supercomputer with the π€ Stack and Slurm
β15Updated last year
Alternatives and similar repositories for transformers-in-supercomputers
Users that are interested in transformers-in-supercomputers are comparing it to the libraries listed below
Sorting:
- Quickest way to share everything about your research within a single appβ37Updated last year
- Lightning HPO & Training Studio Appβ19Updated 2 years ago
- β32Updated 2 years ago
- β26Updated 2 years ago
- A miniture AI training framework for PyTorchβ42Updated 10 months ago
- Highly commented implementations of Transformers in PyTorchβ139Updated 2 years ago
- β55Updated last year
- Introductory lecture on Pytorchβ17Updated 3 years ago
- Lightning Bits: Engineering for Researchers repoβ132Updated 3 years ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hβ¦β55Updated 2 years ago
- Cyclemoid implementation for PyTorchβ90Updated 3 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesβ196Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Moduleβ83Updated 2 years ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training endsβ77Updated last year
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFaceβ46Updated 3 months ago
- A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop β¦β192Updated 6 months ago
- Train fastai models faster (and other useful tools)β72Updated 6 months ago
- β48Updated last year
- Research repo for code that may or may not end up in fastai3β50Updated 4 years ago
- Fourth place solution to the "OpenVaccine: COVID-19 mRNA Vaccine Degradation Prediction" organized by Stanford University and Kaggleβ20Updated 5 years ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and MLβguided tuning.β43Updated last week
- Common Python utilities and GitHub Actions in Lightning Ecosystemβ62Updated last week
- Includes PyTorch -> Keras model porting code for DeiT models with fine-tuning and inference notebooks.β41Updated 3 years ago
- Quickest way to share everything about your research within a single appβ16Updated last year
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyβ129Updated 2 years ago
- ML/DL Math and Method notesβ64Updated 2 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.β12Updated 2 years ago
- β134Updated 2 years ago
- β20Updated 3 years ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated 2 years ago