tensorops / TransformerX
Flexible Python library providing building blocks (layers) for reproducible Transformers research (Tensorflow β
, Pytorch π, and Jax π)
β53Updated last year
Alternatives and similar repositories for TransformerX:
Users that are interested in TransformerX are comparing it to the libraries listed below
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"β95Updated last month
- This repository contains a better implementation of Kolmogorov-Arnold networksβ61Updated 9 months ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracyβ127Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.β46Updated last year
- Gradient Boosting Reinforcement Learning (GBRL)β100Updated 2 weeks ago
- β130Updated last year
- Actually Robust Training - Tool Inspired by Andrej Karpathy "Recipe for training neural networks". It allows you to decompose your Deepβ¦β44Updated 10 months ago
- Building GPT ...β17Updated 2 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ90Updated last month
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.β27Updated last year
- Accelerate Model Training with PyTorch 2.X, published by Packtβ38Updated 8 months ago
- A collection of various LLM sampling methods implemented in pure Pytorchβ20Updated 2 months ago
- β30Updated 9 months ago
- A miniture AI training framework for PyTorchβ39Updated 3 weeks ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-expertsβ116Updated 4 months ago
- Cyclemoid implementation for PyTorchβ87Updated 2 years ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.β37Updated last year
- SaLSa Optimizer implementation (No learning rates needed)β28Updated 2 weeks ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of newβ¦β120Updated 6 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any networkβ49Updated 6 months ago
- β78Updated 10 months ago
- β27Updated 7 months ago
- β44Updated 3 months ago
- Functional local implementations of main model parallelism approachesβ95Updated 2 years ago
- Implementation of Infini-Transformer in Pytorchβ109Updated last month
- several types of attention modules written in PyTorch for learning purposesβ45Updated 4 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"β57Updated last year
- Supercharge huggingface transformers with model parallelism.β76Updated 4 months ago
- A multi-backend (TensorFlow, PyTorch, JAX, and NumPy) implementation of the Segment Anything model in Keras 3.0β32Updated 10 months ago