Rishit-dagli / Fast-Transformer
An implementation of Additive Attention
☆150Updated 3 years ago
Alternatives and similar repositories for Fast-Transformer:
Users that are interested in Fast-Transformer are comparing it to the libraries listed below
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).☆85Updated 2 years ago
- ☆39Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago
- Minimal implementation of PAWS (https://arxiv.org/abs/2104.13963) in TensorFlow.☆45Updated 3 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆146Updated 3 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆25Updated 2 years ago
- Implementing activation functions from scratch in Tensorflow.☆36Updated 3 years ago
- Instantly improve your training performance of TensorFlow models with just 2 lines of code!☆106Updated 3 years ago
- High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations.☆86Updated 3 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Updated 3 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU☆15Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆82Updated 5 months ago
- Simple tooling for marking deprecated functions or classes and re-routing to the new successors' instance.☆51Updated 2 weeks ago
- Patches the torch.save function with arbitrary code that gets executed upon torch.load.☆71Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- Python Research Framework☆106Updated 2 years ago
- Tensorflow implementation of a linear attention architecture☆44Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated last year
- chaii: hindi and tamil question answering☆35Updated 3 years ago
- ☆28Updated last year
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data☆37Updated 4 years ago
- Neural Search System on Arxiv AI/ML Papers☆54Updated 3 years ago
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆134Updated 3 years ago
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.☆33Updated 3 years ago
- Implementation of Feedback Transformer in Pytorch☆105Updated 4 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Updated 3 years ago
- Generating Training Data Made Easy☆43Updated 4 years ago