Rishit-dagli / Nystromformer
An implementation of the Nyströmformer, using Nystrom method to approximate standard self attention
☆57Updated 2 years ago
Alternatives and similar repositories for Nystromformer:
Users that are interested in Nystromformer are comparing it to the libraries listed below
- An Implementation of Transformer in Transformer in TensorFlow for image classification, attention inside local patches☆43Updated 3 years ago
- convert a scikit-learn decision tree into a Keras model☆39Updated last year
- Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory☆80Updated 2 weeks ago
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Minimal deep learning library written from scratch in Python, using NumPy/CuPy.☆121Updated 2 years ago
- Designing bridge trusses with Pytorch autograd☆61Updated last year
- 🧮 Reading group about differential, integral and logical calculi.☆26Updated 10 months ago
- A playground to make it easy to try crazy things☆33Updated last week
- An experiment with "freely" wired neural networks (no layers)☆163Updated 7 months ago
- A fast Tsetlin Machine implementation employing bit-wise operators, with MNIST demo.☆67Updated 5 years ago
- Visual Transformer Mechanistic Analysis Tool☆34Updated last year
- Run compute jobs on AWS as if you were running them locally.☆125Updated 3 years ago
- Brzozowski derivative python sketch☆85Updated 11 months ago
- ☆18Updated 2 years ago
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All…☆169Updated last year
- ☆126Updated last year
- There are C language computer programs about the simulator, transformation, and test statistic of continuous Bernoulli distribution. More…☆25Updated 10 months ago
- Revealing example of self-attention, the building block of transformer AI models☆130Updated last year
- ☆39Updated 2 years ago
- This is a numpy implementation of the Skip-gram algorithm described in Mikolov et al's Word2Vec paper. It is intended for didactic purpos…☆35Updated last year
- An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data☆37Updated 3 years ago
- 1.2% test error on MNIST using only least squares and numpy calls.☆17Updated last year
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆97Updated last year
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated last week
- Command-line tool to remotely execute code in the cloud☆134Updated 3 years ago
- duralava is a neural network which can simulate a lava lamp in an infinite loop.☆90Updated last year
- A star for organising blocks and playing with transformers.☆23Updated 10 months ago
- Tensorflow implementation of Collaborative Sampling for Image inpainting☆35Updated 5 years ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆104Updated last year