Rishit-dagli / Nystromformer
An implementation of the Nyströmformer, using the Nyström method to approximate standard self-attention
☆56Updated 2 years ago
Alternatives and similar repositories for Nystromformer
Users who are interested in Nystromformer are comparing it to the libraries listed below
- Unifying Python/C++/CUDA memory: Python buffered array ↔️ `std::vector` ↔️ CUDA managed memory☆82Updated this week
- Testing various image matching algorithms' performance on the Pinecone vector DB☆43Updated 2 years ago
- convert a scikit-learn decision tree into a Keras model☆39Updated last year
- An Implementation of Transformer in Transformer in TensorFlow for image classification, attention inside local patches☆43Updated 3 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 3 years ago
- An experiment with "freely" wired neural networks (no layers)☆163Updated last year
- C programs for the simulator, transformation, and test statistic of the continuous Bernoulli distribution. More…☆25Updated last year
- Small deep learning library written from scratch in Python, using NumPy/CuPy.☆125Updated 2 years ago
- ☆252Updated 2 years ago
- Brzozowski derivative python sketch☆85Updated 4 months ago
- Run compute jobs on AWS as if you were running them locally.☆125Updated 3 years ago
- duralava is a neural network which can simulate a lava lamp in an infinite loop.☆90Updated 2 years ago
- A BERT that you can train on a (gaming) laptop.☆209Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- A Detailed Introduction to My Favorite Statistical Measure, Hoeffding's D☆98Updated last year
- Beating the `bisect` module's implementation using C-extensions.☆30Updated 2 years ago
- Grow virtual creatures in static and physics simulated environments.☆53Updated last year
- Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All…☆172Updated 2 years ago
- Test prompts for GPT-J-6B and the resulting AI-generated texts☆53Updated 4 years ago
- Lazy, a tool for running things in idle time☆48Updated 4 years ago
- 🦠 AD in less than 20 lines☆54Updated 4 years ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆204Updated 10 months ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 4 months ago
- Revealing example of self-attention, the building block of transformer AI models☆131Updated 2 years ago
- ☆126Updated 2 years ago
- 1.2% test error on MNIST using only least squares and numpy calls.☆19Updated last year
- Designing bridge trusses with PyTorch autograd☆61Updated last year
- Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala.☆150Updated last year
- Experiments with applying Fourier transforms to various plane-filling curves and patterns☆66Updated 2 years ago
- A playground to make it easy to try crazy things☆33Updated last month