Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆59Oct 22, 2023Updated 2 years ago
Alternatives and similar repositories for kalman-filtering-attention
Users that are interested in kalman-filtering-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Oct 22, 2023Updated 2 years ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Aug 18, 2024Updated last year
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Jul 9, 2023Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Nov 1, 2023Updated 2 years ago
- Fast Neural Machine Translation in C++ - development repository☆23May 12, 2024Updated last year
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆67Jan 10, 2023Updated 3 years ago
- Implementation of Flash Attention in Jax☆227Mar 1, 2024Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Stochastic Arithmetic to diagnose Floating-Point problems in Julia☆18May 24, 2019Updated 6 years ago
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP☆15Jun 20, 2025Updated 9 months ago
- Graph neural network message passing reframed as a Transformer with local attention☆70Dec 24, 2022Updated 3 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Implementation of Action Matching for the Schrödinger equation☆25Jun 18, 2023Updated 2 years ago
- Implementations of various linear RNN layers using pytorch and triton☆55Aug 4, 2023Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Oct 11, 2024Updated last year
- This is the official PyTorch implementation for the HLGP algorithm used to solve large-scale CVRP.☆10Feb 13, 2025Updated last year
- Drug Target Interaction Prediction Using Protein Binding Sites And Drug Fragments☆12Aug 11, 2025Updated 7 months ago
- Implementation of MetNet-3, SOTA neural weather model out of Google Deepmind, in Pytorch☆237Nov 16, 2023Updated 2 years ago
- ☆61Nov 4, 2023Updated 2 years ago
- Course Website for "AI618: Generative Model and Unsupervised Learning"☆37May 23, 2023Updated 2 years ago
- Memory-efficient optimum einsum using opt_einsum planning and PyTorch kernels.☆16Apr 24, 2023Updated 2 years ago
- ☆32May 26, 2024Updated last year
- A library to perform targeted free energy perturbation with normalizing flows.☆10Sep 1, 2025Updated 6 months ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony☆34Oct 16, 2023Updated 2 years ago
- RLMM is a reinforcement learning env for molecular modeling (currently only protein-ligand docking).☆11Nov 14, 2022Updated 3 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- ☆45Apr 30, 2018Updated 7 years ago
- A paper list of world model☆29Apr 10, 2025Updated 11 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 10 months ago
- ☆11Jun 5, 2024Updated last year
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Aug 26, 2023Updated 2 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆14Jul 25, 2023Updated 2 years ago
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- Frontend for evaluating humans on chemistry questions☆11Sep 1, 2024Updated last year
- ☆32May 30, 2024Updated last year