Unofficial implementation of "Attention as an RNN" (https://arxiv.org/pdf/2405.13956), with both an efficient associative parallel prefix scan version and a recurrent version.
☆28Jul 27, 2024Updated last year
Alternatives and similar repositories for Attention-as-RNN
Users who are interested in Attention-as-RNN are comparing it to the libraries listed below.
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆25Jun 6, 2024Updated 2 years ago
- An Easy-to-Use Wrapper for the Spectral Synthesis Code Synspec☆22Jun 24, 2026Updated last week
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- [CVPR 2026] Task-Aware Image Signal Processor for Advanced Visual Perception☆31Mar 28, 2026Updated 3 months ago
- ☆16Jul 24, 2023Updated 2 years ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated 3 months ago
- Unofficial PyTorch implementation of 'Fast and High-Quality Image Denoising via Malleable Convolutions'.☆12Mar 7, 2026Updated 3 months ago
- The official Languini Kitchen repository☆14May 6, 2024Updated 2 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆33Mar 22, 2026Updated 3 months ago
- ☆26Mar 3, 2025Updated last year
- Run Python bytecode in PHP☆11Oct 25, 2017Updated 8 years ago
- Train toy models using multi-token prediction objective☆14Apr 18, 2026Updated 2 months ago
- Official implementation of 'A Large-Scale Exploration of mu-Transfer'☆32Jun 5, 2025Updated last year
- ☆24Sep 25, 2024Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆29Mar 2, 2026Updated 4 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated 2 years ago
- [ICML 2026] The official code of FeRA: Frequency–Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning☆29Dec 27, 2025Updated 6 months ago
- [NeurIPS 2024] Official implementation of NeurIPS 2024 paper "Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory …☆26Feb 24, 2025Updated last year
- ☆19Jul 21, 2019Updated 6 years ago
- ☆20Aug 6, 2024Updated last year
- A system to manage online orders across Amazon, Ebay, Walmart, Reverb, and Big Commerce stores☆14Mar 13, 2023Updated 3 years ago
- Object detection and tracking using Hikvision's code☆21Jan 4, 2024Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 3 months ago
- Text-to-Speech for ROS 2☆21Dec 8, 2025Updated 6 months ago
- Counterfactual Generative Modeling with Variational Causal Inference (ICLR 2025)☆21Sep 30, 2025Updated 9 months ago
- Parallel Prefix Sum (Scan) with CUDA.☆15Jul 17, 2020Updated 5 years ago
- ☆11Nov 13, 2025Updated 7 months ago
- A messaging app built with Go + Tauri + Vue.js, with instant chat implemented over WebSocket☆11Aug 8, 2023Updated 2 years ago
- Accepted at WWW 25 Industrial Track (oral)☆17Jun 6, 2025Updated last year
- ☆36Oct 20, 2020Updated 5 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- ImMesh_PGO: An Immediate LiDAR Localization and Meshing Framework with Loop Closure☆34Nov 23, 2024Updated last year
- This is an official implementation for "Learning a Cross-Modality Anomaly Detector for Remote Sensing Imagery" (TIP 2024)☆18Jul 24, 2025Updated 11 months ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated 2 years ago
- ☆16Jun 1, 2023Updated 3 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- ROS packages for facilitating text-to-speech and the use of Amazon Polly.☆18Feb 8, 2022Updated 4 years ago