rimads / avey-dpaView external linksLinks
Code for the paper Don't Pay Attention
☆51Sep 25, 2025Updated 4 months ago
Alternatives and similar repositories for avey-dpa
Users that are interested in avey-dpa are comparing it to the libraries listed below
Sorting:
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated last month
- Semantic alignment of astronomical data with natural language using multi-modal models. (Jax) Code associated with https://arxiv.org/abs/…☆17Oct 18, 2024Updated last year
- Fluid Language Model Benchmarking☆26Sep 16, 2025Updated 5 months ago
- ☆19Dec 4, 2025Updated 2 months ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence☆56Nov 11, 2025Updated 3 months ago
- ☆45Apr 30, 2018Updated 7 years ago
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- qwen3 experiments☆34Jul 1, 2025Updated 7 months ago
- ☆22Nov 9, 2024Updated last year
- ☆29Jul 9, 2024Updated last year
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆71Jan 13, 2026Updated last month
- Code for "ReSpace: Text-Driven 3D Indoor Scene Synthesis and Editing with Preference Alignment"☆60Dec 9, 2025Updated 2 months ago
- ☆35Feb 26, 2024Updated last year
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated last month
- Declarative SkiaSharp drawings - eg SVG or XAML☆31Sep 19, 2024Updated last year
- Plugin QGIS☆10Jan 16, 2023Updated 3 years ago
- Simple & Scalable Pretraining for Neural Architecture Research☆308Dec 6, 2025Updated 2 months ago
- ☆71Oct 23, 2025Updated 3 months ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 2 months ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Apr 27, 2023Updated 2 years ago
- 100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…☆35Oct 22, 2025Updated 3 months ago
- Bugtracker of novel-ebook.com☆12Aug 11, 2021Updated 4 years ago
- Project-agnostic, composable configuration system for AI-assisted development workflows. Single source of truth for agentic tools (Claude…☆24Updated this week
- This is a frontend to the Inkscape command line feature to allow the user to perform batch conversions of SVG files.☆15Dec 10, 2013Updated 12 years ago
- Card Payments Simulation Tool For Indie Devs : Core Card Switch Engine, Fraud Engine, ATM/POS GUI Simulator , Admin Dash (Real-time MSG …☆19Jun 15, 2025Updated 8 months ago
- Horizontal Pod Autoscaling for .NET applications☆10May 23, 2019Updated 6 years ago
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆50Feb 6, 2026Updated last week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆93Oct 23, 2025Updated 3 months ago
- ☆35Nov 22, 2024Updated last year
- A Simple Algorithm for Minimum Cuts in Near-Linear Time (SWAT '20)☆12Apr 24, 2020Updated 5 years ago
- PyTorch Implementation of Context-Aware Sequential Model for Multi-Behaviour Recommendation https://arxiv.org/abs/2312.09684☆10May 31, 2024Updated last year
- ☆24Updated this week