Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"
☆14May 26, 2025Updated 11 months ago
Alternatives and similar repositories for positional_attention
Users that are interested in positional_attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11May 1, 2026Updated 3 weeks ago
- Official repository for the paper "Automating Continual Learning"☆19Jun 11, 2025Updated 11 months ago
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated 2 years ago
- Language Segment-Anything (with updated dependencies)☆36Mar 4, 2024Updated 2 years ago
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models"☆17Nov 8, 2024Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆39Jun 24, 2025Updated 11 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Reversible programming in Agda☆13Jun 22, 2023Updated 2 years ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆34Jun 9, 2025Updated 11 months ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆13Dec 5, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Lightweight Deep Learning Library in MATLAB☆11Jun 28, 2019Updated 6 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆37Sep 20, 2025Updated 8 months ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆33Mar 11, 2025Updated last year
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- A script to transcribe audio files with Google Cloud Speech API.☆10Oct 31, 2017Updated 8 years ago
- orbital MCMC☆10Jun 17, 2021Updated 4 years ago
- Optimal distance lower bound k-mer sampling.☆12Jun 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Sequence-based prediction of peptide-TCR interactions using paired chain data☆13Feb 2, 2026Updated 3 months ago
- data processing code for MIMIC-IV 2.2☆14Jan 26, 2024Updated 2 years ago
- Analysis of snRNA-seq2 data coming from 3 months old mouse liver, dissecting the influence ploidy has on gene expression.☆14Dec 6, 2021Updated 4 years ago
- Python package for pairwise ranking☆15Oct 17, 2024Updated last year
- Official implementation of "Traveling Waves Encode the Recent Past and Enhance Sequence Learning" (ICLR 2024)☆13Mar 15, 2024Updated 2 years ago
- ☆15Mar 2, 2025Updated last year
- Makes vim behave like `tail -f`☆10Oct 7, 2016Updated 9 years ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data for paper "Random Access in Large-Scale DNA Data Storage"☆12Jan 11, 2018Updated 8 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Jun 10, 2021Updated 4 years ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆64Mar 4, 2025Updated last year
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- GoldFinch and other hybrid transformer components☆46Jul 20, 2024Updated last year
- Javascript library for visualizing dynamic neural networks across time.☆13Dec 9, 2019Updated 6 years ago
- One-shot Global Localization through Semantic Distribution Feature Retrieval and Semantic Topological Histogram Registration☆19Feb 14, 2025Updated last year