Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"
☆14May 26, 2025Updated 10 months ago
Alternatives and similar repositories for positional_attention
Users that are interested in positional_attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- ☆18Nov 10, 2024Updated last year
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The official repository for AdaMuon☆37Aug 27, 2025Updated 7 months ago
- Official repository for the paper "Automating Continual Learning"☆18Jun 11, 2025Updated 10 months ago
- RS-IMLE☆44Dec 7, 2024Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆14Nov 15, 2022Updated 3 years ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated last year
- ☆27Jul 9, 2024Updated last year
- Language Segment-Anything (with updated dependencies)☆34Mar 4, 2024Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆38Jun 24, 2025Updated 9 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models☆33Jun 9, 2025Updated 10 months ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆33Sep 20, 2025Updated 6 months ago
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆12Dec 5, 2024Updated last year
- ☆11Mar 4, 2020Updated 6 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆31Mar 11, 2025Updated last year
- ☆12Sep 18, 2024Updated last year
- orbital MCMC☆10Jun 17, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieva☆11Nov 12, 2024Updated last year
- Optimal distance lower bound k-mer sampling.☆12Jun 19, 2024Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Open source code for ICML 2025 Paper: Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias☆43Nov 14, 2025Updated 5 months ago
- Sequence-based prediction of peptide-TCR interactions using paired chain data☆13Feb 2, 2026Updated 2 months ago
- Docker image for clojupyter.☆14Apr 12, 2020Updated 6 years ago
- data processing code for MIMIC-IV 2.2☆14Jan 26, 2024Updated 2 years ago
- Analysis of snRNA-seq2 data coming from 3 months old mouse liver, dissecting the influence ploidy has on gene expression.☆14Dec 6, 2021Updated 4 years ago
- ☆15Mar 2, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TF 2.x implementation of SimSiam (Exploring Simple Siamese Representation Learning, CVPR 2021)☆16Jun 21, 2021Updated 4 years ago
- Implementation of Danijar's latest iteration for his Dreamer line of work☆178Updated this week
- Makes vim behave like `tail -f`☆10Oct 7, 2016Updated 9 years ago
- Simple implementations of attention modules adapted for the biological data domain.☆14May 20, 2025Updated 10 months ago
- Data for paper "Random Access in Large-Scale DNA Data Storage"☆13Jan 11, 2018Updated 8 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Jun 10, 2021Updated 4 years ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆63Mar 4, 2025Updated last year