Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"
☆14May 26, 2025Updated last year
Alternatives and similar repositories for positional_attention
Users that are interested in positional_attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- ☆19Nov 10, 2024Updated last year
- Official repository for the paper "Automating Continual Learning"☆20Jun 11, 2025Updated last year
- RS-IMLE☆44Dec 7, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated 2 years ago
- ☆27Jul 9, 2024Updated last year
- Sequence algorithms for use in Flashlight.☆14Jan 12, 2026Updated 5 months ago
- Implementation of Neurips 2023 Paper "Multi Time Scale World Models"☆17Nov 8, 2024Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆39Jun 24, 2025Updated 11 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jun 2, 2026Updated last week
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆11Jan 12, 2021Updated 5 years ago
- Reversible programming in Agda☆13Jun 22, 2023Updated 2 years ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆13Dec 5, 2024Updated last year
- A Lightweight Deep Learning Library in MATLAB☆11Jun 28, 2019Updated 6 years ago
- [ICML 2025]"Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆39Sep 20, 2025Updated 8 months ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 9 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆34Mar 11, 2025Updated last year
- ☆13Sep 18, 2024Updated last year
- ☆14Jun 6, 2023Updated 3 years ago
- Optimal distance lower bound k-mer sampling.☆12Jun 19, 2024Updated last year
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieva☆11Nov 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- data processing code for MIMIC-IV 2.2☆15Jan 26, 2024Updated 2 years ago
- Listing of GPU based bioinformatics software & sites & publications☆12Jan 16, 2022Updated 4 years ago
- Analysis of snRNA-seq2 data coming from 3 months old mouse liver, dissecting the influence ploidy has on gene expression.☆14Dec 6, 2021Updated 4 years ago
- Official implementation of "Traveling Waves Encode the Recent Past and Enhance Sequence Learning" (ICLR 2024)☆12Mar 15, 2024Updated 2 years ago
- ☆15Mar 2, 2025Updated last year
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…☆21Feb 12, 2025Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆36Apr 7, 2025Updated last year
- ☆18Nov 15, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple implementations of attention modules adapted for the biological data domain.☆14May 20, 2025Updated last year
- Visualizing the the loss landscape of Fully-Connected Neural Networks☆46Jun 4, 2023Updated 3 years ago
- Javascript library for visualizing dynamic neural networks across time.☆13Dec 9, 2019Updated 6 years ago
- Implementation of Danijar's latest iteration for his Dreamer line of work☆191Updated this week
- One-shot Global Localization through Semantic Distribution Feature Retrieval and Semantic Topological Histogram Registration☆19Feb 14, 2025Updated last year
- Raw data and analysis of scRNA Seq experiments for the paper Multimodal Mapping of the Immune Landscape in Human Pancreatic Cancer. This …☆15Nov 30, 2020Updated 5 years ago
- [NeurIPS 2023] Implementation of "Improving Self-supervised Molecular Representation Learning using Persistent Homology"☆15Nov 16, 2023Updated 2 years ago