Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"
☆14 · May 26, 2025 · Updated 11 months ago
Alternatives and similar repositories for positional_attention
Users that are interested in positional_attention are comparing it to the libraries listed below.
- Combining SOAP and MUON ☆20 · Feb 11, 2025 · Updated last year
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx… ☆12 · Oct 17, 2022 · Updated 3 years ago
- ☆19 · Nov 10, 2024 · Updated last year
- Official repository for the paper "Automating Continual Learning" ☆18 · Jun 11, 2025 · Updated 10 months ago
- RS-IMLE ☆44 · Dec 7, 2024 · Updated last year
- ☆27 · Jul 9, 2024 · Updated last year
- CAST-Seq Bioinformatic pipeline ☆7 · Jan 5, 2026 · Updated 4 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆38 · Jun 24, 2025 · Updated 10 months ago
- Reasoning-based Evaluation and Ranking of Translations. ☆20 · Jul 18, 2025 · Updated 9 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices ☆10 · Jan 12, 2021 · Updated 5 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆20 · Jan 7, 2022 · Updated 4 years ago
- Reversible programming in Agda ☆13 · Jun 22, 2023 · Updated 2 years ago
- Tidy autoregressive inference in JAX ☆15 · Sep 1, 2025 · Updated 8 months ago
- [ICML 2025] "Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You ☆35 · Sep 20, 2025 · Updated 7 months ago
- ☆11 · Mar 4, 2020 · Updated 6 years ago
- A Lightweight Deep Learning Library in MATLAB ☆11 · Jun 28, 2019 · Updated 6 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable". ☆33 · Mar 11, 2025 · Updated last year
- Scratchpad/Chain-of-Thought Prompts ☆12 · Jun 6, 2022 · Updated 3 years ago
- A script to transcribe audio files with Google Cloud Speech API. ☆10 · Oct 31, 2017 · Updated 8 years ago
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieval ☆11 · Nov 12, 2024 · Updated last year
- Sequence-based prediction of peptide-TCR interactions using paired chain data ☆13 · Feb 2, 2026 · Updated 3 months ago
- Equivariant layers for RC-complement symmetry in DNA sequence data ☆12 · Feb 24, 2022 · Updated 4 years ago
- Docker image for clojupyter. ☆14 · Apr 12, 2020 · Updated 6 years ago
- Python package for pairwise ranking ☆15 · Oct 17, 2024 · Updated last year
- ☆15 · Mar 2, 2025 · Updated last year
- Makes vim behave like `tail -f` ☆10 · Oct 7, 2016 · Updated 9 years ago
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula… ☆20 · Feb 12, 2025 · Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆36 · Apr 7, 2025 · Updated last year
- ☆18 · Nov 15, 2022 · Updated 3 years ago
- Data for paper "Random Access in Large-Scale DNA Data Storage" ☆13 · Jan 11, 2018 · Updated 8 years ago
- Simple implementations of attention modules adapted for the biological data domain. ☆14 · May 20, 2025 · Updated 11 months ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches ☆64 · Mar 4, 2025 · Updated last year
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST ☆11 · Nov 19, 2022 · Updated 3 years ago
- GoldFinch and other hybrid transformer components ☆46 · Jul 20, 2024 · Updated last year
- One-shot Global Localization through Semantic Distribution Feature Retrieval and Semantic Topological Histogram Registration ☆19 · Feb 14, 2025 · Updated last year
- Implementation of Danijar's latest iteration for his Dreamer line of work ☆185 · Updated this week
- [NeurIPS 2023] Implementation of "Improving Self-supervised Molecular Representation Learning using Persistent Homology" ☆15 · Nov 16, 2023 · Updated 2 years ago
- Ontoclick - A web browser extension to turn highlighted text into a proper Ontology term. ☆13 · Jun 2, 2023 · Updated 2 years ago
- This is a simple interface for chroma - it takes in documents, embeds them into a DB and allows you to query over them using GPT 3.5 ☆10 · Dec 7, 2024 · Updated last year