konstantinosKokos / apeLinks
🧮 Algebraic Positional Encodings.
☆13Updated 4 months ago
Alternatives and similar repositories for ape
Users that are interested in ape are comparing it to the libraries listed below
Sorting:
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- ☆32Updated 7 months ago
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆10Updated last year
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated last week
- ☆13Updated this week
- ☆11Updated 3 months ago
- RWKV model implementation☆38Updated last year
- Implementation of Spectral State Space Models☆16Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- ☆31Updated last year
- The Energy Transformer block, in JAX☆56Updated last year
- Repo for solving arc problems with an Neural Cellular Automata☆14Updated last week
- ☆31Updated last year
- ☆18Updated last year
- ☆53Updated 7 months ago
- Quantification of Uncertainty with Adversarial Models☆28Updated last year
- ☆27Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 4 months ago
- ☆17Updated 9 months ago
- A Scalable Approximate Method for Probabilistic Neurosymbolic Inference☆15Updated 4 months ago
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (…☆22Updated 2 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆17Updated last year
- ☆22Updated 3 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Updated last year
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 3 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last week
- An interactive tool for analyzing, executing, and improving dynamic programming algorithms.☆13Updated 10 months ago