ml-jku / hopfield-boostingLinks
☆33Updated last year
Alternatives and similar repositories for hopfield-boosting
Users that are interested in hopfield-boosting are comparing it to the libraries listed below
Sorting:
- ☆81Updated last year
- ☆69Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated 10 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆102Updated 9 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆39Updated 6 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆20Updated last week
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆111Updated 3 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14Updated 4 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last week
- ☆163Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Updated 11 months ago
- A Python Library for Learning Non-Euclidean Representations☆64Updated 2 months ago
- ☆65Updated 6 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated 2 years ago
- ☆58Updated last year
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆59Updated last year
- Implementation of Infini-Transformer in Pytorch☆113Updated 9 months ago
- Evaluation of neuro-symbolic engines☆39Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆90Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic…☆14Updated last year
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- ☆14Updated last month
- ☆43Updated 11 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated last year
- Extending Conformal Prediction to LLMs☆67Updated last year
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- ☆61Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year