ml-jku / hopfield-boosting
☆33 Updated last year
Alternatives and similar repositories for hopfield-boosting
Users who are interested in hopfield-boosting are comparing it to the repositories listed below.
- ☆81 Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025] ☆23 Updated 3 weeks ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆61 Updated 11 months ago
- ☆69 Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆102 Updated 10 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction. ☆40 Updated 6 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers ☆114 Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆117 Updated 4 months ago
- A Python Library for Learning Non-Euclidean Representations ☆67 Updated 2 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 Updated 5 months ago
- Understanding how features learned by neural networks evolve throughout training ☆39 Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆147 Updated 3 weeks ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆59 Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind ☆57 Updated 5 months ago
- Implementation of MambaFormer in PyTorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆20 Updated last week
- ☆58 Updated last year
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic… ☆14 Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf… ☆25 Updated last year
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks ☆51 Updated last month
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN) ☆41 Updated 8 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in PyTorch ☆25 Updated 9 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al. ☆29 Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why ☆37 Updated last year
- Implementation of Spectral State Space Models ☆16 Updated last year
- ☆61 Updated last year
- Implementation of GateLoop Transformer in PyTorch and JAX ☆90 Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google DeepMind, in PyTorch ☆88 Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆68 Updated last year
- ☆32 Updated last year
- ☆32 Updated 3 weeks ago