ml-jku / hopfield-boosting
☆31 Updated 9 months ago
Alternatives and similar repositories for hopfield-boosting:
Users who are interested in hopfield-boosting are comparing it to the libraries listed below.
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 Updated this week
- ☆67 Updated 6 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al. ☆28 Updated last year
- ☆39 Updated 7 months ago
- Official implementation for "Targeted Cause Discovery with Data-Driven Learning" ☆23 Updated 6 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆25 Updated 8 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆38 Updated 4 months ago
- ☆78 Updated 10 months ago
- Implementation of MambaFormer in PyTorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin… ☆20 Updated last month
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning ☆22 Updated last year
- Quantification of Uncertainty with Adversarial Models ☆28 Updated last year
- ☆52 Updated 5 months ago
- An annotated implementation of the Hyena Hierarchy paper ☆32 Updated last year
- Implementation of Spectral State Space Models ☆16 Updated last year
- Collection of tests performed during the study of the new Kolmogorov-Arnold Neural Networks (KAN) ☆37 Updated 2 weeks ago
- ☆43 Updated 4 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆53 Updated 11 months ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o… ☆18 Updated 5 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆58 Updated 3 months ago
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning" ☆14 Updated last month
- gzip Predicts Data-dependent Scaling Laws ☆34 Updated 9 months ago
- Extending Conformal Prediction to LLMs ☆64 Updated 8 months ago
- ☆29 Updated 2 months ago
- Automatic identification of regions in the latent space of a model that correspond to unique concepts, namely to concepts with a semantic… ☆13 Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆133 Updated this week
- ☆47 Updated 3 months ago
- Recycling diverse models ☆44 Updated 2 years ago
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis" ☆52 Updated 6 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction. ☆33 Updated last month