carloalbertobarbano / forward-forward-pytorch
PyTorch implementation of Hinton's FF Algorithm with hard negatives sampling
☆14 Updated last year
Related projects:
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023] ☆30 Updated 3 weeks ago
- ☆48 Updated 3 months ago
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248 ☆33 Updated 3 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆77 Updated last year
- ☆42 Updated 3 months ago
- A simple Python implementation of forward-forward NN training by G. Hinton from NeurIPS 2022 ☆20 Updated last year
- ☆73 Updated last year
- Differentiable Top-k Classification Learning ☆71 Updated last year
- ☆55 Updated 2 years ago
- ☆21 Updated 3 months ago
- ☆52 Updated last month
- Code for the papers "Linear Algebra with Transformers" (TMLR) and "What is my Math Transformer Doing?" (AI for Maths Workshop, NeurIPS 2022) ☆61 Updated last month
- Reimplementation of Geoffrey Hinton's Forward-Forward Algorithm ☆117 Updated 10 months ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023] ☆81 Updated last year
- Model Stock: All we need is just a few fine-tuned models ☆75 Updated 5 months ago
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper. ☆28 Updated 11 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆44 Updated 3 months ago
- Implementation of Infini-Transformer in PyTorch ☆100 Updated last month
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆43 Updated last year
- ☆47 Updated 3 months ago
- ☆160 Updated last year
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- ☆65 Updated 9 months ago
- Implementation of Hourglass Transformer, in PyTorch, from Google and OpenAI ☆74 Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆57 Updated 10 months ago
- 94% on CIFAR-10 in 3.09 seconds 💨 96% in 27 seconds ☆127 Updated last month
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" ☆38 Updated last year
- ☆29 Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆51 Updated this week
- Recycling diverse models ☆42 Updated last year