Trel725 / forward-forward
A simple Python implementation of forward-forward NN training by G. Hinton from NeurIPS 2022
☆21 Updated last year
Related projects
Alternatives and complementary repositories for forward-forward
- PyTorch implementation of Hinton's FF Algorithm with hard negatives sampling ☆14 Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆57 Updated last year
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021) ☆47 Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆24 Updated 7 months ago
- Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆63 Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- Deep Learning & Information Bottleneck ☆50 Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆44 Updated 5 months ago
- Code accompanying the paper "A contrastive rule for meta-learning" ☆11 Updated 3 weeks ago
- Reimplementation of Geoffrey Hinton's Forward-Forward Algorithm ☆132 Updated last year
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We… ☆46 Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 2 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆34 Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆19 Updated last year
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers… ☆22 Updated last year
- A collection of meta-learning algorithms in Jax ☆23 Updated 2 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations" ☆40 Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s… ☆66 Updated last year