carloalbertobarbano / forward-forward-pytorch
PyTorch implementation of Hinton's FF Algorithm with hard negatives sampling
☆14 Updated 2 years ago
Alternatives and similar repositories for forward-forward-pytorch:
Users who are interested in forward-forward-pytorch are comparing it to the libraries listed below:
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023] ☆31 Updated 6 months ago
- ☆51 Updated 9 months ago
- ☆73 Updated 2 years ago
- ☆54 Updated 7 months ago
- Recycling diverse models ☆44 Updated 2 years ago
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, NeurIPS 2022) ☆67 Updated 7 months ago
- Train ImageNet *fast* in 500 lines of code with FFCV ☆140 Updated 10 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11) ☆21 Updated last year
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair ☆47 Updated last year
- ☆52 Updated 5 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆49 Updated 9 months ago
- Reimplementation of Geoffrey Hinton's Forward-Forward Algorithm ☆144 Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 Updated last year
- Differentiable Top-k Classification Learning ☆80 Updated 2 years ago
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021. ☆141 Updated 3 years ago
- [ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, and Zhangyang Wang ☆27 Updated 2 years ago
- Implementation of Infini-Transformer in PyTorch ☆109 Updated 2 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, don't use it for Adam ☆73 Updated 7 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 Updated last year
- ☆81 Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆35 Updated 2 years ago
- ☆21 Updated 2 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆44 Updated last year
- MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248 ☆51 Updated 9 months ago
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning ☆47 Updated last year
- ☆60 Updated 3 years ago
- Experiment of using Tangent to autodiff triton ☆78 Updated last year
- ☆29 Updated 2 years ago
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023. ☆25 Updated last year