TomFrederik/grokking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TomFrederik/grokking)

TomFrederik / grokking

Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'

☆38

Alternatives and similar repositories for grokking

Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

brennanaba / AbData
View on GitHub
☆10Sep 13, 2021Updated 4 years ago
aogara-ds / hoodwinked-website
View on GitHub
A text-based game where language models learn to lie and to detect lies.
☆12Oct 4, 2023Updated 2 years ago
CompVis / visual-search
View on GitHub
Visual search interface
☆11Nov 30, 2021Updated 4 years ago
lucidrains / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆15May 18, 2021Updated 5 years ago
DiffEqML / tutorials
View on GitHub
☆11Apr 14, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Jack000 / DALLE-pytorch
View on GitHub
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆89Dec 3, 2021Updated 4 years ago
TomFrederik / unseal
View on GitHub
Mechanistic Interpretability for Transformer Models
☆53Jun 1, 2022Updated 4 years ago
crowsonkb / esgd
View on GitHub
ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.
☆57Sep 18, 2022Updated 3 years ago
sayakpaul / MLPMixer-jax2tf
View on GitHub
This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.
☆15Sep 29, 2021Updated 4 years ago
eyaler / clip_biggan
View on GitHub
☆13Sep 17, 2021Updated 4 years ago
ermongroup / fast_feedforward_computation
View on GitHub
Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021
☆30Sep 25, 2021Updated 4 years ago
afiaka87 / latent-diffusion-deepspeed
View on GitHub
Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)
☆36Apr 17, 2022Updated 4 years ago
robvanvolt / DALLE-tools
View on GitHub
DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.
☆14Mar 9, 2022Updated 4 years ago
ayoublasri / Biomodelling.jl
View on GitHub
Framework for stochastic modelling in systems biology
☆12Aug 11, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
dzryk / cliptalk
View on GitHub
☆19Aug 19, 2021Updated 4 years ago
ruiqigao / grid-cell-path
View on GitHub
Official code for On Path Integration of Grid Cells: Group Representation and Isotropic Scaling (NeurIPS 2021)
☆54Nov 10, 2021Updated 4 years ago
pbaylies / Augmented_CLIP
View on GitHub
Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.
☆60Mar 31, 2022Updated 4 years ago
alexandonian / contrastive-feature-loss
View on GitHub
PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)
☆55Nov 19, 2021Updated 4 years ago
Zasder3 / CLIP-Style-Transfer
View on GitHub
Doing style transfer with linguistic features using OpenAI's CLIP.
☆14May 4, 2021Updated 5 years ago
ahennequ / cuda-tensorcores-register-mapping
View on GitHub
☆19Oct 3, 2022Updated 3 years ago
zzd1992 / Image-Local-Attention
View on GitHub
A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.
☆141Dec 21, 2021Updated 4 years ago
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Mar 25, 2026Updated 4 months ago
TAU-MLwell / Set-Tree
View on GitHub
Official repository for the paper: "Trees with Attention for Set Prediction Tasks" (ICML21)
☆10Jan 19, 2022Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
unixpickle / vae-textures
View on GitHub
Texture mapping with variational auto-encoders
☆40Oct 1, 2021Updated 4 years ago
allfed / allfed-integrated-model
View on GitHub
Integrated model to calculate the effects of resilient foods in catastrophic events
☆11Updated this week
philipperemy / keras-snail-attention
View on GitHub
SNAIL Attention Block for Keras.
☆17Mar 30, 2020Updated 6 years ago
dribnet / clipit_old
View on GitHub
VQGAN+CLIP with some additional tuning. For notebooks and the command line.
☆50Aug 20, 2021Updated 4 years ago
crowsonkb / cloob-training
View on GitHub
CLOOB training (JAX) and inference (JAX and PyTorch)
☆76May 16, 2022Updated 4 years ago
gauravdhama / eigengame_deepmind
View on GitHub
A basic implementation of the paper Eigengame : PCA as a Nash Equilibrium
☆21Jun 7, 2021Updated 5 years ago
SamuelSchmidgall / EvolutionarySelfReplication
View on GitHub
Produce intelligence by means of natural selection without objective/reward optimization
☆16Sep 29, 2021Updated 4 years ago
antofuller / configaformers
View on GitHub
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆48Nov 30, 2021Updated 4 years ago
afqueiruga / StatefulOdeNets
View on GitHub
Refining continuous-in-depth neural networks
☆41Nov 14, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Hoversquid / MLAnimator
View on GitHub
Repo for storing the files I use to make animations with big-sleep, deep-daze, and VQGAN + CLIP.
☆16Sep 14, 2021Updated 4 years ago
iechevarria / lego-face-VAE
View on GitHub
Variational autoencoder for Lego minifig faces
☆16May 22, 2023Updated 3 years ago
cfoster0 / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆88Mar 6, 2022Updated 4 years ago
duskvirkus / alias-free-gan
View on GitHub
Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.
☆76Aug 3, 2023Updated 2 years ago
pbaylies / clustering-laion400m
View on GitHub
Script and models for clustering LAION-400m CLIP embeddings.
☆26Jan 10, 2022Updated 4 years ago
MadryLab / EditingClassifiers
View on GitHub
☆96Oct 27, 2022Updated 3 years ago
georgepar / gmmhmm-pytorch
View on GitHub
Pytorch implementations of GMM - HMM
☆10Dec 28, 2020Updated 5 years ago