Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'
☆38Dec 4, 2021Updated 4 years ago
Alternatives and similar repositories for grokking
Users that are interested in grokking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The original weights of some Caffe models, ported to PyTorch.☆11Jan 18, 2022Updated 4 years ago
- ☆10Sep 13, 2021Updated 4 years ago
- Visual search interface☆11Nov 30, 2021Updated 4 years ago
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Self-Similarity Priors: Neural Collages as Differentiable Fractal Representations☆30Nov 26, 2022Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Dec 3, 2021Updated 4 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆57Sep 18, 2022Updated 3 years ago
- Mechanistic Interpretability for Transformer Models☆53Jun 1, 2022Updated 3 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)☆36Apr 17, 2022Updated 4 years ago
- DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.☆14Mar 9, 2022Updated 4 years ago
- ☆19Aug 19, 2021Updated 4 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Mar 31, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆19Oct 3, 2022Updated 3 years ago
- Official code for On Path Integration of Grid Cells: Group Representation and Isotropic Scaling (NeurIPS 2021)☆54Nov 10, 2021Updated 4 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)☆55Nov 19, 2021Updated 4 years ago
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆141Dec 21, 2021Updated 4 years ago
- Doing style transfer with linguistic features using OpenAI's CLIP.☆14May 4, 2021Updated 5 years ago
- Framework for stochastic modelling in systems biology☆12Aug 11, 2022Updated 3 years ago
- Texture mapping with variational auto-encoders☆40Oct 1, 2021Updated 4 years ago
- VQGAN+CLIP with some additional tuning. For notebooks and the command line.☆50Aug 20, 2021Updated 4 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74May 16, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A basic implementation of the paper Eigengame : PCA as a Nash Equilibrium☆21Jun 7, 2021Updated 4 years ago
- Produce intelligence by means of natural selection without objective/reward optimization☆15Sep 29, 2021Updated 4 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆105Nov 20, 2021Updated 4 years ago
- Variational autoencoder for Lego minifig faces☆16May 22, 2023Updated 2 years ago
- Contrastive Language-Audio Pretraining☆88Mar 6, 2022Updated 4 years ago
- Unofficial Alias-Free GAN implementation. Based on rosinality's version with expanded training and inference options.☆76Aug 3, 2023Updated 2 years ago
- ☆92Nov 26, 2021Updated 4 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆26Jan 10, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27Oct 8, 2021Updated 4 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Feb 12, 2022Updated 4 years ago
- ☆96Oct 27, 2022Updated 3 years ago
- ☆27Mar 13, 2021Updated 5 years ago
- An adaptive training algorithm for residual network☆17Aug 22, 2020Updated 5 years ago
- Digital paint mixing program based on the Kubelka-Munk equations. Implementation of : T. Lindemeier, J. M. Gülzow, and O. Deussen. 2018…☆14Sep 10, 2020Updated 5 years ago
- Python Research Framework☆107Nov 3, 2022Updated 3 years ago