yizhangzzz / transformers-lego
☆18Updated 2 years ago
Alternatives and similar repositories for transformers-lego:
Users that are interested in transformers-lego are comparing it to the libraries listed below
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…☆46Updated last year
- PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020☆16Updated 3 years ago
- ZeroC is a neuro-symbolic method that trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com…☆30Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆58Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆55Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆71Updated 2 years ago
- ☆24Updated 2 years ago
- ☆21Updated 4 months ago
- Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"☆24Updated 4 years ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆103Updated last year
- ☆35Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆15Updated last year
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆9Updated last year
- ☆22Updated 3 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ☆35Updated 2 years ago
- ☆26Updated last year
- ☆17Updated 2 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆62Updated 2 years ago
- Quantification of Uncertainty with Adversarial Models☆27Updated last year
- ☆21Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- ☆31Updated last year
- ☆28Updated last year
- ☆28Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆35Updated last year
- This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.☆16Updated last year
- ☆60Updated 3 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 2 years ago