rahul13ramesh / compositional_capabilities
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
☆10 Updated 10 months ago
Alternatives and similar repositories for compositional_capabilities
Users who are interested in compositional_capabilities are comparing it to the repositories listed below
- Codebase for Mechanistic Mode Connectivity ☆14 Updated last year
- ☆41 Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆36 Updated 2 years ago
- ☆53 Updated 9 months ago
- ☆17 Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- ☆33 Updated 2 years ago
- ☆37 Updated 3 years ago
- SGD with large step sizes learns sparse features [ICML 2023] ☆32 Updated 2 years ago
- ☆32 Updated 7 months ago
- Experiments for Meta-Learning Symmetries by Reparameterization ☆56 Updated 4 years ago
- Very deep VAEs in JAX/Flax ☆46 Updated 3 years ago
- ☆18 Updated 2 years ago
- ☆19 Updated 3 years ago
- Official repo for the paper "Weight-based Decomposition: A Case for Bilinear MLPs" ☆21 Updated 5 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆58 Updated last year
- Code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation" ☆26 Updated 2 years ago
- Code for Unbiased Implicit Variational Inference (UIVI) ☆14 Updated 6 years ago
- Jupyter Notebook corresponding to 'Going with the Flow: An Introduction to Normalizing Flows' ☆26 Updated 4 years ago
- PyTorch implementation of Continuously Indexed Flows paper, with many baseline normalising flows ☆31 Updated 3 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models ☆44 Updated 2 years ago
- ☆24 Updated 6 years ago
- Blog post ☆17 Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 Updated 3 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes" ☆19 Updated last year
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J ☆68 Updated last year
- ☆20 Updated last year
- Code for minimum-entropy coupling. ☆31 Updated 10 months ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023) ☆11 Updated last year
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆106 Updated 4 years ago