brendel-group / objects-compositional-generalizationLinks
Official code for the paper "Provable Compositional Generalization for Object-Centric Learning" (ICLR 2024, oral)
☆14Updated 11 months ago
Alternatives and similar repositories for objects-compositional-generalization
Users that are interested in objects-compositional-generalization are comparing it to the libraries listed below
Sorting:
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated last year
- Latent Diffusion Language Models☆69Updated last year
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation☆35Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022)☆31Updated 2 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆136Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆59Updated 3 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆27Updated last year
- ☆104Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆89Updated last year
- Official repo for the paper "Weight-based Decomposition: A Case for Bilinear MLPs"☆22Updated last week
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆49Updated 10 months ago
- Official code for the paper "Attention as a Hypernetwork"☆40Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- ☆32Updated last year
- ☆27Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆66Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆80Updated last year
- ☆28Updated last year
- A centralized place for deep thinking code and experiments☆85Updated last year
- ☆51Updated last year
- Understanding how features learned by neural networks evolve throughout training☆36Updated 9 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 10 months ago
- ☆45Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 7 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆47Updated 6 months ago