sc782 / SBM-Transformer
☆13Updated 2 years ago
Alternatives and similar repositories for SBM-Transformer:
Users who are interested in SBM-Transformer are comparing it to the libraries listed below.
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆40Updated last year
- [NeurIPS'23 Spotlight] Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance (LPS), in PyTorch☆29Updated last year
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆32Updated 8 months ago
- ☆31Updated 6 months ago
- PyTorch implementation of neural processes and variants☆28Updated 9 months ago
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆18Updated 2 months ago
- Bayesian Attention Modules☆35Updated 4 years ago
- ☆22Updated 3 years ago
- [ICLR 2024 Oral] Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness.☆16Updated last year
- ☆9Updated 2 years ago
- Code to reproduce the results for Compositional Attention☆60Updated 2 years ago
- Simple illustrative examples for energy-based models in PyTorch☆62Updated 5 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆82Updated 2 years ago
- ☆17Updated last year
- ☆62Updated 3 years ago
- Official PyTorch implementation of "Energy-Based Contrastive Learning of Visual Representations", NeurIPS 2022 Oral Paper☆10Updated 2 years ago
- ☆15Updated 2 years ago
- Implementation for our paper "How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad"☆12Updated 10 months ago
- [NeurIPS'21] Higher-order Transformers for sets, graphs, and hypergraphs, in PyTorch☆65Updated 2 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Updated last year
- ☆17Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆63Updated 9 months ago
- Code for paper "Compositional Sculpting of Iterative Generative Processes"☆21Updated last year
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆62Updated 3 years ago
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.☆36Updated last year
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆44Updated 2 years ago
- ☆36Updated 4 years ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆14Updated 8 months ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions"☆54Updated last year
- ☆51Updated 2 years ago