deep-spin / sparse-communicationLinks

☆12

Alternatives and similar repositories for sparse-communication

Users that are interested in sparse-communication are comparing it to the libraries listed below

Sorting:

deep-spin / sparse-marginalization-lvm
Official PyTorch (Lightning) implementation of the NeurIPS 2020 paper "Efficient Marginalization of Discrete and Structured Latent Variab…
☆28Updated 4 years ago
xuanlinli17 / autoregressive_inference
Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)
☆12Updated last year
zomux / lanmt-ebm
lanmt ebm
☆12Updated 5 years ago
RakitinDen / pytorch-recursive-gumbel-max-trick
Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021
☆13Updated 3 years ago
wouterkool / estimating-gradients-without-replacement
Estimating Gradients for Discrete Random Variables by Sampling without Replacement
☆40Updated 5 years ago
FranxYao / RDP
Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization
☆14Updated 3 years ago
deep-spin / understanding-spigot
Code for the paper "Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning"
☆11Updated 4 years ago
robert-lieck / RBN
Recursive Bayesian Networks
☆11Updated 2 months ago
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Updated 5 years ago
bergen / EdgeTransformer
☆22Updated 3 years ago
KurochkinAlexey / AntisymmetricRNN
Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"
☆15Updated 6 years ago
kingofspace0wzz / wae-rnf-lm
Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…
☆62Updated 5 years ago
thjashin / rodeo
Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)
☆17Updated last year
deep-spin / sparse_continuous_distributions
This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.
☆16Updated 2 years ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆19Updated 2 years ago
iesl / s-diora
☆12Updated 4 years ago
agadetsky / pytorch-pl-variance-reduction
[AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution
☆38Updated 4 years ago
RobertCsordas / ndr
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆33Updated last month
timvieira / vocrf
Variable-order CRFs with structure learning
☆16Updated last year
sustcsonglin / gated_linear_attention_layer
☆32Updated last year
deep-spin / lp-sparsemap
LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
☆41Updated last year
srush / ProbTalk
☆29Updated 3 years ago
belindal / TaskBench500
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Updated 3 years ago
swiseman / neighbor-splicing
☆12Updated 3 years ago
zhangjiong724 / spectral-RNN
STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION
☆16Updated 7 years ago
srush / mamba-scans
Blog post
☆17Updated last year
da03 / criticize_text_generation
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆11Updated 2 years ago
LZhengisme / CODA
Implementation of Cascaded Head-colliding Attention (ACL'2021)
☆11Updated 3 years ago
yaohungt / TransformerDissection
[EMNLP'19] Summary for Transformer Understanding
☆53Updated 5 years ago
ermongroup / SPN_Variational_Inference
PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020
☆17Updated 3 years ago