LCS2-IIITD / TransEvolve

☆11

Alternatives and similar repositories for TransEvolve

Users who are interested in TransEvolve are comparing it to the repositories listed below.

mfederici / dsit
Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"
☆25 Updated 3 years ago
gcucurull / jax-gat
JAX implementation of Graph Attention Networks
☆13 Updated 3 years ago
antofuller / configaformers
A Python library for highly configurable transformers, easing model architecture search and experimentation.
☆49 Updated 3 years ago
CHARM-Tx / linear_mem_attention_pytorch
Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch
☆12 Updated 3 years ago
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25 Updated 4 years ago
jmtomczak / git_flow
General Invertible Transformations for Flow-based Generative Models
☆18 Updated 4 years ago
Holmeswww / PPOGAN
☆24 Updated last year
google-deepmind / ssl_hsic
☆37 Updated 11 months ago
giannisdaras / smyrf
[NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".
☆50 Updated last year
juliagusak / neural-ode-metasolver
Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561
☆25 Updated 4 years ago
samiraabnar / Reflect
Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"
☆14 Updated 5 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49 Updated 3 years ago
facebookresearch / grounding-inductive-biases
Reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"
☆17 Updated 9 months ago
stanfordmlgroup / disentanglement
Official repository for our ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology
☆36 Updated 4 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19 Updated 2 years ago
yaohungt / Pointwise_Dependency_Neural_Estimation
☆20 Updated 5 years ago
ryoungj / mcbits
[ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
☆14 Updated 4 years ago
shwinshaker / LipGrow
An adaptive training algorithm for residual networks
☆15 Updated 4 years ago
acmi-lab / pretraining-with-nonsense
Pretraining summarization models using a corpus of nonsense
☆13 Updated 3 years ago
ermongroup / f-wgan
Code for "Bridging the Gap between f-GANs and Wasserstein GANs", ICML 2020
☆14 Updated 4 years ago
matbun / EBM--Generative-Energy-Based-Modeling
Energy Based Models are a quite novel technique for density estimation. In this university project I explore this new research topic and …
☆16 Updated 4 years ago
imd-iclr / imd
The Shape of Data: Intrinsic Distance for Comparing Data Distributions
☆12 Updated 5 years ago
OliverRichter / normalized-attention
Code publication to the paper "Normalized Attention Without Probability Cage"
☆16 Updated 3 years ago
lucidrains / remixer-pytorch
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36 Updated 3 years ago
lucidrains / learning-to-expire-pytorch
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34 Updated 4 years ago
AranKomat / Metroplex
☆21 Updated 2 years ago
lucidrains / isab-pytorch
An implementation of (Induced) Set Attention Block, from the Set Transformers paper
☆60 Updated 2 years ago
lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16 Updated 3 years ago
jiwoongim / Online-Hyperparameter-Optimization-by-real-time-recurrent-learning
Online Hyperparameter Optimization
☆10 Updated 4 years ago
salesforce / NeuralBayes
☆24 Updated 2 months ago