OATML / non-parametric-transformersLinks

Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"

☆415

Alternatives and similar repositories for non-parametric-transformers

Users that are interested in non-parametric-transformers are comparing it to the libraries listed below

Sorting:

f-dangel / cockpit
Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
☆484Updated 3 years ago
google-research / robustness_metrics
☆471Updated last month
google-deepmind / enn
☆312Updated 8 months ago
uclnlp / torch-imle
Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions
☆259Updated 2 years ago
archinetai / surgeon-pytorch
A library to inspect and extract intermediate layers of PyTorch models.
☆474Updated 3 years ago
mlpen / Nystromformer
☆386Updated 2 years ago
suinleelab / path_explain
A repository for explaining feature attributions and feature interactions in deep neural networks.
☆191Updated 3 years ago
SirRob1997 / Crowded-Valley---Results
This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184Updated 4 years ago
google-research / fast-soft-sort
Fast Differentiable Sorting and Ranking
☆612Updated last year
kzl / universal-computation
Official codebase for Pretrained Transformers as Universal Computation Engines.
☆247Updated 3 years ago
brohrer / sharpened-cosine-similarity
An alternative to convolution in neural networks
☆258Updated last year
teddykoker / torchsort
Fast, differentiable sorting and ranking in PyTorch
☆844Updated 5 months ago
facebookincubator / flowtorch
This library would form a permanent home for reusable components for deep probabilistic programming. The library would form and harness a…
☆309Updated 4 months ago
leopard-ai / betty
Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization
☆343Updated last year
Kaleidophon / deep-significance
Enabling easy statistical significance testing for deep neural networks.
☆338Updated last year
nalexai / hyperlib
Library that contains implementations of machine learning components in the hyperbolic space
☆143Updated last year
izmailovpavel / understandingbdl
☆251Updated 2 years ago
mle-infrastructure / mle-hyperopt
Lightweight Hyperparameter Optimization 🚂
☆149Updated last year
ml-jku / hopular
Hopular: Modern Hopfield Networks for Tabular Data
☆312Updated 3 years ago
facebookresearch / ppuda
Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)
☆492Updated 2 years ago
g-benton / loss-surface-simplexes
☆100Updated 3 years ago
decile-team / cords
Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using co…
☆343Updated 2 years ago
lkhphuc / lightning-hydra-template
Deep Learning project template best practices with Pytorch Lightning, Hydra, Tensorboard.
☆159Updated 4 years ago
cvxgrp / pymde
Minimum-distortion embedding with PyTorch
☆563Updated 4 months ago
f-dangel / backpack
BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.
☆601Updated 10 months ago
y0ast / slurm-for-ml
A Machine Learning workflow for Slurm.
☆151Updated 4 years ago
louislva / deepmind-perceiver
My implementation of DeepMind's Perceiver
☆63Updated 4 years ago
Felix-Petersen / diffsort
Differentiable Sorting Networks
☆124Updated 2 years ago
awslabs / adatune
Gradient based Hyperparameter Tuning library in PyTorch
☆290Updated 5 years ago
ray-project / ray_lightning
Pytorch Lightning Distributed Accelerators using Ray
☆215Updated 2 years ago