Code for "Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning"
β416Mar 21, 2024Updated last year
Alternatives and similar repositories for non-parametric-transformers
Users that are interested in non-parametric-transformers are comparing it to the libraries listed below
Sorting:
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paperβ105Nov 20, 2021Updated 4 years ago
- Lightweight Cluster/Cloud VM Job Management πβ42Aug 27, 2024Updated last year
- β30Jan 17, 2022Updated 4 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'β38Dec 4, 2021Updated 4 years ago
- β100Dec 8, 2021Updated 4 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561β25Mar 30, 2021Updated 4 years ago
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual trβ¦β1,627Mar 25, 2022Updated 3 years ago
- Lightweight ML Experiment Logging πβ81Aug 26, 2024Updated last year
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networksβ488Jul 1, 2022Updated 3 years ago
- Official code Cross-Covariance Image Transformer (XCiT)β674Sep 28, 2021Updated 4 years ago
- Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"β81Mar 4, 2022Updated 3 years ago
- Fast and Easy Infinite Neural Networks in Pythonβ2,375Mar 1, 2024Updated 2 years ago
- Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentationβ69Dec 9, 2020Updated 5 years ago
- β21Mar 15, 2023Updated 2 years ago
- Self-Similarity Priors: Neural Collages as Differentiable Fractal Representationsβ29Nov 26, 2022Updated 3 years ago
- A library for programmatically generating equivariant layers through constraint solvingβ281May 8, 2023Updated 2 years ago
- β388Oct 18, 2023Updated 2 years ago
- β252Dec 27, 2022Updated 3 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).β228Apr 18, 2022Updated 3 years ago
- Code for the paper "True Few-Shot Learning in Language Models" (https://arxiv.org/abs/2105.11447)β143Oct 25, 2021Updated 4 years ago
- β11Apr 14, 2022Updated 3 years ago
- Minimum-distortion embedding with PyTorchβ579Updated this week
- ICML 2020 Paper: Latent Variable Modelling with Hyperbolic Normalizing Flowsβ54Dec 8, 2022Updated 3 years ago
- My implementation of DeepMind's Perceiverβ63Apr 23, 2021Updated 4 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.neβ¦β116Nov 30, 2022Updated 3 years ago
- Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images"β451Apr 28, 2023Updated 2 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributionsβ259Oct 29, 2023Updated 2 years ago
- Energy-based models for atomic-resolution protein conformationsβ103Mar 22, 2022Updated 3 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-trainingβ787Feb 9, 2023Updated 3 years ago
- Code to reproduce the results for Compositional Attentionβ59Nov 16, 2022Updated 3 years ago
- This repository contains the code for our CVPR 2022 paper on "Integrating Language Guidance into Vision-based Deep Metric Learning".β44Aug 9, 2022Updated 3 years ago
- β113Sep 23, 2022Updated 3 years ago
- Probabilistic Solution of Differential Equationsβ13Jun 19, 2022Updated 3 years ago
- High-quality implementations of standard and SOTA methods on a variety of tasks.β1,567Feb 2, 2026Updated last month
- Differentiable SDE solvers with GPU support and efficient sensitivity analysis.β1,704Dec 30, 2024Updated last year
- Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.β1,472May 2, 2025Updated 10 months ago
- Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.β2,773Apr 29, 2024Updated last year
- Official codebase for Pretrained Transformers as Universal Computation Engines.β246Jan 14, 2022Updated 4 years ago
- A library for evaluating representations.β77Nov 21, 2021Updated 4 years ago