brohrer / sharpened-cosine-similarity
An alternative to convolution in neural networks
☆254 · Updated last year
Alternatives and similar repositories for sharpened-cosine-similarity
Users who are interested in sharpened-cosine-similarity are comparing it to the libraries listed below.
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch ☆252 · Updated 2 years ago
- Convert scikit-learn models to PyTorch modules ☆161 · Updated last year
- A library to inspect and extract intermediate layers of PyTorch models. ☆473 · Updated 3 years ago
- Cyclemoid implementation for PyTorch ☆89 · Updated 3 years ago
- Deep Learning project template best practices with Pytorch Lightning, Hydra, Tensorboard. ☆159 · Updated 4 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆127 · Updated 2 years ago
- Memory mapped numpy arrays of varying shapes ☆298 · Updated 11 months ago
- Unofficial JAX implementations of deep learning research papers ☆156 · Updated 2 years ago
- Minimal standalone example of diffusion model ☆158 · Updated 2 years ago
- My implementation of DeepMind's Perceiver ☆63 · Updated 4 years ago
- Lightweight Hyperparameter Optimization 🚂 ☆147 · Updated 9 months ago
- Named tensors with first-class dimensions for PyTorch ☆329 · Updated last year
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities ☆209 · Updated last year
- The most parameter efficient machine learning models on a few popular benchmarks ☆42 · Updated 3 years ago
- Hopular: Modern Hopfield Networks for Tabular Data ☆310 · Updated 3 years ago
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks ☆479 · Updated 2 years ago
- Probing the representations of Vision Transformers. ☆323 · Updated 2 years ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning. ☆207 · Updated last year
- A Pytree Module system for Deep Learning in JAX ☆214 · Updated 2 years ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX ☆179 · Updated 3 weeks ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆203 · Updated last year
- NumPy arrays, ready for human consumption ☆69 · Updated 2 weeks ago
- Ranger deep learning optimizer rewrite to use newest components ☆329 · Updated last year
- ☆376 · Updated last year
- Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more" ☆196 · Updated 2 years ago
- Official code for the Stochastic Polyak step-size optimizer ☆139 · Updated 11 months ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions ☆258 · Updated last year
- Code release for "Dropout Reduces Underfitting" ☆313 · Updated 2 years ago
- 🧀 Pytorch code for the Fromage optimiser. ☆124 · Updated 10 months ago
- This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers" ☆180 · Updated 3 years ago