BobMcDear / simsiam-pytorch
PyTorch implementation of SimSiam
☆8 · Updated 2 years ago
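SimSiam's training objective, a symmetrized negative cosine similarity with a stop-gradient on the target branch, can be sketched as below. This is a minimal illustration of the general technique, not code from this repository; the function name and tensor shapes are illustrative.

```python
import torch
import torch.nn.functional as F

def simsiam_loss(p1: torch.Tensor, z1: torch.Tensor,
                 p2: torch.Tensor, z2: torch.Tensor) -> torch.Tensor:
    """Symmetrized negative cosine similarity between predictor outputs
    (p1, p2) and projections (z1, z2) of two augmented views.

    The stop-gradient (.detach()) on the projection branch is the key
    ingredient that prevents representational collapse in SimSiam.
    """
    # Each term compares one view's prediction to the other view's
    # (detached) projection; averaging over the batch gives a scalar.
    d1 = -F.cosine_similarity(p1, z2.detach(), dim=-1).mean()
    d2 = -F.cosine_similarity(p2, z1.detach(), dim=-1).mean()
    return 0.5 * d1 + 0.5 * d2
```

The loss is bounded in [-1, 1] and reaches -1 when each prediction is perfectly aligned with the other view's projection.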
Alternatives and similar repositories for simsiam-pytorch:
Users interested in simsiam-pytorch are comparing it to the repositories listed below.
- ☆9 · Updated last year
- Blog post ☆16 · Updated last year
- Layerwise Batch Entropy Regularization ☆22 · Updated 2 years ago
- Code for the DDP tutorial ☆32 · Updated 2 years ago
- Code for testing DCT plus Sparse (DCTpS) networks ☆14 · Updated 3 years ago
- ☆24 · Updated 4 months ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496) ☆14 · Updated 3 months ago
- Parallel Associative Scan for Language Models ☆18 · Updated last year
- Scalable Computation of Hessian Diagonals ☆13 · Updated 8 months ago
- Code accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆27 · Updated 4 years ago
- ☆30 · Updated 3 months ago
- CIFAR10 ResNets implemented in JAX+Flax ☆12 · Updated 2 years ago
- ☆52 · Updated 4 months ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization". ☆32 · Updated 3 years ago
- ☆31 · Updated 10 months ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments. ☆14 · Updated 4 months ago
- ☆19 · Updated 2 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation ☆12 · Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU training ☆48 · Updated last year
- Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization ☆16 · Updated 6 years ago
- General Invertible Transformations for Flow-based Generative Models ☆17 · Updated 4 years ago
- Latest Weight Averaging (NeurIPS HITY 2022) ☆28 · Updated last year
- ☆49 · Updated 7 months ago
- Official code for the paper "Attention as a Hypernetwork" ☆24 · Updated 8 months ago
- Code for the PAPA paper ☆27 · Updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆80 · Updated 3 years ago
- Official repository for Efficient Linear-Time Attention Transformers. ☆18 · Updated 8 months ago
- A PyTorch implementation of the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f… ☆44 · Updated 5 years ago
- ☆24 · Updated 2 years ago