ravidziv / SimplifyingImbalancedTrainingLinks

☆8

Alternatives and similar repositories for SimplifyingImbalancedTraining

Users that are interested in SimplifyingImbalancedTraining are comparing it to the libraries listed below

Sorting:

dlmacedo / distinction-maximization-loss
A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y…
☆45Updated 2 years ago
SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26Updated last year
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago
lucidrains / compositional-attention-pytorch
Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…
☆51Updated 3 years ago
lucidrains / logavgexp-torch
Implementation of LogAvgExp for Pytorch
☆36Updated 3 months ago
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆26Updated 3 years ago
data2ml / all-clip
Load any clip model with a standardized interface
☆21Updated last year
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated 8 months ago
crowsonkb / torch-dist-utils
Utilities for PyTorch distributed
☆24Updated 5 months ago
facebookresearch / adaptive_scheduling
Experimental scripts for researching data adaptive learning rate scheduling.
☆23Updated last year
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
lucidrains / quartic-transformer
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆52Updated 4 months ago
sayakpaul / AdaMatch-TF
Includes additional materials for the following keras.io blog post.
☆12Updated 4 years ago
microsoft / ResiDual
ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802
☆95Updated last year
tkasarla / max-separation-as-inductive-bias
Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704
☆29Updated 2 years ago
facebookresearch / SIE
Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations
☆28Updated last year
lucidrains / zorro-pytorch
Implementation of Zorro, Masked Multimodal Transformer, in Pytorch
☆97Updated last year
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆31Updated 2 years ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38Updated 3 years ago
rom1504 / CLIP
Contrastive Language-Image Pretraining
☆38Updated last year
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆55Updated 11 months ago
layer6ai-labs / calo-forest
A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.
☆18Updated 9 months ago
facebookresearch / ViP-MAE
This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision
☆36Updated 2 years ago
jonkahana / CLIPPR
An official PyTorch implementation for CLIPPR
☆29Updated 2 years ago
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆48Updated 10 months ago
tml-epfl / why-weight-decay
Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]
☆66Updated 10 months ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆58Updated 10 months ago
mkirchhof / url
Uncertainty-aware representation learning (URL) benchmark
☆105Updated 4 months ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆32Updated 2 years ago
jiaweizzhao / ZerO-initialization
☆74Updated 2 years ago