ReiherGroup / CoRe_optimizerLinks

Continual Resilient (CoRe) Optimizer for PyTorch

☆10

Alternatives and similar repositories for CoRe_optimizer

Users that are interested in CoRe_optimizer are comparing it to the libraries listed below

Sorting:

layer6ai-labs / calo-forest
A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.
☆18Updated 8 months ago
lucidrains / simplicial-attention
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…
☆34Updated this week
lucidrains / transformer-lm-gan
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆37Updated 4 months ago
AndyShih12 / LongHorizonTemperatureScaling
PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023
☆20Updated 2 years ago
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated 7 months ago
graphcore-research / jax-scalify
JAX Scalify: end-to-end scaled arithmetics
☆16Updated 8 months ago
adihaviv / nopos
☆22Updated last year
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆58Updated last year
lucidrains / insertion-deletion-ddpm
Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models
☆30Updated 3 years ago
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆48Updated 9 months ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19Updated last month
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
facebookresearch / MultiModalExplorer
Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…
☆27Updated last year
RE-N-Y / sae
☆17Updated 7 months ago
fishmingyu / GeoT
GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU
☆23Updated 3 months ago
Eliyas0007 / Pytorch-Intention
Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention
☆12Updated 2 years ago
fal-ai-community / minDDPD
☆33Updated 6 months ago
sustcsonglin / gated_linear_attention_layer
☆32Updated last year
ajayjain / journey-diffusion-samplers
Code for "Journey to the BAOAB-limit: finding effective MCMC samplers for score-based models". See more at https://ajayj.com/journey.
☆12Updated 2 years ago
annosubmission / GRC-Cache
☆16Updated 2 years ago
kyegomez / SoundStream
Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆12Updated 5 months ago
lucidrains / strassen-attention
Implementation of Strassen attention, from Kozachinskiy et al. of National Center of AI in Chile
☆26Updated this week
microsoft / ResiDual
ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802
☆94Updated last year
plutonium-239 / memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation
☆11Updated 10 months ago
tencent-ailab / TriNet
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
☆26Updated 2 years ago
CHARM-Tx / linear_mem_attention_pytorch
Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch
☆12Updated 3 years ago
lucidrains / coordinate-descent-attention
Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk
☆46Updated last year
tchaton / pytorch2lightning
☆15Updated 3 years ago
crowsonkb / torch-dist-utils
Utilities for PyTorch distributed
☆24Updated 4 months ago
GoGoDuck912 / pytorch-vector-quantization
A Pytorch Implementations for Various Vector Quantization Methods
☆30Updated 3 years ago