crypdick / timm-lr-scheduler-explorer

A dashboard for exploring timm learning rate schedulers

☆19

Alternatives and similar repositories for timm-lr-scheduler-explorer

Users who are interested in timm-lr-scheduler-explorer are comparing it to the libraries listed below.

data2ml / all-clip
Load any CLIP model with a standardized interface
☆21 Updated last year
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆48 Updated 9 months ago
facebookresearch / adaptive_scheduling
Experimental scripts for researching data adaptive learning rate scheduling.
☆23 Updated last year
eth-easl / fmengine
Utilities for Training Very Large Models
☆58 Updated 10 months ago
lucidrains / tableformer-pytorch
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39 Updated 3 years ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21 Updated last year
lucidrains / transformer-lm-gan
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆37 Updated 5 months ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49 Updated 3 years ago
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆35 Updated 2 years ago
lucidrains / holodeck-pytorch
Implementation of a holodeck, written in Pytorch
☆18 Updated last year
layer6ai-labs / calo-forest
A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.
☆18 Updated 9 months ago
crowsonkb / torch-dist-utils
Utilities for PyTorch distributed
☆24 Updated 5 months ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆32 Updated 2 years ago
lucidrains / quartic-transformer
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆52 Updated 4 months ago
google-research / precondition
☆31 Updated last month
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24 Updated last week
rwightman / imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
☆21 Updated 2 years ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆44 Updated 2 years ago
microsoft / ResiDual
ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802
☆95 Updated last year
ravidziv / SimplifyingImbalancedTraining
☆8 Updated last year
lucidrains / mixture-of-attention
Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts
☆120 Updated 9 months ago
ChristophReich1996 / HyperMixer
PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].
☆17 Updated 3 years ago
lucidrains / hourglass-transformer-pytorch
Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI
☆91 Updated 3 years ago
patil-suraj / vit-vqgan
JAX implementation of ViT-VQGAN
☆83 Updated 2 years ago
lucidrains / MaMMUT-pytorch
Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch
☆103 Updated last year
graphcore-research / jax-scalify
JAX Scalify: end-to-end scaled arithmetics
☆16 Updated 9 months ago
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆58 Updated last year
lucidrains / GAF-microbatch-pytorch
Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch
☆25 Updated 6 months ago
lucidrains / self-reasoning-tokens-pytorch
Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto
☆56 Updated last year
EleutherAI / training-jacobian
☆23 Updated 7 months ago