nreHieW / lossLinks

Visualising Losses in Deep Neural Networks

☆16

Alternatives and similar repositories for loss

Users that are interested in loss are comparing it to the libraries listed below

Sorting:

huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21Updated 10 months ago
sayakpaul / big_vision_experiments
Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.
☆22Updated 2 years ago
EleutherAI / training-jacobian
☆23Updated 6 months ago
crowsonkb / dice-mc
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆31Updated last year
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆48Updated 8 months ago
lucidrains / holodeck-pytorch
Implementation of a holodeck, written in Pytorch
☆18Updated last year
data2ml / all-clip
Load any clip model with a standardized interface
☆21Updated last year
crypdick / timm-lr-scheduler-explorer
A dashboard for exploring timm learning rate schedulers
☆19Updated 7 months ago
thecharlieblake / lovely-llama
An implementation of the Llama architecture, to instruct and delight
☆21Updated 3 weeks ago
sayakpaul / BiT-jax2tf
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14Updated 3 years ago
lucidrains / local-attention-flax
Local Attention - Flax module for Jax
☆22Updated 4 years ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 2 months ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
Zasder3 / open_clip_juwels
An open source implementation of CLIP.
☆32Updated 2 years ago
lucidrains / self-reasoning-tokens-pytorch
Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto
☆56Updated last year
lucidrains / quartic-transformer
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆51Updated 3 months ago
okarthikb / state-space-models
☆27Updated 11 months ago
facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Updated last year
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆58Updated last year
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆25Updated 3 years ago
lucidrains / GAF-microbatch-pytorch
Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch
☆25Updated 5 months ago
cloneofsimo / zeroshampoo
☆34Updated 9 months ago
catid / spectral_ssm
Implementation of Spectral State Space Models
☆16Updated last year
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆17Updated 3 months ago
lucidrains / transformer-lm-gan
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆37Updated 4 months ago
ahennequ / pytorch-custom-mma
☆29Updated 2 years ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆44Updated 2 years ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
lernapparat / torchhacks
Hacks for PyTorch
☆19Updated 2 years ago
lucidrains / tableformer-pytorch
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Updated 3 years ago