EleutherAI/training-jacobian

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EleutherAI/training-jacobian)

EleutherAI / training-jacobian

☆24

Alternatives and similar repositories for training-jacobian

Users that are interested in training-jacobian are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fastai / fastpy
View on GitHub
An easy way to start a python programming environment using GitHub Codespaces.
☆15Sep 9, 2020Updated 5 years ago
edwardmilsom / function-space-learning-rates-paper
View on GitHub
Code for the paper "Function-Space Learning Rates"
☆23Jun 3, 2025Updated last year
SDLAML / disco
View on GitHub
☆16Dec 11, 2025Updated 7 months ago
Blkalkin / Optimal-TestTime
View on GitHub
☆10Mar 24, 2025Updated last year
berndprach / AOL
View on GitHub
Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks
☆13Aug 9, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Ryu1845 / hyena-jax
View on GitHub
Implementation of Hyena Hierarchy in JAX
☆10Apr 30, 2023Updated 3 years ago
RobertCsordas / onion_representations
View on GitHub
☆13Aug 19, 2024Updated last year
MaheepChaudhary / SAE-Ravel
View on GitHub
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13Jan 26, 2025Updated last year
graphcore-research / out-of-the-box-fp8-training
View on GitHub
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
☆46Jul 17, 2024Updated 2 years ago
EleutherAI / rnngineering
View on GitHub
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33May 25, 2024Updated 2 years ago
OpenNLPLab / ETSC-Exact-Toeplitz-to-SSM-Conversion
View on GitHub
[EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…
☆14Oct 17, 2023Updated 2 years ago
crowsonkb / jax-wavelets
View on GitHub
The 2D discrete wavelet transform for JAX
☆45Feb 28, 2023Updated 3 years ago
rpatrik96 / nl-causal-representations
View on GitHub
This is the code for the paper Jacobian-based Causal Discovery with Nonlinear ICA, demonstrating how identifiable representations (partic…
☆22Sep 5, 2024Updated last year
zhichaoxu-shufe / context-aware-decoding-qfs
View on GitHub
☆14Jan 10, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Arongil / lipschitz-transformers
View on GitHub
Don't just regulate gradients like in Muon, regulate the weights too
☆32Jul 30, 2025Updated 11 months ago
FlyingPumba / InterpBench
View on GitHub
A benchmark for mechanistic discovery of circuits in Transformers
☆17Dec 15, 2024Updated last year
damek / specgd
View on GitHub
Code to generate figures of paper "When do spectral gradient updates help in deep learning?"
☆16Dec 3, 2025Updated 7 months ago
howard-hou / EmbeddingRWKV
View on GitHub
A high-efficiency text embedding and reranking model based on RWKV architecture.
☆20Jan 10, 2026Updated 6 months ago
darklife / udarkrisc
View on GitHub
u[Dark]RISC -- "micro-darkrisc" -- an early 16-bit micro-RISC processor defined before DarkRISCV
☆19Jul 25, 2023Updated 3 years ago
recursal / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆46Jul 20, 2024Updated 2 years ago
tweetdeck-archive / txAMQP
View on GitHub
Twisted client library for AMQP (tested against RabbitMQ). This is a mirror and fork of the launchpad project: https://launchpad.net/txam…
☆18May 23, 2012Updated 14 years ago
PredictiveIntelligenceLab / ActNet
View on GitHub
Repository for some of the experiments presented in the paper "Deep Learning Alternatives of the Kolmogorov Superposition Theorem", Spotl…
☆22Mar 28, 2025Updated last year
leloykun / adaptive-muon
View on GitHub
A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…
☆19Jan 11, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
RWKV-Vibe / rwkv-kit
View on GitHub
☆24Dec 28, 2024Updated last year
afiaka87 / latent-diffusion-deepspeed
View on GitHub
Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)
☆36Apr 17, 2022Updated 4 years ago
thestephencasper / latent_adversarial_training
View on GitHub
☆24Jul 25, 2024Updated 2 years ago
sarahscheffler / mapofcrypto
View on GitHub
A map of relationships among cryptographic primitives
☆12Dec 17, 2018Updated 7 years ago
cchan / fp8_mul
View on GitHub
A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.
☆14Nov 23, 2022Updated 3 years ago
yanji84 / deep-recurrent-attention-model
View on GitHub
Apply reinforcement learning to visual attention
☆18Oct 13, 2016Updated 9 years ago
TristanThrush / perplexity-correlations
View on GitHub
Simple and scalable tools for data-driven pretraining data selection.
☆30Jun 9, 2025Updated last year
main-horse / hnet-old
View on GitHub
H-Net Dynamic Hierarchical Architecture
☆81Sep 11, 2025Updated 10 months ago
bpowers / liquid-types
View on GitHub
Logically Qualified Data Types - automatically infer refinement types
☆17Aug 24, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
srush / triton-autodiff
View on GitHub
Experiment of using Tangent to autodiff triton
☆81Jan 22, 2024Updated 2 years ago
ryanwebster90 / snip-dedup
View on GitHub
☆104Jan 26, 2024Updated 2 years ago
TeunvdWeij / sandbagging
View on GitHub
☆20Nov 15, 2024Updated last year
nikhilvyas / SOAP_MUON
View on GitHub
Combining SOAP and MUON
☆23Feb 11, 2025Updated last year
PygmalionAI / logbooks
View on GitHub
Where we keep our notes about model training runs.
☆16Mar 12, 2023Updated 3 years ago
kyleliang919 / Super_Muon
View on GitHub
☆68Mar 21, 2025Updated last year
TinyTapeout / tt-multiplexer
View on GitHub
☆20May 25, 2026Updated last month