janEbert/PyTorch-VeLO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/janEbert/PyTorch-VeLO)

janEbert / PyTorch-VeLO

VeLO optimizer in PyTorch

☆20

Alternatives and similar repositories for PyTorch-VeLO

Users that are interested in PyTorch-VeLO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sail-sg / TEC
View on GitHub
☆17Mar 17, 2023Updated 3 years ago
SonicCodes / subcloning
View on GitHub
implementation of https://arxiv.org/pdf/2312.09299
☆21Jul 3, 2024Updated 2 years ago
Sike-Wang / low-bit-Shampoo
View on GitHub
4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)
☆13Feb 13, 2025Updated last year
opooladz / Preconditioned-Stochastic-Gradient-Descent
View on GitHub
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14Oct 7, 2024Updated last year
pharaouk / dharma
View on GitHub
☆13Apr 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
thomasahle / kanmlps
View on GitHub
KANs and MLPs
☆12Jun 7, 2024Updated 2 years ago
lixilinx / Fully-Trainable-SSM
View on GitHub
A fully trainable state space model (SSM)
☆16Mar 18, 2025Updated last year
evanatyourservice / llm-jax
View on GitHub
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Jul 24, 2025Updated 11 months ago
hongyanz / multibranch
View on GitHub
Codes for the paper "Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex"
☆21Jul 25, 2020Updated 5 years ago
JesseFarebro / flax-mup
View on GitHub
Maximal Update Parametrization (μP) with Flax & Optax.
☆16Dec 27, 2023Updated 2 years ago
google / learned_optimization
View on GitHub
☆811Jul 8, 2026Updated last week
google / drjax
View on GitHub
☆19Jul 8, 2026Updated 2 weeks ago
nreimers / se-pytorch-xla
View on GitHub
☆21Sep 6, 2021Updated 4 years ago
nitincodery / org-dex.el
View on GitHub
☆13May 14, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
vaguenebula / AlpacaDataReflect
View on GitHub
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Mar 29, 2023Updated 3 years ago
halolimat / SpExtor
View on GitHub
SpExtor: Sparse Entity Extractor
☆11Feb 10, 2020Updated 6 years ago
davegurnell / css-selector
View on GitHub
Lift-style CSS selector transforms based on Scalate's Scuery
☆10Aug 23, 2012Updated 13 years ago
dengyang17 / LLM-Proactive
View on GitHub
☆15Nov 23, 2023Updated 2 years ago
Linzwcs / AFT
View on GitHub
☆13Jan 22, 2025Updated last year
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
jmanhype / Storm
View on GitHub
☆13Mar 25, 2026Updated 3 months ago
HazyResearch / embroid
View on GitHub
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Aug 12, 2023Updated 2 years ago
dvruette / gidd-easydel
View on GitHub
☆25Dec 16, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UIC-InDeXLab / RSR
View on GitHub
An Efficient Matrix Multiplication Algorithm for Accelerating Inference in Binary and Ternary Neural Networks
☆17Mar 27, 2026Updated 3 months ago
zeroeightysix / tt-lectures
View on GitHub
Source code for student lectures on dependent type theory.
☆12Jun 9, 2025Updated last year
Kernel-Machines / kermac
View on GitHub
Pytorch routines for (Ker)nel (Mac)hines
☆12Oct 10, 2025Updated 9 months ago
jotaf98 / pytorch-curveball
View on GitHub
A second-order optimizer for deep networks
☆25Oct 31, 2019Updated 6 years ago
dayal-kalra / low-memory-adam
View on GitHub
☆14Mar 2, 2025Updated last year
filteredcophy / FilteredCoPhy
View on GitHub
☆10Nov 17, 2022Updated 3 years ago
yandex-research / tabgraphs
View on GitHub
A benchmark of meaningful graph datasets with tabular node features
☆16Oct 29, 2025Updated 8 months ago
erfanzar / Xerxes-Agents
View on GitHub
Agents for intelligence and coordination
☆26Updated this week
SahinLale / StochasticMirrorDescent
View on GitHub
Stochastic Mirror Descent on CIFAR-10
☆20Jul 3, 2019Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
learning-at-home / collaborative-latent-diffusion
View on GitHub
Collaborative inference of latent diffusion via hivemind
☆12May 29, 2023Updated 3 years ago
gau-nernst / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆14Jan 14, 2026Updated 6 months ago
zs-zhong / D-LADMM
View on GitHub
The code for Differentiable Linearized ADMM (ICML 2019)
☆36Oct 9, 2019Updated 6 years ago
purzelrakete / Pagerank.jl
View on GitHub
Pagerank in Julia. An experiment in pagerank on graphs in the order of billions of edges. Currently tested with over half a billion edges…
☆12Aug 14, 2013Updated 12 years ago
shauli-ravfogel / conformal-prediction
View on GitHub
☆10Feb 2, 2023Updated 3 years ago
HeegyuKim / torch-xla-SPMD
View on GitHub
Pytorch/XLA SPMD Test code in Google TPU
☆23Apr 3, 2024Updated 2 years ago
radiosilence / wire
View on GitHub
[DEFUNCT - do not use, insecure!] Communication for the 21st century activist.
☆16Aug 5, 2025Updated 11 months ago