☆30May 17, 2026Updated last week
Alternatives and similar repositories for awesome-second-order-optimization
Users that are interested in awesome-second-order-optimization are comparing it to the libraries listed below.
- An implementation of PSGD Kron second-order optimizer for PyTorch☆100Jul 24, 2025Updated 10 months ago
- Implementation of PSGD optimizer in JAX☆35Dec 31, 2024Updated last year
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆198May 16, 2026Updated last week
- This project is an implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…☆14Dec 12, 2023Updated 2 years ago
- Official implementation of our NeurIPS 2024 paper "Boundary Matters: A Bi-Level Active Finetuning Method"☆14Feb 11, 2025Updated last year
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆15Nov 4, 2024Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- beat swapping powered by AI☆14Jul 7, 2024Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆27Oct 12, 2024Updated last year
- Triton-based Symmetric Memory operators and examples☆100May 15, 2026Updated last week
- Odometry application of the accurate distance field based on Gaussian Processes☆26Feb 14, 2024Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why☆39Apr 20, 2026Updated last month
- Your fruity companion for transformers☆14May 25, 2022Updated 3 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- ☆13Apr 25, 2024Updated 2 years ago
- KANs and MLPs☆12Jun 7, 2024Updated last year
- ☆18Dec 2, 2024Updated last year
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆19Jun 4, 2025Updated 11 months ago
- ☆14Jun 22, 2025Updated 11 months ago
- ☆16Feb 4, 2025Updated last year
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 10 months ago
- Train a bidirectional or normal LSTM recurrent neural network to generate text on a free GPU using any dataset. Just upload your text fil…☆12Jan 29, 2019Updated 7 years ago
- execute shell commands in the Unity Editor☆11May 12, 2025Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- This package is dedicated to high-order optimization methods. All the methods can be used similarly to standard PyTorch optimizers.☆30Jun 17, 2025Updated 11 months ago
- Bazel defs and rules for building Python projects with nanobind extensions.☆12Mar 12, 2026Updated 2 months ago
- ☆19May 16, 2026Updated last week
- Lego for GRPO☆30May 27, 2025Updated 11 months ago
- ☆13Jul 7, 2025Updated 10 months ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 7 months ago
- PyTorch optimizer based on nonlinear conjugate gradient method☆31Apr 25, 2025Updated last year
- A Unified Approach to Interpreting and Boosting Adversarial Transferability (ICLR2021)☆31Apr 22, 2022Updated 4 years ago
- ☆22Jan 23, 2024Updated 2 years ago
- [NeurIPS 2024 Datasets and Benchmarks Track] Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization Regime☆24Mar 27, 2025Updated last year
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".☆18Mar 13, 2023Updated 3 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆40Updated this week
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF☆11May 20, 2024Updated 2 years ago