[ICLR 2025] AdaFisher: Adaptive Second Order Optimization via Fisher Information
☆51Feb 7, 2025Updated last year
Alternatives and similar repositories for AdaFisher
Users that are interested in AdaFisher are comparing it to the libraries listed below
Sorting:
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆15Nov 4, 2024Updated last year
- Pytorch Tutorial given to IFT6135 Representation Learning Class☆13Jan 22, 2019Updated 7 years ago
- ☆13Jan 15, 2025Updated last year
- Example codes in the medium post titled "Optuna meets Weights and Biases."☆24Aug 11, 2022Updated 3 years ago
- A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…☆19Jan 11, 2025Updated last year
- Image gallery of styles for FluxDev☆20Aug 13, 2024Updated last year
- Open Source 3D Scanner - Open Lab Starter Kit☆19Dec 3, 2025Updated 3 months ago
- Custom node to load Flux2 in INT8 for 2X Speed gains on 30 series cards.☆37Mar 2, 2026Updated 2 weeks ago
- ☆41Mar 13, 2026Updated last week
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆26Feb 18, 2025Updated last year
- ☆10Apr 5, 2024Updated last year
- DFTTest re-implemetation for VapourSynth (CPU, CUDA and HIP)☆19Jan 20, 2026Updated 2 months ago
- ☆10Aug 18, 2016Updated 9 years ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 10 months ago
- ☆13Oct 8, 2021Updated 4 years ago
- An implementation of a Brownian motion using ClojureScript with re-frame and Highcharts☆11Feb 8, 2019Updated 7 years ago
- ☆23Aug 17, 2025Updated 7 months ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Implementation of paper - RepVGG-GELAN: ENHANCED GELAN WITH VGG-STYLE CONVNETS FOR BRAIN TUMOR DETECTION☆10Jul 19, 2025Updated 8 months ago
- This is unofficial repository for Towards Efficient and Scalable Sharpness-Aware Minimization.☆37Apr 15, 2024Updated last year
- [EMNLP'22] Textual Manifold-based Defense Against Natural Language Adversarial Examples☆11Apr 6, 2023Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆71Sep 25, 2024Updated last year
- A MetaMask-fork to support pluggable identity contracts.☆11Dec 30, 2022Updated 3 years ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆21Jan 15, 2026Updated 2 months ago
- ☆16Apr 3, 2024Updated last year
- A fast and robust algorithm for temporal difference learning☆22Updated this week
- Gradient-based Hyperparameter Optimization Over Long Horizons☆14Sep 29, 2021Updated 4 years ago
- a vulkan post processing layer☆35Jun 9, 2025Updated 9 months ago
- Tensorflow implementation of MuZero algorithm☆11Aug 23, 2022Updated 3 years ago
- Agar.io for Continual Reinforcement Learning☆24Jul 24, 2025Updated 7 months ago
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- ☆11Dec 8, 2022Updated 3 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆192Jan 11, 2026Updated 2 months ago
- 🌸 A collection of Vietnamese women who are currently working in the field of Computer Science.☆13Mar 10, 2026Updated last week
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- Wildly unsound and experimental sampling for ComfyUI☆29Aug 9, 2025Updated 7 months ago