fKunstner / noise-sgd-adam-signView external linksLinks
☆16Apr 26, 2023Updated 2 years ago
Alternatives and similar repositories for noise-sgd-adam-sign
Users that are interested in noise-sgd-adam-sign are comparing it to the libraries listed below
Sorting:
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…☆17Jul 19, 2023Updated 2 years ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆15Nov 4, 2024Updated last year
- ☆19Jun 10, 2024Updated last year
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆24Nov 4, 2024Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Sep 4, 2023Updated 2 years ago
- Open source code for EigenGame.☆34May 15, 2023Updated 2 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- PyTorch implementation of the Hessian-free optimizer☆36Jun 14, 2024Updated last year
- ☆11Jan 9, 2024Updated 2 years ago
- LLA is a PyTorch library that allows to visualize and analyze loss landscapes of neural networks.☆13Dec 9, 2025Updated 2 months ago
- Sample application for Android lock screen☆10Dec 29, 2014Updated 11 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- Embedding language models in probability space via log-likelihood vectors☆16Oct 25, 2025Updated 3 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Jul 17, 2024Updated last year
- A Wadler–Lindig pretty printer for Python☆44Jan 16, 2026Updated last month
- Laplace Redux -- Effortless Bayesian Deep Learning☆44Jun 6, 2025Updated 8 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆196Mar 4, 2024Updated last year
- Posterior Refinement Improves Sample Efficiency in Bayesian Neural Networks☆10Oct 21, 2022Updated 3 years ago
- Convolutional Sparse Coding☆10Jul 18, 2014Updated 11 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- Jupyter notebooks from our weekly (or so) hackathons☆11Dec 3, 2024Updated last year
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- Study for Instant neural graphics primitives (Unofficial)☆11Jan 18, 2022Updated 4 years ago
- ☆10Aug 26, 2022Updated 3 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- ACL24☆11Jun 7, 2024Updated last year
- Accelerating Transfer Learning with Robust Neural Nets☆11Oct 2, 2020Updated 5 years ago
- The code generates preferential attachment networks with homophily and two groups.☆13Apr 4, 2018Updated 7 years ago
- ☆20Feb 3, 2025Updated last year
- An unofficial jax/haiku implementation of Crystal Graph Convolutional Neural Networks (CGCNN)☆10Dec 17, 2022Updated 3 years ago
- ☆11Mar 25, 2021Updated 4 years ago
- This repository contains the code used in a publication 'Active Learning for Decision-Making from Imbalanced Observational Data', Iiris S…☆11May 14, 2019Updated 6 years ago
- The official implementation of the paper DADF for industrial VAD☆12Dec 1, 2023Updated 2 years ago
- ☆10Oct 16, 2017Updated 8 years ago
- ☆47Aug 15, 2019Updated 6 years ago
- Composable kernels for scikit-learn implemented in JAX.☆47Oct 26, 2020Updated 5 years ago
- ☆50Oct 22, 2020Updated 5 years ago
- A Multiscene RGB-Hyperspectral Benchmark Dataset of Printed Circuit Boards☆13May 21, 2025Updated 8 months ago