Don't just regulate gradients like in Muon, regulate the weights too
☆32Jul 30, 2025Updated 7 months ago
Alternatives and similar repositories for lipschitz-transformers
Users that are interested in lipschitz-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of the paper "What Makes for a Good Stereoscopic Image" CVPRW 2025☆19May 27, 2025Updated 9 months ago
- Auto math prover.☆11Jul 10, 2024Updated last year
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆12Jan 17, 2023Updated 3 years ago
- [NeurIPS 2023] and [ICLR 2024] for robustness certification.☆10Nov 30, 2024Updated last year
- ☆10Oct 27, 2023Updated 2 years ago
- Code for Spectral Norm of Convolutional Layers with Circular and Zero Paddings and Efficient Bound of Lipschitz Constant for Convolutiona…☆15Feb 2, 2024Updated 2 years ago
- On Lipschitz Regularization of Convolutional Layers using Toeplitz Matrix Theory☆10Aug 19, 2021Updated 4 years ago
- ☆13Jul 2, 2024Updated last year
- ☆15Mar 14, 2020Updated 6 years ago
- ☆17Nov 18, 2025Updated 4 months ago
- The official code of Multi-player Nash Preference Optimization [ICLR 2026]☆33Feb 4, 2026Updated last month
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆28May 3, 2025Updated 10 months ago
- PyTorch Implementation of Variance Reduced Optimization Algorithms -- SARAH and SVRG.☆15Jul 11, 2021Updated 4 years ago
- ☆26Jul 3, 2025Updated 8 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 3 years ago
- ☆15Jul 13, 2025Updated 8 months ago
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆13Jan 31, 2024Updated 2 years ago
- Code to supplement the FIGRIM (FIne-GRained Image Memorability) project and dataset.☆13Apr 6, 2016Updated 9 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- A web app for sharing, editing, and commenting on kifus (game records for the board game Go)☆10Jan 22, 2019Updated 7 years ago
- Official implementation of the NeurIPS 2023 paper: "Uncertainty Quantification via Neural Posterior Principal Components"☆13Jun 18, 2024Updated last year
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆14Nov 11, 2023Updated 2 years ago
- A Zen approach to configuring your Python project☆16Feb 27, 2026Updated 3 weeks ago
- ☆12Sep 16, 2024Updated last year
- Fast extremal eigensolvers for PyTorch.☆17Jun 4, 2021Updated 4 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Feb 19, 2026Updated last month
- 📄Small Batch Size Training for Language Models☆81Updated this week
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆17Oct 8, 2024Updated last year
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 9 months ago
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆20Nov 17, 2025Updated 4 months ago
- This repo contains the code to reproduce our results in CVPR21 Challenge on Agriculture-Vision.☆11Jan 3, 2022Updated 4 years ago
- A remote Scala code evaluator☆14May 16, 2023Updated 2 years ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆20May 11, 2024Updated last year
- ☆12Jan 17, 2024Updated 2 years ago