Don't just regulate gradients like in Muon, regulate the weights too
☆31Jul 30, 2025Updated 7 months ago
Alternatives and similar repositories for lipschitz-transformers
Users who are interested in lipschitz-transformers are comparing it to the repositories listed below
- ☆17Nov 18, 2025Updated 3 months ago
- Official implementation of the paper "What Makes for a Good Stereoscopic Image" CVPRW 2025☆17May 27, 2025Updated 9 months ago
- ☆25Jul 3, 2025Updated 8 months ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆28May 3, 2025Updated 10 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated 2 weeks ago
- Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)☆11Aug 11, 2025Updated 6 months ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- ☆33Dec 10, 2025Updated 2 months ago
- ☆11Aug 3, 2023Updated 2 years ago
- ☆10Apr 12, 2025Updated 10 months ago
- 📄🕸️ Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora☆10May 25, 2022Updated 3 years ago
- UB ImGui (Unity Better ImGui) is a binding of "Dear ImGui" for Unity. It allows you to create interfaces by code, in immediate mode, in a…☆15Sep 10, 2025Updated 5 months ago
- ETL project to download and process both CME open interest data, COT data from the CFTC and NAV/shares-outstanding data from various ETF …☆12Jul 13, 2021Updated 4 years ago
- 2D Gaussian Splatting implemented in Triton☆11Mar 19, 2024Updated last year
- Code for the Towards Data Science article on prompt-loss-weight☆11Jun 4, 2025Updated 9 months ago
- ☆10Oct 27, 2023Updated 2 years ago
- DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum☆29Dec 2, 2025Updated 3 months ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- Create beautiful, stylized grass with complete control over blade shape, distribution, and appearance. Perfect for games with stylized ae…☆18Nov 25, 2024Updated last year
- Example of applying a genetic algorithm to evolve kart navigation.☆11Nov 21, 2019Updated 6 years ago
- Auto math prover.☆11Jul 10, 2024Updated last year
- ☆13Feb 10, 2021Updated 5 years ago
- A web app for sharing, editing, and commenting on kifus (game records for the board game Go)☆10Jan 22, 2019Updated 7 years ago
- ☆10Mar 31, 2022Updated 3 years ago
- External Radar Cheat for Counter-Strike: Source on Linux☆10Jun 13, 2013Updated 12 years ago
- This repo contains the code to reproduce our results in CVPR21 Challenge on Agriculture-Vision.☆10Jan 3, 2022Updated 4 years ago
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- [CVPR2024] OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos☆17May 29, 2024Updated last year
- Ruby ORM for HBase - NOTE: I haven't maintained this in years.☆60Sep 27, 2013Updated 12 years ago
- [ICML 2024] Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoning☆11Jun 1, 2024Updated last year
- ☆15Nov 9, 2024Updated last year
- Conditional Linear Dynamical Systems☆15Oct 7, 2025Updated 4 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 10 months ago
- A library for creating story-driven, clustered load-testing packages through a readable API.☆30May 20, 2010Updated 15 years ago
- Code and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- [BMVC 2024 Oral] PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images☆13Oct 25, 2025Updated 4 months ago
- A visual, module-based, gracefully degrading "job expression" generator for OpenFn☆12Oct 5, 2015Updated 10 years ago
- A Zen approach to configuring your Python project☆15Updated this week
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago