Official Implementation for NorMuon paper
☆68Apr 30, 2026Updated last week
Alternatives and similar repositories for NorMuon
Users who are interested in NorMuon are comparing it to the libraries listed below.
- ☆20Feb 2, 2026Updated 3 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 8 months ago
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆28Feb 16, 2024Updated 2 years ago
- ☆29Mar 10, 2026Updated last month
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆86Sep 5, 2025Updated 8 months ago
- a script to add replay controls on a grafana dashboard☆22May 28, 2024Updated last year
- Flutter Client for the stability.ai GRPC protocol, should be compatible with grpc.stability.ai and hafriedlander/stable-diffusion-grpcser…☆14Oct 17, 2022Updated 3 years ago
- FDFO: Finite Difference Flow Optimization☆97Apr 27, 2026Updated last week
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 9 months ago
- ☆12Aug 22, 2025Updated 8 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Digitizing Paper ECGs at Scale: An Open-Source Algorithm for Clinical Research☆81Jan 15, 2026Updated 3 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated 2 months ago
- ☆10Aug 18, 2016Updated 9 years ago
- An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row☆28Dec 8, 2022Updated 3 years ago
- ☆13Oct 8, 2021Updated 4 years ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆38Feb 20, 2026Updated 2 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆164Mar 10, 2026Updated last month
- ☆13Jan 14, 2026Updated 3 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Weird autoencoder experiments☆24Apr 24, 2026Updated last week
- Implementation of Pluribus by Noam Brown & Tuomas Sandholm, introduced in the paper "Superhuman AI for multiplayer poker".☆21May 15, 2022Updated 3 years ago
- ☆49Sep 8, 2025Updated 7 months ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- Simple local all-in-one install for IDEA2.ART☆26Jan 8, 2023Updated 3 years ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆55Mar 16, 2026Updated last month
- Combining SOAP and MUON☆20Feb 11, 2025Updated last year
- Official implementation of Categorical Flow Maps on text.☆56Feb 16, 2026Updated 2 months ago
- [CVPR 2025] Official Implementation of LOCORE: Image Re-ranking with Long-Context☆16Apr 23, 2026Updated 2 weeks ago
- An implementation of Tiny Recursive Models (TRM)☆118Mar 30, 2026Updated last month
- ☆40Feb 14, 2026Updated 2 months ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- ☆21Dec 9, 2025Updated 4 months ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 5 months ago
- A system for automating selection and optimization of pre-trained models from the TAO Model Zoo☆30Jun 28, 2024Updated last year
- ☆15Mar 2, 2025Updated last year