Official Implementation for NorMuon paper
☆62Mar 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for NorMuon
Users that are interested in NorMuon are comparing it to the libraries listed below.
- ☆19Feb 2, 2026Updated last month
- This is a simple torch implementation of high-performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that go beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 7 months ago
- FDFO: Finite Difference Flow Optimization☆55Mar 16, 2026Updated last week
- An assembler for the Microchip PIC instruction set, written in Swift.☆14May 3, 2021Updated 4 years ago
- SKT A.X LLM 3.1☆13Jul 24, 2025Updated 8 months ago
- ☆27Mar 10, 2026Updated 2 weeks ago
- Training tiny models to prove hard theorems☆64Mar 5, 2026Updated 3 weeks ago
- A lightweight, single header OpenGL engine.☆15Sep 6, 2025Updated 6 months ago
- Zero Allocation WASM☆58Feb 18, 2026Updated last month
- Website using FinBERT + live financial news scraping to assess short-term investment potential.☆14Mar 7, 2025Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆64Dec 10, 2025Updated 3 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆79Oct 11, 2025Updated 5 months ago
- Rigid-body physics engine in C++☆21Sep 27, 2025Updated 6 months ago
- ☆13Nov 18, 2025Updated 4 months ago
- Flash-Muon: An Efficient Implementation of Muon Optimizer☆247Jun 15, 2025Updated 9 months ago
- ☆17Dec 9, 2024Updated last year
- Flutter Client for the stability.ai GRPC protocol, should be compatible with grpc.stability.ai and hafriedlander/stable-diffusion-grpcser…☆14Oct 17, 2022Updated 3 years ago
- Scrapes novels published on the web and converts them to Aozora Bunko format text☆19Feb 9, 2017Updated 9 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated last month
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Analyzing partial dimensional collapse in non-contrastive self-supervised learning. "Understanding Collapse in Non-Contrastive Siamese Re…☆16Nov 12, 2023Updated 2 years ago
- ☆10Aug 18, 2016Updated 9 years ago
- Result builders for Swift and Foundation types☆24Mar 4, 2026Updated 3 weeks ago
- An implementation of AlphaZero, trained to master Tic-Tac-Toe and Four in a row☆26Dec 8, 2022Updated 3 years ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆49Mar 16, 2026Updated last week
- ☆13Oct 8, 2021Updated 4 years ago
- API server for VibeVoice☆27Sep 28, 2025Updated 5 months ago
- ☆19Aug 23, 2025Updated 7 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- ☆57Feb 24, 2026Updated last month
- A few models converted from Caffe to Core ML format.☆15Jun 6, 2017Updated 8 years ago
- Tiny Llama model trained to play chess☆29Jul 22, 2025Updated 8 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆32Updated this week
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago
- ☆49Sep 8, 2025Updated 6 months ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- [CVPR 2025] Official Implementation of LOCORE: Image Re-ranking with Long-Context☆15Apr 15, 2025Updated 11 months ago