(ICLR 2026) Unveiling Super Experts in Mixture-of-Experts Large Language Models
☆41Sep 25, 2025Updated 8 months ago
Alternatives and similar repositories for Super-Experts-Profilling
Users that are interested in Super-Experts-Profilling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆47Apr 22, 2026Updated last month
- What do CLIP Vision Transformers learn? Feature Visualization can show you!☆15Aug 29, 2024Updated last year
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 6 months ago
- DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations (CVPR 2025)☆14Jun 1, 2025Updated 11 months ago
- ☆16Sep 4, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆69Jun 1, 2025Updated 11 months ago
- Official Implementation of UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via Triangles for Particle-Based 3…☆30Jan 13, 2026Updated 4 months ago
- ☆26Jan 5, 2026Updated 4 months ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10May 18, 2026Updated last week
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆69Apr 4, 2026Updated last month
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- ☆27Oct 27, 2025Updated 6 months ago
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 7 months ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆35Sep 12, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- self-adaptive in-context learning☆45May 5, 2023Updated 3 years ago
- ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark☆55Sep 2, 2025Updated 8 months ago
- A lightweight Text-to-Image Retrieval model [Web App]☆29Dec 6, 2024Updated last year
- [EMNLP 2025] RouterLens☆29Sep 15, 2025Updated 8 months ago
- ☆27Apr 14, 2025Updated last year
- LLM KV Cache compression - K+V dual compression, 73-99% VRAM savings, zero accuracy loss☆55Mar 30, 2026Updated last month
- ☆29Mar 16, 2025Updated last year
- ☆13Mar 5, 2024Updated 2 years ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆70Apr 9, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆26May 16, 2026Updated last week
- 手搓Llama,个人学习用☆16May 21, 2024Updated 2 years ago
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆72May 7, 2026Updated 2 weeks ago
- ☆16May 16, 2025Updated last year
- An interactive attention visualization and intervention tool for LLM Decode Stage.☆48Jan 6, 2026Updated 4 months ago
- An AI-powered content conversion tool that transforms text, web content, or HTML code into beautifully designed card images.一款基于AI的内容转换工…☆34Jul 29, 2025Updated 9 months ago
- Large Language Models in Molecular Embeddings☆12May 1, 2024Updated 2 years ago
- DRAM/SSD hybrid caching system☆15Mar 13, 2025Updated last year
- ☆29Feb 27, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆28Jan 27, 2026Updated 3 months ago
- ☆12Mar 26, 2024Updated 2 years ago
- A record of reading list on some MLsys popular topic☆24Mar 20, 2025Updated last year
- A lightweight Inference Engine built for block diffusion models☆46Apr 12, 2026Updated last month
- RISC-V SingleCycle/Pipeline CPU (lab of ZJU Computer System Series)☆16Jul 6, 2023Updated 2 years ago
- 清华大学电子工程系数字逻辑与处理器基础实验大作业——流水线 CPU☆12Aug 8, 2021Updated 4 years ago
- Papers and Related work to help learn ICL conveniently for everyone who interests.☆14Feb 28, 2024Updated 2 years ago