(ICLR 2026) Unveiling Super Experts in Mixture-of-Experts Large Language Models
☆40Sep 25, 2025Updated 7 months ago
Alternatives and similar repositories for Super-Experts-Profilling
Users that are interested in Super-Experts-Profilling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆45Apr 22, 2026Updated 2 weeks ago
- [AAAI 2026] This is the official implementation of the paper "ExtendAttack: Attacking Servers of LRMs via Extending Reasoning".☆22Mar 18, 2026Updated last month
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆25Nov 13, 2025Updated 5 months ago
- ☆16Sep 4, 2025Updated 8 months ago
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆33Sep 12, 2025Updated 7 months ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆69Apr 4, 2026Updated last month
- Implement of Implicit Knowledge Extraction Attack.☆23Apr 17, 2026Updated 2 weeks ago
- [NeurIPS 2025] Official Implementation for "Glocal Information Bottleneck for Time Series Imputation"☆14Nov 4, 2025Updated 6 months ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆38Feb 22, 2025Updated last year
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 7 months ago
- ☆26Oct 27, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Focused Papers, Delivered Simply :)☆55Dec 25, 2025Updated 4 months ago
- 📚 LaTeX templates and tools for creating beautiful, structured documents 📝☆14Oct 24, 2025Updated 6 months ago
- ☆12Dec 13, 2022Updated 3 years ago
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆25Dec 1, 2025Updated 5 months ago
- ☆27Apr 14, 2025Updated last year
- ☆28Mar 16, 2025Updated last year
- 😎 All your need for future is FollowGPT.☆13Nov 8, 2023Updated 2 years ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆69Apr 9, 2026Updated 3 weeks ago
- 手搓Llama,个人学习用☆16May 21, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16May 16, 2025Updated 11 months ago
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆66Apr 8, 2026Updated 3 weeks ago
- An interactive attention visualization and intervention tool for LLM Decode Stage.☆47Jan 6, 2026Updated 4 months ago
- ☆12Jun 11, 2021Updated 4 years ago
- Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Sp…☆21Mar 7, 2024Updated 2 years ago
- Large Language Models in Molecular Embeddings☆12May 1, 2024Updated 2 years ago
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- Self implementation of course projects for Computer Architecture 2022 Spring☆11Sep 17, 2022Updated 3 years ago
- DRAM/SSD hybrid caching system☆15Mar 13, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆20Jan 25, 2026Updated 3 months ago
- this repo is mnbvc text quality classification using fastText☆16Oct 2, 2023Updated 2 years ago
- ☆47Jun 24, 2025Updated 10 months ago
- RISC-V SingleCycle/Pipeline CPU (lab of ZJU Computer System Series)☆16Jul 6, 2023Updated 2 years ago
- 清华大学电子工程系数字逻辑与处理器基础实验大作业——流水线 CPU☆12Aug 8, 2021Updated 4 years ago
- Papers and Related work to help learn ICL conveniently for everyone who interests.☆14Feb 28, 2024Updated 2 years ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Nov 11, 2025Updated 5 months ago