dinobby/Skill-MoE

The code implementation of Skill-MoE

☆46

Alternatives and similar repositories for Skill-MoE

Users who are interested in Skill-MoE are comparing it to the repositories listed below.

Elvin-Yiming-Du / Memory-T1
This repository is used for the time reasoning task in multi-session dialogue systems.
☆16 · Feb 7, 2026 · Updated 5 months ago
Linzwcs / AFT
☆13 · Jan 22, 2025 · Updated last year
RLHFlow / GVM
☆16 · Jul 29, 2025 · Updated 11 months ago
HazyResearch / smoothie
☆15 · Dec 10, 2024 · Updated last year
TsinghuaC3I / FS-GEN
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13 · Nov 19, 2024 · Updated last year
GAIR-NLP / LIMOPro
☆15 · May 27, 2025 · Updated last year
Ziyang412 / Video-RTS
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24 · Feb 18, 2026 · Updated 5 months ago
UNITES-Lab / Mew
[ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"
☆17 · Jul 27, 2024 · Updated last year
jialuli-luka / Video-MSG
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28 · Apr 14, 2025 · Updated last year
wenzhe-li / Self-MoA
☆17 · Feb 4, 2025 · Updated last year
Yui010206 / Adaptive-Visual-Imagination-Control
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18 · Jun 2, 2026 · Updated last month
zyfzs0 / NTU_OS2023
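I am an undergraduate in the class of 2021 majoring in Data Science and Big Data Technology at Tongji University. In spring 2023 I spent an exchange semester in the Department of Computer Science and Information Engineering, College of Electrical Engineering and Computer Science, National Taiwan University. This repository contains my assignments for the OS course.
☆11 · Jun 15, 2023 · Updated 3 years ago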
ulab-uiuc / GraphRouter
[ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You
☆74 · Dec 30, 2025 · Updated 6 months ago
guyuntian / CoT_benchmark
Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"
☆21 · Jul 16, 2023 · Updated 3 years ago
yanxue7 / E3T-Overcooked
☆15 · May 4, 2024 · Updated 2 years ago
Fzkuji / swat-attention
🚀 Sliding Window Attention Training for Efficient Large Language Models
☆19 · Jun 7, 2026 · Updated last month
erik-sv / MoA
Fork to work with local LLMs (LM Studio)
☆20 · Jun 18, 2024 · Updated 2 years ago
sunblaze-ucb / reasoning_ladder
☆35 · May 16, 2025 · Updated last year
BaichuanSEED / BaichuanSEED.github.io
Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…
☆18 · Aug 28, 2024 · Updated last year
ryoungj / BoLT
Code for "Reasoning to Learn from Latent Thoughts"
☆134 · Mar 28, 2025 · Updated last year
daeunni / VideoRepair
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52 · Apr 7, 2026 · Updated 3 months ago
Illyasville / ExpertTokenRouting
☆11 · Feb 16, 2024 · Updated 2 years ago
zhourunlong / Reflect-RL
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
☆18 · Jul 19, 2025 · Updated last year
scalable-model-editing / unified-model-editing
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.
☆29 · Dec 16, 2024 · Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36 · Sep 24, 2024 · Updated last year
MingLiiii / Gradient_Unified
How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients
☆20 · Jun 17, 2025 · Updated last year
Arvid-pku / ATOKE
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆13 · Dec 17, 2023 · Updated 2 years ago
feiyang-k / AutoScale
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14 · Aug 8, 2025 · Updated 11 months ago
kaist-silab / meta-sage
[ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…
☆10 · Dec 19, 2023 · Updated 2 years ago
gyhdog99 / MoCLE
MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)
☆46 · Jul 1, 2025 · Updated last year
junchenzhi / Awesome-LLM-Ensemble
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
☆251 · Updated this week
nishadsinghi / sc-genrm-scaling
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15 · Oct 31, 2025 · Updated 8 months ago
ulab-uiuc / GraphEval
[ICLR 2025] "GraphEval: A Lightweight Graph-Based LLM Framework for Idea Evaluation", Tao Feng, Yihang Sun, Jiaxuan You
☆18 · Mar 18, 2025 · Updated last year
tmlr-group / BayesianLM
[NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"
☆12 · Dec 20, 2024 · Updated last year
JerryYin777 / Jerry_CV
☆13 · Jan 21, 2024 · Updated 2 years ago
jarcms / wechat-smdc
[Video tutorial] WeChat mini program for scan-to-order food ordering
☆12 · May 26, 2025 · Updated last year
OSU-BMBL / scGNN2.0
☆12 · Dec 4, 2023 · Updated 2 years ago
tmlr-group / SCT
[NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"
☆13 · Oct 28, 2024 · Updated last year
klauscc / DAM
Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning
☆15 · Apr 25, 2024 · Updated 2 years ago