The code implementation of Skill-MoE
☆46May 22, 2026Updated last month
Alternatives and similar repositories for Skill-MoE
Users who are interested in Skill-MoE are comparing it to the libraries listed below.
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆74Dec 30, 2025Updated 5 months ago
- ☆13Jan 22, 2025Updated last year
- ☆16Jul 29, 2025Updated 10 months ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆18Jun 7, 2026Updated 3 weeks ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 11 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆36Jun 7, 2026Updated 3 weeks ago
- ☆17Feb 4, 2025Updated last year
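- I am an undergraduate (class of 2021) majoring in Data Science and Big Data Technology at Tongji University. In spring 2023 I spent one semester on exchange at the Department of Computer Science and Information Engineering, College of Electrical Engineering and Computer Science, National Taiwan University. These are my assignments for the OS course I took there.☆11Jun 15, 2023Updated 3 years ago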
- ☆20Mar 18, 2026Updated 3 months ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆21Sep 16, 2025Updated 9 months ago
- Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models☆148Mar 24, 2026Updated 3 months ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆21Jul 16, 2023Updated 2 years ago
- ☆35May 16, 2025Updated last year
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- ☆11Feb 16, 2024Updated 2 years ago
- ☆12Oct 24, 2024Updated last year
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆10Jul 7, 2025Updated 11 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"☆51Apr 7, 2026Updated 2 months ago
- Fork to work with local LLMs (LM Studio)☆20Jun 18, 2024Updated 2 years ago
- Code for "Reasoning to Learn from Latent Thoughts"☆131Mar 28, 2025Updated last year
- CAI Advanced Processing Service is a collection of modules, wrapped in several APIs that help you to enrich your conversational AI applic…☆18Feb 16, 2024Updated 2 years ago
- Simple python rasterizer tool implemented by OpenGL and C++☆15Nov 10, 2025Updated 7 months ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆13Dec 17, 2023Updated 2 years ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆29Dec 16, 2024Updated last year
- ☆17Nov 20, 2024Updated last year
- Open-source implementations on real robots☆35Nov 25, 2024Updated last year
- ☆47Apr 7, 2026Updated 2 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆46Jul 1, 2025Updated 11 months ago
- Working with images in frequency space☆10Nov 5, 2020Updated 5 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆11Jul 1, 2024Updated last year
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆14Aug 8, 2025Updated 10 months ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆20Oct 24, 2024Updated last year
- ☆32Jun 5, 2025Updated last year
- ☆11Aug 26, 2024Updated last year
- [NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming"☆12Dec 20, 2024Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆18Dec 22, 2025Updated 6 months ago