shuzhangzhong / HybriMoE-PreviewLinks
☆16Updated 3 months ago
Alternatives and similar repositories for HybriMoE-Preview
Users that are interested in HybriMoE-Preview are comparing it to the libraries listed below
Sorting:
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆36Updated 3 months ago
- ☆19Updated 2 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆59Updated last year
- ☆37Updated 9 months ago
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 2 months ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Updated 6 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆45Updated 5 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆22Updated 8 months ago
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated last year
- ☆23Updated last month
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆22Updated 7 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆73Updated 2 weeks ago
- CS194-196 Course Project☆15Updated 4 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆19Updated 3 weeks ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆24Updated last month
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆16Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 2 months ago
- ☆13Updated 8 months ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts☆15Updated 4 months ago
- A benchmark for testing memorization abilities of LMs☆20Updated 9 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated last week
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆33Updated 4 months ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]☆114Updated 5 months ago
- ☆21Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆71Updated 4 months ago
- XmodelLM☆39Updated 7 months ago
- ☆36Updated last month
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆52Updated last month
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆49Updated last month
- ☆36Updated 2 months ago