shuzhangzhong/HybriMoE-Preview

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shuzhangzhong/HybriMoE-Preview)

shuzhangzhong / HybriMoE-Preview

☆17

Alternatives and similar repositories for HybriMoE-Preview

Users that are interested in HybriMoE-Preview are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UKPLab / arxiv2025-inherent-limits-plms
View on GitHub
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…
☆14Jan 16, 2025Updated last year
EsmaeilNarimissa / aws-sft-grpo-budget-llm-finetune
View on GitHub
☆19May 17, 2025Updated last year
tianyi-lab / C3PO
View on GitHub
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21Apr 9, 2025Updated last year
shenao-zhang / reward-augmented-preference
View on GitHub
The official implementation of Preference Data Reward-Augmentation.
☆18May 1, 2025Updated last year
nick7nlp / FastCuRL
View on GitHub
FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)
☆61Oct 10, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HaroldChen19 / VistaDPO
View on GitHub
[ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
☆42Jun 14, 2025Updated last year
Babelscape / LLM-Oasis
View on GitHub
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…
☆25Oct 15, 2025Updated 9 months ago
TianheL / LM-Implicit-Reasoning
View on GitHub
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆18Mar 11, 2025Updated last year
Quehry / HelloBench
View on GitHub
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
☆60Nov 26, 2024Updated last year
Geaming2002 / Ruler
View on GitHub
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆40Sep 30, 2024Updated last year
jmzhao / pbos
View on GitHub
☆19Oct 10, 2020Updated 5 years ago
mxzheng / TrojViT
View on GitHub
[CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang
☆15Jan 5, 2024Updated 2 years ago
microsoft / MMLU-CF
View on GitHub
A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
☆126May 17, 2025Updated last year
facebookresearch / Qinco
View on GitHub
Residual Quantization with Implicit Neural Codebooks
☆118Jul 17, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Bilkent-CYBORG / VOPy
View on GitHub
A Framework for Black-box Vector Optimization
☆31Mar 16, 2026Updated 4 months ago
AidenGeunGeun / OpencodeOrchestra
View on GitHub
Multi-layer agent orchestration. PM plans, specialists execute.
☆16May 24, 2026Updated 2 months ago
justarter / E2URec
View on GitHub
Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…
☆38Jul 19, 2024Updated 2 years ago
ryota-komatsu / speaker_disentangled_hubert
View on GitHub
Official repository of the IEEE OJSP paper "Speaker-Disentangled Chunk-Wise Regression for Syllabic Tokenization"
☆46Updated this week
Hao840 / ADEM-VL
View on GitHub
PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"
☆21Oct 28, 2024Updated last year
NiuTrans / ForgettingCurve
View on GitHub
A benchmark for testing memorization abilities of LMs
☆24Oct 15, 2024Updated last year
Gengzigang / TokenSet
View on GitHub
Official PyTorch implementation of TokenSet.
☆129Mar 21, 2025Updated last year
AhmedZgaren / Save
View on GitHub
☆33Oct 2, 2025Updated 9 months ago
sail-sg / FlowReasoner
View on GitHub
☆145May 6, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PKU-SEC-Lab / AdapMoE
View on GitHub
Code release for AdapMoE accepted by ICCAD 2024
☆39Apr 28, 2025Updated last year
mwatkins1970 / SAE_Feature_Interpretability_Tool
View on GitHub
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19Oct 4, 2024Updated last year
CMU-SAFARI / transpimlib
View on GitHub
TransPimLib is a library for transcendental (and other hard-to-calculate) functions in general-purpose PIM systems, TransPimLib provides …
☆15Apr 21, 2023Updated 3 years ago
hetailang / SqueezeAttention
View on GitHub
☆37Oct 10, 2024Updated last year
Amshaker / Mobile-VideoGPT
View on GitHub
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model
☆142Aug 6, 2025Updated 11 months ago
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
KFM135 / chiplet-optimizer
View on GitHub
This repository contains the code for this paper: Chiplet-Gym: An RL-based Optimization Framework for Chiplet-based AI Accelerator
☆22Sep 28, 2024Updated last year
kimjy99 / kimjy99.github.io
View on GitHub
MY BLOG
☆16Updated this week
lzhxmu / AccDiffusion_v2
View on GitHub
Code release for AccDiffusionV2 (TPAMI)
☆34Nov 4, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
d6tdev / top10-mistakes-statistics-binder
View on GitHub
☆10Jun 12, 2019Updated 7 years ago
MacavityT / REF-VLM
View on GitHub
☆31Jan 18, 2026Updated 6 months ago
linhaowei1 / kumo
View on GitHub
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
☆20Jun 4, 2025Updated last year
yousei-github / ChampSim-Ramulator
View on GitHub
A simulator integrates ChampSim and Ramulator.
☆23Jul 20, 2026Updated last week
shaochenze / EAR
View on GitHub
☆42May 15, 2025Updated last year
viig99 / muvfde
View on GitHub
Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.
☆20Jun 28, 2025Updated last year
mira-ai-lab / DoG
View on GitHub
☆25Apr 15, 2025Updated last year