huyphan168/PEER

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huyphan168/PEER)

huyphan168 / PEER

Mixture of A Million Experts

☆56

Alternatives and similar repositories for PEER

Users that are interested in PEER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HuyNguyen-hust / hopper-gemm-101
View on GitHub
☆14Dec 22, 2024Updated last year
james-oldfield / MxD
View on GitHub
[NeurIPS'25] Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
☆16May 28, 2025Updated last year
imoneoi / bf16_fused_adam
View on GitHub
BFloat16 Fused Adam Operator for PyTorch
☆20Nov 16, 2024Updated last year
HuyNguyen-hust / flash-attn-101
View on GitHub
☆22Sep 3, 2024Updated last year
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lernapparat / torchhacks
View on GitHub
Hacks for PyTorch
☆19Apr 18, 2023Updated 3 years ago
IBM / selective-dense-state-space-model
View on GitHub
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …
☆16Sep 18, 2025Updated 10 months ago
wj2 / disentangled
View on GitHub
☆12Sep 8, 2025Updated 10 months ago
kyegomez / Mixture-of-Depths
View on GitHub
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆123Updated this week
gdevos010 / Scalable-Softmax
View on GitHub
Unofficial Scalable-Softmax Is Superior for Attention
☆21May 30, 2025Updated last year
zihuanqiu / MINGLE
View on GitHub
The code repository for "MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging"(NeurIPS25) in PyTorc…
☆15Jun 2, 2026Updated last month
tripolskypetr / node-ollama-telegram-agent-swarm
View on GitHub
Multi-agent orchestration for OllamaJS. Includes TTS synthesis and speech recognition for simplified testing. Connected to telegram bot w…
☆13Feb 13, 2025Updated last year
mserdarsanli / elf-explorer
View on GitHub
Something like objdump
☆16Jan 23, 2019Updated 7 years ago
SMSD75 / MoSiC
View on GitHub
This repo contains the official implementation of ICCV 2025 paper "MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised L…
☆22Sep 12, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mayank31398 / ladder-residual-inference
View on GitHub
☆14Jul 13, 2025Updated last year
RobertCsordas / switchhead
View on GitHub
☆16Jun 11, 2025Updated last year
tim-lawson / mlsae
View on GitHub
Multi-Layer Sparse Autoencoders (ICLR 2025)
☆30Feb 6, 2026Updated 5 months ago
hyounghk / CoSIm
View on GitHub
Code and dataset for NAACL 2022 paper "CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination" Hyounghun Kim, Abhay Zala, Mohi…
☆16Nov 26, 2022Updated 3 years ago
thomasahle / cce
View on GitHub
Clustered Compositional Embeddings
☆13Oct 25, 2023Updated 2 years ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆11Dec 30, 2024Updated last year
RobertCsordas / moe_attention
View on GitHub
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
☆101Sep 30, 2024Updated last year
wtong98 / mlp-icl
View on GitHub
☆12Sep 16, 2024Updated last year
Cranial-XIX / longhorn
View on GitHub
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆57Dec 4, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
google-deepmind / exedec
View on GitHub
☆14May 9, 2024Updated 2 years ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆29Sep 4, 2025Updated 10 months ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
RobertCsordas / moe_layer
View on GitHub
sigma-MoE layer
☆21Jan 5, 2024Updated 2 years ago
astramind-ai / Mixture-of-depths
View on GitHub
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆175Jun 20, 2024Updated 2 years ago
yixiaoer / tpu-training-example
View on GitHub
☆16Jul 8, 2024Updated 2 years ago
nhatpd / iADMM
View on GitHub
iADMM for a low-rank representation optimization problem
☆13Feb 5, 2021Updated 5 years ago
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
keirp / return_transforms
View on GitHub
☆20Oct 19, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Bond1995 / Markov
View on GitHub
Code for experiments on transformers using Markovian data.
☆22Nov 22, 2024Updated last year
umich-sota / TF-as-SVM
View on GitHub
☆12Jan 17, 2024Updated 2 years ago
landskape-ai / Progressive-Pruning
View on GitHub
Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)
☆16Nov 22, 2022Updated 3 years ago
ykpku / COTSA
View on GitHub
The implement of "COTSAE: CO-Training of Structure and Attribute Embeddings for Entity Alignment"
☆14Nov 18, 2019Updated 6 years ago
wesg52 / llm-context-neurons
View on GitHub
Find context neurons in Pythia models.
☆13Jun 13, 2023Updated 3 years ago
nobody996 / FastSVC
View on GitHub
Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"
☆21Apr 7, 2021Updated 5 years ago
WOWNICE / ssl-small
View on GitHub
Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".
☆17Dec 15, 2021Updated 4 years ago