Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch
☆29Jan 31, 2026Updated last month
Alternatives and similar repositories for MHMoE
Users that are interested in MHMoE are comparing it to the libraries listed below
Sorting:
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Nov 11, 2024Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆120Jan 31, 2026Updated last month
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- Implementation of the Pairformer model used in AlphaFold 3☆14Feb 23, 2026Updated last week
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Feb 23, 2026Updated last week
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Feb 23, 2026Updated last week
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Feb 6, 2026Updated 3 weeks ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆118Feb 6, 2026Updated 3 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆26Feb 9, 2026Updated 3 weeks ago
- ☆29Oct 9, 2024Updated last year
- Implementation of Liquid Nets in Pytorch☆69Jan 31, 2026Updated last month
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 10 months ago
- GPI-Space: Memory Driven Computing and Big Data☆10Jan 2, 2025Updated last year
- Community Open Source Implementation of GPT4o in PyTorch☆26Feb 9, 2026Updated 3 weeks ago
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations☆14Feb 16, 2026Updated 2 weeks ago
- Self-evaluating RAG application on LangCheck docs☆11Sep 10, 2025Updated 5 months ago
- Repository dedicated to developing a robust and modular framework for Multi-Agent Reinforcement Learning (MARL) algorithms.☆13Mar 3, 2024Updated 2 years ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Implementation of Qformer from BLIP2 in Zeta Lego blocks.☆48Nov 11, 2024Updated last year
- Architecture Visualization Demo with Unreal Engine 4☆13May 11, 2021Updated 4 years ago
- Code and software used to design de novo protein nanomachines. Supplementary material for "Computational design of nanoscale rotational m…☆10Mar 19, 2022Updated 3 years ago
- IonQ iQuHACK 2024 Remote Challenge☆11Feb 3, 2024Updated 2 years ago
- PSI-MOD ontology for modified and unmodified amino acid residues☆14Jan 8, 2026Updated last month
- ☆41Jul 21, 2024Updated last year
- A TypeScript starter template for building web applications with Next.js and Firebase using lerna monorepo.☆13Apr 15, 2023Updated 2 years ago
- Amplify your coding capabilities with AI - your smart co-pilot for an elevated coding experience.☆14Feb 18, 2026Updated 2 weeks ago
- ☆21Updated this week
- 免注册免费使用 ChatGPT,请关注微信公众号【胖竹同学】。☆10Apr 4, 2023Updated 2 years ago
- A NOMAD plugin containing base sections for material processing.☆11Jan 20, 2026Updated last month
- C and C++ to Luau compiler for Roblox.☆10Feb 6, 2024Updated 2 years ago
- gammcor code☆11Sep 25, 2025Updated 5 months ago
- A Python client library for accessing IQM quantum computers☆12Mar 26, 2025Updated 11 months ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- 经过强化的goose3通用网页提取器(添加作者VX: 862187570 , Python交流学习)☆16Nov 18, 2021Updated 4 years ago