Community implementation of the paper: "Multi-Head Mixture-of-Experts" in PyTorch
☆31May 11, 2026Updated last week
Alternatives and similar repositories for MHMoE
Users who are interested in MHMoE are comparing it to the libraries listed below.
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆15Nov 11, 2024Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆127May 12, 2026Updated last week
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆31Updated this week
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119May 11, 2026Updated last week
- Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT☆17May 11, 2026Updated last week
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Oct 20, 2024Updated last year
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- a simplified version of Google's Gemma model to be used for learning☆26Mar 2, 2024Updated 2 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆21Jun 29, 2024Updated last year
- Implementation of RNN transducer☆23Jun 25, 2019Updated 6 years ago
- A curated collection of prompts for Grok Imagine by xAI☆29Oct 19, 2025Updated 7 months ago
- ☆29Oct 9, 2024Updated last year
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆24Mar 25, 2026Updated last month
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆28Updated this week
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Per function, Lua JIT using LLVM C++ toolchain☆10Jun 8, 2017Updated 8 years ago
- Tool to generate documentation for Nelua source files.☆10Dec 11, 2021Updated 4 years ago
- Train toy models using multi-token prediction objective☆14Apr 18, 2026Updated last month
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- Compute WER and SER for speech recognition evaluation☆26Mar 18, 2026Updated 2 months ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆18Jun 27, 2025Updated 10 months ago
- Implementation of Liquid Nets in Pytorch☆71May 12, 2026Updated last week
- a WIP architecture designed to allow transformers to think in a manner without tokens☆20Apr 12, 2024Updated 2 years ago
- ☆12Dec 14, 2024Updated last year
- This repository contains the code for UNETR: Transformers for 3D Medical Image Segmentation [1]. UNETR is the first 3D segmentation netwo…☆15Jul 8, 2022Updated 3 years ago
- A simple API version of FunASR speech-to-text (funasr + fastapi), easy to deploy on a server☆13Aug 10, 2024Updated last year
- A simple and concise templating engine that takes advantage of elegant Lua syntax.☆11Nov 25, 2023Updated 2 years ago
- ☆11Dec 24, 2024Updated last year
- Focus handling and navigation library with React integration. This is a read-only mirror.☆15Dec 19, 2024Updated last year
- Wind visualization over time☆101Oct 23, 2025Updated 7 months ago
- Implementation of the premier Text to Video model from OpenAI☆57Nov 11, 2024Updated last year
- This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Feb 12, 2026Updated 3 months ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- ☆12Jul 11, 2024Updated last year
- Exploration into the Firefly algorithm in Pytorch☆41Feb 14, 2025Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- ☆14Aug 9, 2021Updated 4 years ago
- A short-duration online speech recognition service based on wenet☆11Feb 25, 2023Updated 3 years ago