Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch
☆30Apr 13, 2026Updated 3 weeks ago
Alternatives and similar repositories for MHMoE
Users that are interested in MHMoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆127Apr 13, 2026Updated 2 weeks ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆16Apr 13, 2026Updated 2 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆30Apr 13, 2026Updated 2 weeks ago
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Apr 13, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Apr 13, 2026Updated 3 weeks ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆14Oct 20, 2024Updated last year
- a simplified version of Google's Gemma model to be used for learning☆26Mar 2, 2024Updated 2 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Implementaion RNN tranceducer☆23Jun 25, 2019Updated 6 years ago
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆36Jun 7, 2024Updated last year
- Community Open Source Implementation of GPT4o in PyTorch☆29Apr 20, 2026Updated 2 weeks ago
- The official Languini Kitchen repository☆14May 6, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆24Mar 25, 2026Updated last month
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆28Apr 20, 2026Updated last week
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Per function, Lua JIT using LLVM C++ toolchain☆10Jun 8, 2017Updated 8 years ago
- Tool to generate documentation for Nelua source files.☆10Dec 11, 2021Updated 4 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 11 months ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- Implementation of Liquid Nets in Pytorch☆71Apr 13, 2026Updated 2 weeks ago
- ☆12Dec 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 11 months ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- Multi-modal approach for tumor segmentation and survival prediction using PET/CT imaging with attention mechanisms (MICCAI2021 HECKTOR Ch…☆12Apr 22, 2022Updated 4 years ago
- ☆11Dec 24, 2024Updated last year
- Implementation of the premier Text to Video model from OpenAI☆57Nov 11, 2024Updated last year
- ☆11Sep 18, 2023Updated 2 years ago
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆25Feb 12, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated 11 months ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- ☆14Jan 22, 2025Updated last year
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago