Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch
☆29Mar 22, 2026Updated 3 weeks ago
Alternatives and similar repositories for MHMoE
Users that are interested in MHMoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Nov 11, 2024Updated last year
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Apr 6, 2026Updated last week
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Apr 6, 2026Updated last week
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Mar 20, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Mar 20, 2026Updated 3 weeks ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Oct 20, 2024Updated last year
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- a simplified version of Google's Gemma model to be used for learning☆26Mar 2, 2024Updated 2 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Implementaion RNN tranceducer☆23Jun 25, 2019Updated 6 years ago
- A curated collection of prompts for Grok Imagine by xAI☆26Oct 19, 2025Updated 5 months ago
- [NeurIPS D&B'24]Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection☆22Mar 25, 2026Updated 2 weeks ago
- ☆29Oct 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆36Jun 7, 2024Updated last year
- The official Languini Kitchen repository☆14May 6, 2024Updated last year
- Community Open Source Implementation of GPT4o in PyTorch☆26Mar 30, 2026Updated 2 weeks ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆27Mar 30, 2026Updated 2 weeks ago
- Code for IJCAI 2023 paper 'SLViT: Scale-Wise Language-Guided Vision Transformer for Referring Image Segmentation'☆11May 28, 2023Updated 2 years ago
- Tool to generate documentation for Nelua source files.☆10Dec 11, 2021Updated 4 years ago
- Train toy models using multi-token prediction objective☆14May 8, 2024Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 11 months ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Liquid Nets in Pytorch☆70Mar 22, 2026Updated 3 weeks ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- Implementation of the premier Text to Video model from OpenAI☆57Nov 11, 2024Updated last year
- ☆11Sep 18, 2023Updated 2 years ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- ☆12Jul 11, 2024Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- ☆14Aug 9, 2021Updated 4 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Sep 23, 2022Updated 3 years ago
- <综合> Funasr语音识别,调用Qwen大模型回答,通过GPTSovits输出语音的ai程序,其中调用模型还是在线,后续将添加离线大模型☆13Nov 30, 2024Updated last year
- ASR_LLM_TTS前端项目☆15Dec 3, 2024Updated last year
- ☆14Jan 22, 2025Updated last year
- Lua bindings for Chipmunk2D☆13Apr 8, 2023Updated 3 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- A simple WebAssembly Linker in JavaScript☆17Jun 15, 2021Updated 4 years ago