☆29Oct 9, 2024Updated last year
Alternatives and similar repositories for Multi_Head_Mixture_of_Experts__MH-MOE
Users that are interested in Multi_Head_Mixture_of_Experts__MH-MOE are comparing it to the libraries listed below
Sorting:
- ☆19Nov 5, 2024Updated last year
- ☆16Mar 1, 2025Updated last year
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆26Jul 28, 2025Updated 7 months ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆29Jan 31, 2026Updated last month
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆24Sep 9, 2024Updated last year
- benchmarks for LLM tokenizers☆17Feb 27, 2026Updated last week
- Transport means detection using image detection by Yolo. It can count free parking places and transport means in the parking area☆10Oct 6, 2021Updated 4 years ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆80Dec 25, 2024Updated last year
- 免注册免费使用 ChatGPT,请关注微信公众号【胖竹同学】。☆10Apr 4, 2023Updated 2 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 5 years ago
- A simple web-app for generating glassmorphism UI effect!☆12Aug 5, 2023Updated 2 years ago
- Vite + Mantine + Vanilla extract template☆12Updated this week
- ☆22Jun 10, 2025Updated 8 months ago
- the analysis script for PrismNet☆10May 17, 2020Updated 5 years ago
- A gym game for Contra that for reinforcement learning☆10Oct 18, 2021Updated 4 years ago
- Sudoku solver in Golang☆10Sep 6, 2020Updated 5 years ago
- ☆12Aug 1, 2025Updated 7 months ago
- free library for clustering and neuro-fuzzy systems☆10Feb 25, 2026Updated last week
- Unofficial Implementation of Evolutionary Model Merging☆41Mar 28, 2024Updated last year
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- A Modular Pytorch ViTGAN implementation☆12Mar 15, 2022Updated 3 years ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- A rewrite of scambier/markov-strings to utilize a relational SQL database rather than an in-memory object. The goal is to reduce memory u…☆10Mar 1, 2026Updated last week
- Identify and automatically fix issues in shell scripts☆15Nov 24, 2023Updated 2 years ago
- Minimized version of the Orchis server hosted at https://orchis.cherrymint.live☆10Nov 27, 2023Updated 2 years ago
- a minimalistic todo app☆10May 10, 2023Updated 2 years ago
- A toolset and pipeline for running zero shot and supervised protein fitness prediction, drop in compatible with scikitlearn☆13Nov 28, 2025Updated 3 months ago
- Source code of "Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection"☆13Nov 17, 2023Updated 2 years ago
- Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier (Accepted ECCV 2024)☆10May 6, 2025Updated 10 months ago
- Ray Tracer written in Rust☆13Nov 22, 2021Updated 4 years ago
- ☆10Oct 25, 2024Updated last year
- SAM Adaptation using SVD☆12Jul 13, 2025Updated 7 months ago
- JAX implementation of GPTQ quantization algorithm☆10Jul 19, 2023Updated 2 years ago
- Hybrid RT DETR: Hybrid encoder-decoder network for end-to-end object detection in UAV imagery☆14May 22, 2024Updated last year
- ☆16Oct 31, 2025Updated 4 months ago
- Multi-Modal Multi-Task (3MT) Road Segmentation, IEEE RA-L 2023☆15Feb 13, 2024Updated 2 years ago
- DCNv2_torch1.11☆10Sep 27, 2022Updated 3 years ago