β20Nov 5, 2024Updated last year
Alternatives and similar repositories for MH-MoE
Users that are interested in MH-MoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25 π SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expertβ¦β16Feb 4, 2025Updated last year
- β18Mar 2, 2026Updated 3 months ago
- β70Dec 2, 2024Updated last year
- β13Feb 17, 2025Updated last year
- [ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.β115Dec 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Mixture of Lora Expertsβ11Apr 7, 2024Updated 2 years ago
- β11Nov 9, 2022Updated 3 years ago
- β18Nov 25, 2024Updated last year
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mesβ¦β10May 9, 2024Updated 2 years ago
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly traininβ¦β21Nov 6, 2024Updated last year
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videosβ11Apr 26, 2026Updated last month
- Python interface to cd-hitβ10Feb 26, 2019Updated 7 years ago
- The code of SKSβ15Mar 22, 2022Updated 4 years ago
- β14Mar 6, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Public version for DistPepFoldβ10Jul 17, 2025Updated 11 months ago
- An example of DyNet autobatching for the NIPS "how to code a paper" workshopβ12Dec 9, 2017Updated 8 years ago
- Code and data for the paper: On the Reliability of Psychological Scales on Large Language Modelsβ30Dec 15, 2025Updated 6 months ago
- This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiβ¦β42Nov 9, 2025Updated 7 months ago
- AdaMoLE: Adaptive Mixture of LoRA Expertsβ38Oct 11, 2024Updated last year
- β28May 26, 2026Updated 3 weeks ago
- β13Aug 20, 2021Updated 4 years ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"β72Aug 22, 2023Updated 2 years ago
- β11Jun 4, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The SimSite3D Software tools are designed to quickly search a database of three dimensional structures, in Protein Data Bank format, withβ¦β11Oct 18, 2018Updated 7 years ago
- Official Implementation of "Simulating Environments with Reasoning Models for Agent Training"β65Feb 18, 2026Updated 4 months ago
- Mixture of Attention Headsβ53Oct 10, 2022Updated 3 years ago
- β20Mar 12, 2025Updated last year
- A toolset and pipeline for running zero shot and supervised protein fitness prediction, drop in compatible with scikitlearnβ13Jun 11, 2026Updated last week
- Convolutional variational autoencoders and text-question, emoji-answer modelsβ11Jun 19, 2017Updated 8 years ago
- β16Mar 1, 2025Updated last year
- Converts AlphaFold distograms into distance matrices and saves them into a number of formatsβ15Dec 13, 2022Updated 3 years ago
- β22Feb 29, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- β44May 19, 2026Updated 3 weeks ago
- β27Nov 20, 2023Updated 2 years ago
- TUnA: Transformer-based Uncertainty Aware model for PPI Predictionβ17Dec 21, 2025Updated 5 months ago
- A Winograd Minimal Filter Implementation in CUDAβ30Aug 25, 2021Updated 4 years ago
- Efficiently apply modification functions to RLDS/TFDS datasets.β32Jun 19, 2024Updated last year
- A template for running Stable Diffusion 3 with Cogβ14Aug 20, 2024Updated last year
- ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmarkβ60Sep 2, 2025Updated 9 months ago