Memory optimized Mixture of Experts
☆73Jul 25, 2025Updated 7 months ago
Alternatives and similar repositories for MoMoE-impl
Users that are interested in MoMoE-impl are comparing it to the libraries listed below
Sorting:
- Vortex: A Flexible and Efficient Sparse Attention Framework☆48Jan 21, 2026Updated last month
- ☆15Aug 19, 2025Updated 6 months ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆46Jan 30, 2026Updated last month
- Engine for collecting, uploading, and downloading model activations☆26Apr 2, 2025Updated 11 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Feb 24, 2026Updated last week
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆27Apr 4, 2025Updated 10 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Mar 7, 2025Updated 11 months ago
- ☆21Apr 17, 2025Updated 10 months ago
- Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.☆24Mar 12, 2025Updated 11 months ago
- Software Engineering Back End Microservices Project☆15Nov 20, 2024Updated last year
- Tile-based language built for AI computation across all scales☆138Updated this week
- ☆80Mar 11, 2025Updated 11 months ago
- Flexible and Pluggable Serving Engine for Diffusion LLMs☆58Feb 14, 2026Updated 2 weeks ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Sep 25, 2024Updated last year
- A better wrapper for using RDMA programming APIs in Rust flavor☆77Feb 10, 2026Updated 3 weeks ago
- ☆137Mar 20, 2025Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆90Mar 18, 2025Updated 11 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆41Jun 22, 2024Updated last year
- Use MobileNet SSD and openCV to detect and count car on road☆12Jan 13, 2020Updated 6 years ago
- Support for training SSD on TF2☆12Mar 29, 2023Updated 2 years ago
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆163Feb 11, 2026Updated 2 weeks ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated 11 months ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- Protocol buffers and other common resources.☆13Jan 20, 2026Updated last month
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- ☆13Nov 5, 2024Updated last year
- ☆53Feb 10, 2025Updated last year
- EAFT(Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo☆83Jan 15, 2026Updated last month
- The best library in the world to generate PDF from HTML☆13Feb 24, 2026Updated last week
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- A collection of resources for CS 2051, an undergraduate Honors Discrete Mathematics course at Georgia Tech.☆10Jun 24, 2023Updated 2 years ago
- Neural Destruction Search for Vehicle Routing Problems☆18Oct 6, 2025Updated 4 months ago
- Multimodal data loader compatible with pytorch and tensorflow☆12Aug 14, 2024Updated last year
- Vietnamese GPT-J API service deployed with Docker & Helm chart☆10Dec 11, 2022Updated 3 years ago
- ☆118May 19, 2025Updated 9 months ago