Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism
☆31Apr 4, 2025Updated last year
Alternatives and similar repositories for MegaScale-Infer-Prototyp
Users that are interested in MegaScale-Infer-Prototyp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accepted to MLSys 2026☆85Apr 19, 2026Updated last month
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆15Dec 9, 2024Updated last year
- ☆16Feb 10, 2023Updated 3 years ago
- Scaling Laws for Mixture of Experts Models☆15Feb 25, 2025Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Jun 1, 2026Updated last week
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 8 months ago
- 一步步通关GPU编程☆45Updated this week
- RankFormer: Listwise Learning-to-Rank Using Listwide Labels (KDD 2023).☆26Sep 12, 2023Updated 2 years ago
- Modular RDMA Interface☆130Updated this week
- High-performance distributed data shuffling (all-to-all) library for MoE training and inference☆121Mar 7, 2026Updated 3 months ago
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- A better wrapper for using RDMA programming APIs in Rust flavor☆85May 25, 2026Updated 2 weeks ago
- Extension for https://github.com/jenkinsci/workflow-multibranch-plugin add defaults pipeline script☆20Dec 16, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Mar 24, 2024Updated 2 years ago
- ☆17Oct 22, 2020Updated 5 years ago
- NVIDIA Networking NIC Configuration Operator For Kubernetes☆20Jun 2, 2026Updated last week
- An asynchronous streaming data management module for efficient post-training.☆88Jun 1, 2026Updated last week
- Simulating Distributed Training at Scale☆14Sep 15, 2025Updated 8 months ago
- ☆11Apr 23, 2020Updated 6 years ago
- ☆19Feb 14, 2023Updated 3 years ago
- Memory Topology for GPUs☆19May 11, 2026Updated 3 weeks ago
- Real-time statusline HUD for OpenAI Codex CLI - Monitor sessions, context usage, git status, and tool activity☆53Jun 2, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- a follow-up work of HPM-MVS☆40May 20, 2024Updated 2 years ago
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…☆20Updated this week
- Simple PyTorch graph capturing.☆21May 31, 2023Updated 3 years ago
- A framework for generating realistic LLM serving workloads☆149May 11, 2026Updated 3 weeks ago
- Plato is a system for viewport adaptation based bitrate adaptive VR video streaming.☆15May 1, 2018Updated 8 years ago
- The Easiest Pytorch Implementation of Branching-DQN☆12Feb 10, 2021Updated 5 years ago
- A deep model for speech recognition via Keras(front_end) and TensorFlow(back_end).☆12Feb 16, 2023Updated 3 years ago
- Cursor IDE (v2.6.22) backend endpoint API reverse engineered☆67Apr 2, 2026Updated 2 months ago
- Heterogeneous Gaussian Mechanism: Preserving Differential Privacy in Deep Learning with Provable Robustness (IJCAI'19).☆13Apr 16, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Spatial Transformer Network (STN) provides attention to a particular region to in an image, by doing transformation to the input image. T…☆15Dec 21, 2020Updated 5 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Oct 8, 2023Updated 2 years ago
- ☆47Sep 8, 2025Updated 9 months ago
- Low-Latency Live Video Streaming over a Low-Earth-Orbit Satellite Network with DASH☆18Sep 6, 2024Updated last year
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆15Jan 12, 2024Updated 2 years ago
- turboquant-based compression engine for LLM KV cache☆61Apr 3, 2026Updated 2 months ago
- Pytorch Text GAN for lyrics generation☆10Apr 13, 2019Updated 7 years ago