[ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and Inference"
☆13Apr 17, 2025Updated 11 months ago
Alternatives and similar repositories for Occult
Users that are interested in Occult are comparing it to the libraries listed below
Sorting:
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Mar 6, 2025Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- ☆21May 2, 2025Updated 10 months ago
- ☆58May 4, 2024Updated last year
- [AAAI 25] Official Implementation for ”E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment“☆52Apr 22, 2025Updated 11 months ago
- [ICME2024, Official Code] for paper "Bringing Textual Prompt to AI-Generated Image Quality Assessment"☆21Jul 9, 2024Updated last year
- ☆15Dec 1, 2023Updated 2 years ago
- ☆12Jul 24, 2024Updated last year
- AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization☆30Feb 18, 2026Updated last month
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆95Apr 1, 2025Updated 11 months ago
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆101Aug 20, 2025Updated 7 months ago
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- Rhetorical sentence classification using LLMs☆11Oct 26, 2025Updated 4 months ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated last month
- Kinetics: Rethinking Test-Time Scaling Laws☆86Jul 11, 2025Updated 8 months ago
- SJTU 中文简约 LaTeX 报告模板☆10Jun 7, 2021Updated 4 years ago
- ☆99Jun 23, 2025Updated 8 months ago
- This repo implements an interface to GTAV for SCENIC language.☆11Dec 7, 2019Updated 6 years ago
- ☆18Jan 27, 2025Updated last year
- An ITK implementation of the GraphCut framework. See 'Graph cuts and efficient ND image segmentation' by Boykov and Funka-Lea and 'Intera…☆12Sep 18, 2017Updated 8 years ago
- [Main EMNLP'25] LLMs do Multi-Label Classification Differently☆14Feb 28, 2026Updated 3 weeks ago
- ATC23 AE☆46May 11, 2023Updated 2 years ago
- Official implementation of paper "CoIRL-AD: Collaborative and Competitive Imitation–Reinforcement Learning for Autonomous Driving"☆35Jan 25, 2026Updated last month
- ☆13Oct 30, 2024Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- ☆30Jul 21, 2025Updated 8 months ago
- Prompt-based pipeline for extracting procedural knowledge graphs from text with LLMs☆15Feb 17, 2026Updated last month
- Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformer…☆16Jan 13, 2026Updated 2 months ago
- [ICLR‘24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆104Jun 20, 2025Updated 9 months ago
- This project leverages advanced AI agents from crewAI to assist doctors in diagnosing medical conditions and recommending treatment plans…☆14Nov 16, 2024Updated last year
- ✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.☆14Nov 28, 2021Updated 4 years ago
- Ongoing research training transformer models at scale☆18Updated this week
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆12Mar 7, 2024Updated 2 years ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- HISIM introduces a suite of analytical models at the system level to speed up performance prediction for AI models, covering logic-on-log…☆64Mar 17, 2025Updated last year
- ITKGrowCut is a remote module for ITK. It segments a 3D image from user-provided foreground and background seeds.☆15Nov 15, 2025Updated 4 months ago
- Currently, there are many DeepSeek API providers on the market. Use DeepSeek Api Test to test which API performs the best☆19Feb 13, 2025Updated last year
- ☆17Aug 13, 2024Updated last year