☆52Dec 10, 2025Updated 2 months ago
Alternatives and similar repositories for Alpha-MoE
Users that are interested in Alpha-MoE are comparing it to the libraries listed below
Sorting:
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- ☆18Dec 9, 2025Updated 2 months ago
- ☆30Oct 22, 2025Updated 4 months ago
- ☆40Jan 16, 2026Updated last month
- ☆14Feb 9, 2026Updated 3 weeks ago
- Framework for Algorithmic Correctness Testing of Operators☆16Feb 20, 2026Updated last week
- ☆13Jul 2, 2025Updated 8 months ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- k8s CSI driver for FastCFS☆13Mar 17, 2024Updated last year
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆24Aug 27, 2025Updated 6 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Tools for easier OpenVINO development/debugging☆10Jul 16, 2025Updated 7 months ago
- ☆15Mar 26, 2025Updated 11 months ago
- A lightweight, self-hosted infrastructure layer for deploying and managing LLM agents as resilient microservices. Features automatic r…☆17Aug 4, 2025Updated 6 months ago
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter☆138Dec 5, 2025Updated 2 months ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆13May 29, 2024Updated last year
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆327Updated this week
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 7 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆32Dec 5, 2025Updated 2 months ago
- Fast low-bit matmul kernels in Triton☆433Feb 1, 2026Updated last month
- Inference SAM in C # based on OpenVINO, ONNX runtime, TensorRT☆18Jun 6, 2024Updated last year
- Authenticated Knowledge & Trust Architecture for AI Agents☆30Dec 17, 2025Updated 2 months ago
- ☆15Sep 22, 2024Updated last year
- Collection of scripts to build PyTorch and the domain libraries from source.☆13Feb 4, 2026Updated 3 weeks ago
- 🔍📃 LLM-powered PDF Table Extractor☆19Jun 26, 2025Updated 8 months ago
- The core repository for Katanemo's advanced function calling models with top-tier performance. Features three collections: Arch-Function …☆20Jun 23, 2025Updated 8 months ago
- Tutorials of Extending and importing TVM with CMAKE Include dependency.☆16Oct 11, 2024Updated last year
- Ready-to-use ML training recipes to help you build and deploy models on Baseten.☆42Updated this week
- NeuroBLAST v3 architecture code☆36Jan 6, 2026Updated last month
- An open-source simulator framework for neural processing units☆37Jan 30, 2026Updated last month
- assignment practice☆13Feb 22, 2019Updated 7 years ago
- Interact with various LLMs in your browser (LangChain.js, Angular)☆17Updated this week
- ☆25Apr 7, 2025Updated 10 months ago
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.☆20Jan 5, 2026Updated last month