β54Dec 10, 2025Updated 3 months ago
Alternatives and similar repositories for Alpha-MoE
Users that are interested in Alpha-MoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β53Mar 3, 2026Updated 2 weeks ago
- [ICLR 2026 π₯] Official pytorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"β37Jan 23, 2026Updated 2 months ago
- Framework for Algorithmic Correctness Testing of Operatorsβ16Mar 9, 2026Updated last week
- Repository for MetaVC -- A Meta Local Search Framework For Minimum Vertex Cover (MinVC)β10Jan 15, 2022Updated 4 years ago
- Standalone commandline CLI tool for compiling Triton kernelsβ20Sep 13, 2024Updated last year
- Recursive Self-Aggregation evals on ARC-AGIβ29Jan 26, 2026Updated last month
- β19Aug 23, 2025Updated 7 months ago
- MS108 Course Project, SJTU ACM Class.β33Dec 20, 2022Updated 3 years ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Nov 1, 2024Updated last year
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"β31Mar 5, 2026Updated 2 weeks ago
- vTPM with SGX protectionβ11May 30, 2019Updated 6 years ago
- ι³ι’εεΊ¦η»δΈοΌι³ιε½δΈεε€ηβ13May 3, 2024Updated last year
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generationβ21Mar 29, 2019Updated 6 years ago
- Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMsβ54Mar 13, 2026Updated last week
- OpenVINO LLM Benchmarkβ11Dec 7, 2023Updated 2 years ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile scriptβ13Sep 16, 2020Updated 5 years ago
- β19Dec 9, 2025Updated 3 months ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL codeβ25Aug 27, 2025Updated 6 months ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.β26Feb 21, 2026Updated last month
- Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.β20Jan 5, 2026Updated 2 months ago
- Metadata Editor user and practice guideβ17Mar 11, 2026Updated last week
- Verify that any MCP server is running the intended and untampered code via hardware attestation.β18Mar 28, 2025Updated 11 months ago
- Website for the ICML 2021 tutorial on Random Matrix Theory and Machine Learningβ16Dec 8, 2021Updated 4 years ago
- β30Oct 22, 2025Updated 5 months ago
- β14Feb 9, 2026Updated last month
- k8s CSI driver for FastCFSβ13Mar 17, 2024Updated 2 years ago
- Official scripts modified by yours truly (@starhopp3r) and @jameshi16 that allow OpenVINOβ’ to run on Ubuntu 18.04.β19Sep 30, 2021Updated 4 years ago
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β13Jan 30, 2026Updated last month
- Inference Llama 2 with a model compiled to native code by TorchInductorβ14Feb 8, 2024Updated 2 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.β33Dec 5, 2025Updated 3 months ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.β332Updated this week
- Fast low-bit matmul kernels in Tritonβ438Feb 1, 2026Updated last month
- β13Jul 2, 2025Updated 8 months ago
- β27Apr 7, 2025Updated 11 months ago
- β40Jan 16, 2026Updated 2 months ago
- A lightweight, self-hosted infrastructure layer for deploying and managing LLM agents as resilient microservices. Features automatic rβ¦β18Aug 4, 2025Updated 7 months ago
- Power measurement for CUDA programs by polling using NVIDIA Management Library (nvml) APIs.β26Jun 24, 2017Updated 8 years ago
- β21Mar 3, 2025Updated last year
- β21Updated this week