MLSys competition for the best MOE NKI kernels
☆41Apr 15, 2026Updated this week
Alternatives and similar repositories for nki-moe
Users that are interested in nki-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Updated this week
- ☆14May 29, 2024Updated last year
- ☆12Dec 20, 2025Updated 3 months ago
- ☆30Feb 28, 2025Updated last year
- The repository guides you through generating a synthetic dataset for a QA-RAG application using the Bedrock API, Python and Langchain.☆20Sep 17, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆18May 9, 2024Updated last year
- ☆33Apr 11, 2026Updated last week
- Evaluation Suite for NVMe devices☆14Nov 14, 2024Updated last year
- ☆18Jun 12, 2025Updated 10 months ago
- libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源☆12May 21, 2024Updated last year
- Welcome to the HR-AGI-Tool repository! This project aims to revolutionize the way autonomous agents interact with Human Resource tasks. E…☆16Aug 21, 2023Updated 2 years ago
- ☆62Apr 11, 2026Updated last week
- ☆18Feb 25, 2026Updated last month
- GPU-Accelerated Multi-Agent Reinforcement Learning for High-Frequency Trading☆52Mar 16, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆12Apr 30, 2024Updated last year
- Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences from Google research☆15Jul 13, 2020Updated 5 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 8 months ago
- ☆19Dec 6, 2024Updated last year
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- ☆20May 24, 2025Updated 10 months ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- DiscreteTom's Blog Boilerplate.☆10Mar 6, 2023Updated 3 years ago
- Benchmark suite for LLMs from Fireworks.ai☆99Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Some crazy experiments☆35Sep 3, 2025Updated 7 months ago
- Query Only Linear Adapter Training for Fine Tuned Embedding Model Query Representation☆28Sep 12, 2024Updated last year
- setup pytorch on android☆12Mar 2, 2020Updated 6 years ago
- socket program to send data with encryption☆12Jun 1, 2021Updated 4 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 4 years ago
- C++ implement a simple CNN framework to train mnist data. Done!☆10Mar 29, 2022Updated 4 years ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆30Jan 22, 2026Updated 2 months ago
- Benchmark SGLang on SLURM☆24Updated this week
- Trends, Tools, News timeline ...☆20Oct 13, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Apr 15, 2022Updated 4 years ago
- For advanced physics-driven combined with neural network enhancement force field.☆17Mar 9, 2026Updated last month
- The implement of geometric solver PGPSNet☆30Jan 30, 2025Updated last year
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆11Sep 18, 2024Updated last year
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- ☆43Jan 29, 2026Updated 2 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year