☆148Apr 4, 2026Updated 3 weeks ago
Alternatives and similar repositories for popcorn-cli
Users that are interested in popcorn-cli are comparing it to the libraries listed below.
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆256Apr 13, 2026Updated 2 weeks ago
- Write a fast kernel and see how you compare against the best humans and AI on gpumode.com☆92Apr 21, 2026Updated last week
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆99Updated this week
- ☆21Mar 3, 2025Updated last year
- PolyLib official git.☆11Jan 27, 2026Updated 3 months ago
- AI Tensor Engine for ROCm☆414Updated this week
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 4 months ago
- ☆13Apr 17, 2026Updated last week
- Explore training for quantized models☆26Jul 12, 2025Updated 9 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆30Mar 29, 2026Updated last month
- ☆13Jul 2, 2025Updated 9 months ago
- Quantized LLM training in pure CUDA/C++.☆243Mar 6, 2026Updated last month
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention☆62Apr 7, 2026Updated 3 weeks ago
- ☆53Nov 3, 2025Updated 5 months ago
- ☆91Feb 29, 2024Updated 2 years ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆140Apr 8, 2026Updated 3 weeks ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Jan 25, 2025Updated last year
- Step by step implementation of a fast softmax kernel in CUDA☆68Jan 6, 2025Updated last year
- 📚 A curated list of awesome matrix-matrix multiplication (A * B = C) frameworks, libraries and software☆64Feb 23, 2025Updated last year
- Training camp lecture notes☆21Jan 21, 2025Updated last year
- ☆42Apr 11, 2026Updated 2 weeks ago
- learn llvm from scratch☆14Apr 29, 2023Updated 3 years ago
- ☆43May 9, 2025Updated 11 months ago
- ☆26Apr 17, 2026Updated last week
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- ☆66Updated this week
- Framework for Algorithmic Correctness Testing of Operators☆17Mar 9, 2026Updated last month
- Does all kinds of cool stuff to make analyzing meta classes easier. Now featuring WRedLogger.py, the previous backend of NetDbg☆10Jun 7, 2023Updated 2 years ago
- A website for accessing many models through an API (DeepSeek, Qwen, Hunyuan, etc.)☆16Jul 12, 2025Updated 9 months ago
- ☆17Apr 30, 2025Updated last year
- ☆40Mar 25, 2026Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Apr 22, 2026Updated last week
- Graph model execution API for Candle☆17Jul 27, 2025Updated 9 months ago
- Let's count triangles together!☆10Jun 27, 2024Updated last year
- Working implementation of DeepSeek MLA☆44Jan 8, 2025Updated last year