☆541Jun 8, 2026Updated this week
Alternatives and similar repositories for AI-Infra-Auto-Driven-SKILLS
Users that are interested in AI-Infra-Auto-Driven-SKILLS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ToyLLM: Learning LLM from Scratch☆25Jun 1, 2026Updated last week
- CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs☆193May 22, 2026Updated 2 weeks ago
- Flash-Linear-Attention models beyond language☆21Aug 28, 2025Updated 9 months ago
- Open-source toolkit for training, Priming, and serving next generation Hybrid architectures☆71May 9, 2026Updated 3 weeks ago
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated last year
- ☆173Updated this week
- NVIDIA cuTile learn☆168Dec 9, 2025Updated 5 months ago
- Flash Attention in 300-500 lines of CUDA/C++☆36Aug 22, 2025Updated 9 months ago
- 📚 Claude Skills 开发完全指南 - 从基础到精通 | Complete guide for developing Claude Skills - from basics to mastery☆195Oct 27, 2025Updated 7 months ago
- ☆101May 10, 2026Updated 3 weeks ago
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆86May 12, 2026Updated 3 weeks ago
- TensorRT encapsulation, learn, rewrite, practice.☆30Oct 19, 2022Updated 3 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆24Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 使用VC检测车道线(曲线)☆10Apr 23, 2018Updated 8 years ago
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆22Nov 17, 2025Updated 6 months ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated 2 years ago
- Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy☆58Updated this week
- ☆45Nov 1, 2025Updated 7 months ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆89Jan 12, 2026Updated 4 months ago
- ☆13Mar 24, 2024Updated 2 years ago
- Expert Specialization MoE Solution based on CUTLASS☆27Apr 14, 2026Updated last month
- ☆248Nov 19, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Jan 21, 2024Updated 2 years ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆81Aug 12, 2024Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆29Mar 22, 2026Updated 2 months ago
- Real-time statusline HUD for OpenAI Codex CLI - Monitor sessions, context usage, git status, and tool activity☆53Jun 2, 2026Updated last week
- A bunch of kernels that might make stuff slower 😉☆90Updated this week
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- Accepted to MLSys 2026☆85Apr 19, 2026Updated last month
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…☆20Updated this week
- MNBVC项目-ShareGPT语料清洗☆16Oct 4, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple PyTorch graph capturing.☆21May 31, 2023Updated 3 years ago
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆59Aug 12, 2024Updated last year
- Top-K Deep Video Analytics: A Probabilistic Approach☆13Jul 21, 2022Updated 3 years ago
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…☆15Apr 28, 2022Updated 4 years ago
- ☆11Jul 20, 2023Updated 2 years ago
- Cursor IDE (v2.6.22) backend endpoint API reverse engineered☆67Apr 2, 2026Updated 2 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year