☆606Jun 27, 2026Updated this week
Alternatives and similar repositories for AI-Infra-Auto-Driven-SKILLS
Users that are interested in AI-Infra-Auto-Driven-SKILLS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ToyLLM: Learning LLM from Scratch☆25Jun 22, 2026Updated last week
- CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs☆216Updated this week
- Flash-Linear-Attention models beyond language☆21Aug 28, 2025Updated 10 months ago
- Open-source toolkit for training, Priming, and serving next generation Hybrid architectures☆72Jun 15, 2026Updated 2 weeks ago
- Mirror of Sven Verdoolaege's isl at http://repo.or.cz/w/isl.git (occasionally with changes for islpy)☆10Dec 16, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repo is reproduction resources for linear alignment paper, still working☆17May 19, 2024Updated 2 years ago
- Evaluate state-of-the-art GPU joins☆14Nov 29, 2023Updated 2 years ago
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated last year
- NVIDIA cuTile learn☆168Dec 9, 2025Updated 6 months ago
- Flash Attention in 300-500 lines of CUDA/C++☆38Aug 22, 2025Updated 10 months ago
- ☆50Jun 7, 2025Updated last year
- ☆21May 17, 2015Updated 11 years ago
- ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage☆90May 12, 2026Updated last month
- ☆201Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TensorRT encapsulation, learn, rewrite, practice.☆30Oct 19, 2022Updated 3 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆28Jun 20, 2026Updated last week
- This is the official implementation for paper "On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond".☆22Nov 17, 2025Updated 7 months ago
- Longitudinal Evaluation of LLMs via Data Compression☆32May 29, 2024Updated 2 years ago
- The official implementation for the paper 'mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval'.☆11Aug 23, 2022Updated 3 years ago
- Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy☆60Jun 18, 2026Updated last week
- This is my CUDA optimization of OpenCV seamlessClone API at NORMAL_CLONE mode.☆10Oct 29, 2023Updated 2 years ago
- ☆45Nov 1, 2025Updated 7 months ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆90Jan 12, 2026Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Mar 24, 2024Updated 2 years ago
- Expert Specialization MoE Solution based on CUTLASS☆27Apr 14, 2026Updated 2 months ago
- ☆13Jan 21, 2024Updated 2 years ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆82Aug 12, 2024Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆29Mar 22, 2026Updated 3 months ago
- A fork of Allen Smith's Bricksmith macOS application for building LEGO models with LDraw☆10May 15, 2024Updated 2 years ago
- Real-time statusline HUD for OpenAI Codex CLI - Monitor sessions, context usage, git status, and tool activity☆57Jun 2, 2026Updated 3 weeks ago
- A bunch of kernels that might make stuff slower 😉☆91Jun 18, 2026Updated last week
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Accepted to MLSys 2026☆88Apr 19, 2026Updated 2 months ago
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…☆15Apr 28, 2022Updated 4 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 7 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year
- https://interactivetraining.ai/☆18Oct 2, 2025Updated 8 months ago
- ☆10Dec 12, 2020Updated 5 years ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 8 months ago