A lightweight deep learning training framework implemented from scratch in C++, featuring a PyTorch-style API.
☆183Jun 10, 2026Updated last week
Alternatives and similar repositories for TinyTorch
Users that are interested in TinyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tiny C++ LLM inference implementation from scratch☆113Jun 10, 2026Updated last week
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆27Jun 10, 2026Updated last week
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆79Jan 21, 2021Updated 5 years ago
- 对 tensorRT_Pro 开源项目理解☆22Feb 23, 2023Updated 3 years ago
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆52Aug 20, 2025Updated 9 months ago
- ☆33Jul 23, 2024Updated last year
- ☆69Mar 31, 2026Updated 2 months ago
- 该仓库整理收录了北大陈斌教授的《Python语言基础与应用》课 程的相关习题答案☆11Jan 26, 2021Updated 5 years ago
- Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing…☆36Jan 15, 2026Updated 5 months ago
- ☆27May 27, 2024Updated 2 years ago
- A toy Python DL training library with PyTorch like API☆38Sep 23, 2025Updated 8 months ago
- 3D Game Engine.☆25May 25, 2026Updated 3 weeks ago
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & …☆54May 1, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆65Nov 8, 2024Updated last year
- TensaLang is a Tensor-first programming language, compiler, and runtime that let you write the Model’s inference engine (e.g. LLMs) and s…☆74Feb 20, 2026Updated 3 months ago
- A practical way of learning Swizzle☆41Feb 3, 2025Updated last year
- OneFlow Serving☆20Apr 10, 2025Updated last year
- Tiny-Megatron, a minimalistic re-implementation of the Megatron library☆27Sep 1, 2025Updated 9 months ago
- flash attention tutorial written in python, triton, cuda, cutlass☆522Jan 20, 2026Updated 4 months ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- No-dependency OpenGL support library, which abstracts the processes of creating buffers and shaders☆14Apr 28, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.☆61Feb 6, 2026Updated 4 months ago
- 晚上下班不刷手机,学点什么。系列一:CUDA 计算框架 CUFX (Cuda Framework eXtended)。☆17Dec 15, 2024Updated last year
- This is our Compiler Design project for 6th semester.☆12May 15, 2022Updated 4 years ago
- 图像重建过程中的算法记录☆14Apr 20, 2019Updated 7 years ago
- A 2D/3D Photorealistic Engine written in OpenGL☆11Nov 17, 2016Updated 9 years ago
- ☆122May 16, 2025Updated last year
- compiling DSLs to high-level hardware instructions☆23Nov 8, 2022Updated 3 years ago
- 并行程序设计导论 源码与课后题答案☆21Jul 9, 2021Updated 4 years ago
- Light Map Baker is a c++ library that bakes lightmaps.☆10Jan 6, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆25May 7, 2021Updated 5 years ago
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- Android demo for dabnn☆20Oct 18, 2019Updated 6 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- Tutorial for writing an LLVM backend☆32May 19, 2025Updated last year
- GPU-Accelerated Software Rasterizer☆11Jun 8, 2017Updated 9 years ago
- Voxel Cone Tracing Implementation☆16Nov 18, 2021Updated 4 years ago