A lightweight deep learning training framework implemented from scratch in C++, featuring a PyTorch-style API.
☆177Apr 4, 2026Updated last month
Alternatives and similar repositories for TinyTorch
Users that are interested in TinyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tiny C++ LLM inference implementation from scratch☆113Updated this week
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆27Mar 12, 2026Updated 2 months ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆79Jan 21, 2021Updated 5 years ago
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆15Aug 15, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Handy tools & graphics API abstraction for blazing fast prototyping☆10Jan 17, 2024Updated 2 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30May 20, 2026Updated last week
- Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library☆52Aug 20, 2025Updated 9 months ago
- Next.js with TypeScript Example☆12Oct 21, 2021Updated 4 years ago
- ☆33Jul 23, 2024Updated last year
- ☆60Mar 31, 2026Updated last month
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated last month
- ☆27May 27, 2024Updated 2 years ago
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & …☆55May 1, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆65Nov 8, 2024Updated last year
- OneFlow Serving☆20Apr 10, 2025Updated last year
- Tiny-Megatron, a minimalistic re-implementation of the Megatron library☆26Sep 1, 2025Updated 8 months ago
- flash attention tutorial written in python, triton, cuda, cutlass☆517Jan 20, 2026Updated 4 months ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.☆60Feb 6, 2026Updated 3 months ago
- An object detection codebase based on MegEngine.☆28Dec 14, 2022Updated 3 years ago
- This is our Compiler Design project for 6th semester.☆12May 15, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification☆17Mar 24, 2026Updated 2 months ago
- ☆121May 16, 2025Updated last year
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Mar 12, 2020Updated 6 years ago
- Benchmarking OpenBLAS on the Apple M1☆18Dec 31, 2020Updated 5 years ago
- Wasserstein Gaussian Splatting☆18Dec 10, 2024Updated last year
- ☆25May 7, 2021Updated 5 years ago
- Tiny UTF-8 ANSI/VT102 terminal abstraction in C☆20Aug 19, 2014Updated 11 years ago
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Android demo for dabnn☆20Oct 18, 2019Updated 6 years ago
- ☆21Dec 27, 2024Updated last year
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆31Dec 21, 2024Updated last year
- ☆62Dec 5, 2021Updated 4 years ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆27Dec 6, 2020Updated 5 years ago
- Implementation of different diffusion models for probabilistic image generation☆44Jul 2, 2024Updated last year
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 11 months ago