A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.
☆165Mar 26, 2026Updated this week
Alternatives and similar repositories for TinyTorch
Users that are interested in TinyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tiny C++ LLM inference implementation from scratch☆108Mar 12, 2026Updated 2 weeks ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆78Jan 21, 2021Updated 5 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- 对 tensorRT_Pro 开源项目理解☆22Feb 23, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆15Aug 15, 2025Updated 7 months ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30Feb 22, 2026Updated last month
- Next.js with TypeScript Example☆12Oct 21, 2021Updated 4 years ago
- Crixus is a preprocessing tool for SPH, in particular Spartacus3D, Sphynx and GPUSPH.☆11Mar 18, 2016Updated 10 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Mar 23, 2026Updated last week
- Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing…☆36Jan 15, 2026Updated 2 months ago
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & …☆55Jan 30, 2026Updated 2 months ago
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆65Nov 8, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- OneFlow Serving☆20Apr 10, 2025Updated 11 months ago
- flash attention tutorial written in python, triton, cuda, cutlass☆494Jan 20, 2026Updated 2 months ago
- Tiny-Megatron, a minimalistic re-implementation of the Megatron library☆23Sep 1, 2025Updated 6 months ago
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- 晚上下班不刷手机,学点什么。系列一:CUDA 计算框架 CUFX (Cuda Framework eXtended)。☆16Dec 15, 2024Updated last year
- This is our Compiler Design project for 6th semester.☆12May 15, 2022Updated 3 years ago
- ☆119May 16, 2025Updated 10 months ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification☆17Updated this week
- 并行程序设计导论 源码与课后题答案☆20Jul 9, 2021Updated 4 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Mar 12, 2020Updated 6 years ago
- ☆12Dec 29, 2021Updated 4 years ago
- DeeperGEMM: crazy optimized version☆75May 5, 2025Updated 10 months ago
- ☆26May 7, 2021Updated 4 years ago
- Android demo for dabnn☆20Oct 18, 2019Updated 6 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆30Dec 21, 2024Updated last year
- ☆63Dec 5, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Tutorial for writing an LLVM backend☆31May 19, 2025Updated 10 months ago
- Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"☆28Dec 6, 2020Updated 5 years ago
- An open source benchmark for Multi Agent Reinforcement Learning☆31Jul 15, 2023Updated 2 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆22Feb 5, 2026Updated last month
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 9 months ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆35Sep 15, 2023Updated 2 years ago
- ☆12Jan 29, 2026Updated 2 months ago