LSTM-Kirigaya / SimpleTensorLinks
a simple demo to implement a deep learning frame based on static graph
☆20Updated 3 years ago
Alternatives and similar repositories for SimpleTensor
Users that are interested in SimpleTensor are comparing it to the libraries listed below
Sorting:
- A tutorial for CUDA&PyTorch☆142Updated 4 months ago
- Implement custom operators in PyTorch with cuda/c++☆62Updated 2 years ago
- ☆35Updated last year
- A light llama-like llm inference framework based on the triton kernel.☆122Updated this week
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆16Updated 2 years ago
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆92Updated last week
- NumPy实现类PyTorch的动态计算图和神经网络框架(MLP, CNN, RNN, Transformer)☆81Updated 11 months ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆130Updated last year
- Implementation of FlashAttention in PyTorch☆150Updated 4 months ago
- ☆134Updated last year
- Implement Flash Attention using Cute.☆85Updated 5 months ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆48Updated last year
- ☆31Updated last year
- ☆120Updated 2 years ago
- 机器学习编译 陈天奇☆34Updated 2 years ago
- b站上的课程☆75Updated last year
- Codes & examples for "CUDA - From Correctness to Performance"