owenliang / learnpytorch
☆9Updated last year
Alternatives and similar repositories for learnpytorch:
Users that are interested in learnpytorch are comparing it to the libraries listed below
- ☆26Updated 10 months ago
- llama 2 Inference☆42Updated last year
- Do NLP without coding! Simple NLP framework.☆21Updated 2 years ago
- Triton Documentation in Simplified Chinese☆62Updated 2 months ago
- A simple Transformer model implemented in C++. Attention Is All You Need.☆47Updated 4 years ago
- Notes on lock-free queues☆11Updated 8 years ago
- Courses on Bilibili☆72Updated last year
- CMake quick start☆27Updated 5 months ago
- OneFlow Serving☆20Updated 3 months ago
- A llama model inference framework implemented in CUDA C++☆48Updated 4 months ago
- A one-page-only CGraph-API-like DAG project.☆17Updated last month
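- A pure C++ cross-platform LLM acceleration library, callable from Python; supports the baichuan, glm, llama, and moss base models; runs smoothly on mobile devices, and chatglm-6B-class models can reach 10000+ tokens/s on a single card☆45Updated last year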
- Transformer related optimization, including BERT, GPT☆17Updated last year
- A high-performance text tokenizer library☆28Updated last year
- A layered, decoupled deep learning inference engine☆72Updated last month
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆17Updated 6 months ago
- Inference deployment of the llama3☆11Updated 11 months ago
- TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pyt…☆16Updated 8 months ago
- ☆22Updated last month
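- EasyNN is a neural network inference framework developed for teaching, designed so that anyone can write an inference framework on their own, even starting from zero background!☆26Updated 7 months ago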
- Inference code for LLaMA models☆118Updated last year
- PaddlePaddle Escort Program training camp☆19Updated last week
- CPU memory, compilers, and parallel programming☆25Updated 4 months ago
- ☢️ TensorRT 2023 second round: Llama model inference acceleration and optimization based on TensorRT-LLM☆46Updated last year
- A simplified flash-attention implemented with cutlass, intended for teaching☆38Updated 7 months ago
- A tutorial for CUDA&PyTorch☆131Updated 2 months ago
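- A beginner's introductory tutorial on model compression☆22Updated 8 months ago
- A PyTorch-like dynamic computation graph and neural network framework implemented in NumPy (MLP, CNN, RNN, Transformer)☆80Updated 9 months ago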
- A repo to learn C++☆31Updated last year
- A simple deep learning framework inspired by Dezero and PyTorch☆29Updated 2 months ago