Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5
☆16Sep 19, 2024Updated last year
Alternatives and similar repositories for Llama-LibTorch
Users that are interested in Llama-LibTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- PyTorch extension enabling direct access to cuDNN-accelerated C++ convolution functions.☆13Mar 14, 2021Updated 5 years ago
- Zero Dependency LibTorch Safetensors Loading and Storing in C++☆23Jul 12, 2024Updated last year
- Neural radiance fields(NeRF) c++ LibTorch implementation☆17Dec 30, 2025Updated 5 months ago
- Superpoint模型调用的C++实现(使用libtorch)☆18Jun 13, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated 2 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- This demo shows you how to build a single pose estimation algorithm in C++ using libtorch The model is trained using pytorch (Alphapose…☆15Feb 19, 2020Updated 6 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 7 months ago
- The Tensed Computer Improviser☆15May 6, 2026Updated last month
- 自然场景检测DBNet网络的tensorrt版本☆24Feb 7, 2021Updated 5 years ago
- yolov5部署☆19Jun 17, 2022Updated 3 years ago
- ☆33Feb 3, 2025Updated last year
- The inference implementation of the deeplabV3+ person segementation algorithm.☆24Jan 1, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- OSC Property Wrapper☆23May 11, 2024Updated 2 years ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Basel morphable face model mesh and texture generator using GPU.☆14Sep 14, 2020Updated 5 years ago
- Transformer Architecture written with CUDA, C++ and LibTorch.☆10Jul 26, 2025Updated 10 months ago
- InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity☆12Jan 3, 2026Updated 5 months ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆16Aug 31, 2023Updated 2 years ago
- Automatic Synthesizer Programming Library☆60Jul 6, 2023Updated 2 years ago
- A lightweight UNet implementation, using Keras☆14Jan 16, 2020Updated 6 years ago
- Detectron2 Libtorch C++ faster rcnn☆13Aug 6, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A library for computing Frechet Music Distance.☆31Feb 4, 2025Updated last year
- Basically, I implemented U-Net architecture using Dense-Blocks instead of Convolution layers and also added a Dilated Spatial Pooling Lay…☆15Jun 11, 2019Updated 6 years ago
- It's a project of medical image processing.☆14Oct 16, 2022Updated 3 years ago
- Minimal PyTorch implementation of SOLOv2.☆15Jan 14, 2025Updated last year
- OpenGL 学习代码☆15Jun 25, 2023Updated 2 years ago
- Simulator for EYESY visualizer☆24Jun 5, 2022Updated 4 years ago
- Reproduction of MobileSAM using pytorch☆22Oct 27, 2023Updated 2 years ago
- Keras implementation of pointnet for part segmentation on shapenet dataset☆14Apr 14, 2020Updated 6 years ago
- LLM checkpointing for DeepSpeed/Megatron☆25Nov 30, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Keras Implementation of Coordinate Attention follows https://github.com/Andrew-Qibin/CoordAttention☆13Sep 25, 2021Updated 4 years ago
- Implementation of PFLD(Paper: "A Practical Facial Landmark Detector") by pytorch.☆15Feb 16, 2021Updated 5 years ago
- ☆24Mar 1, 2023Updated 3 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆16Jun 14, 2023Updated 2 years ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆155May 10, 2025Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Oct 10, 2023Updated 2 years ago
- Tensor library for machine learning☆30May 19, 2026Updated 2 weeks ago