P1ayer-1/Llama-LibTorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/P1ayer-1/Llama-LibTorch)

P1ayer-1 / Llama-LibTorch

Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5

☆16

Alternatives and similar repositories for Llama-LibTorch

Users that are interested in Llama-LibTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZiQiangXie / llm-from-scratch
View on GitHub
LLM implementation one matrix multiplication at a time
☆13Aug 8, 2024Updated last year
triple-mu / HunyuanDiT-TensorRT-libtorch
View on GitHub
HunyuanDiT with TensorRT and libtorch
☆18May 22, 2024Updated 2 years ago
konan6915 / SuperPointCPP
View on GitHub
Superpoint模型调用的C++实现(使用libtorch)
☆18Jun 13, 2023Updated 3 years ago
carsonpo / safetensors.cpp
View on GitHub
Zero Dependency LibTorch Safetensors Loading and Storing in C++
☆23Jul 12, 2024Updated 2 years ago
wmmae / wmma_extension
View on GitHub
An extension library of WMMA API (Tensor Core API)
☆115Jul 12, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
digitalocean / go-ps
View on GitHub
Find, list, and inspect processes from Go (golang).
☆10Feb 4, 2018Updated 8 years ago
ZJU-lishuang / yolov5_tensorrt
View on GitHub
yolov5部署
☆19Jun 17, 2022Updated 4 years ago
emre570 / transformer.cu
View on GitHub
Transformer Architecture written with CUDA, C++ and LibTorch.
☆11Jul 26, 2025Updated 11 months ago
SunnyHaze / miSRGAN-pytorch
View on GitHub
Unofficial pytorch implementation of miSRGAN, in paper "3D Registration of pre-surgical prostate MRI and histopathology images via super-…
☆11Dec 6, 2023Updated 2 years ago
BBuf / tensorrt-llm-moe
View on GitHub
☆34Feb 3, 2025Updated last year
XuYongi / UNet-Pruning
View on GitHub
UNet-Pruning b developing NNI
☆10Sep 2, 2020Updated 5 years ago
neuralps3d / neuralps3d
View on GitHub
Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint
☆16Aug 8, 2022Updated 3 years ago
wxia43 / DesmokeData
View on GitHub
☆13Aug 25, 2025Updated 10 months ago
ShaYeBuHui01 / flash_attention_inference
View on GitHub
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
☆15Aug 31, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sail-sg / InfNeRF
View on GitHub
InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity
☆12Jan 3, 2026Updated 6 months ago
littlebearsama / xxCu3Dlibrary
View on GitHub
cuda 加速3D点云算法库，持续更新（含cudaicp，glfw点云可视化等）
☆17Aug 24, 2022Updated 3 years ago
Smorodov / nano_bfm
View on GitHub
Basel morphable face model mesh and texture generator using GPU.
☆14Sep 14, 2020Updated 5 years ago
davegreenwood / torch-tps
View on GitHub
Implementation of Thin-Plate-Splines (TPS) in PyTorch.
☆13Mar 10, 2020Updated 6 years ago
YinZHag / DFENet-A-dual-branch-feature-enhanced-network-integrating-transformers-and-convolutional-feature-lea
View on GitHub
It's a project of medical image processing.
☆14Oct 16, 2022Updated 3 years ago
rushikesh12 / Classification-of-T1-MRI-Images
View on GitHub
Applied 3D CNN on T1-MRI images for classifying healthy/unhealthy subjects.
☆10Nov 13, 2020Updated 5 years ago
feiyuhuahuo / SOLOv2_minimal
View on GitHub
Minimal PyTorch implementation of SOLOv2.
☆15Jan 14, 2025Updated last year
caps-tum / mt4g
View on GitHub
Memory Topology for GPUs
☆19Updated this week
Wangdai-0800 / CoordinateAttention_Keras
View on GitHub
A Keras Implementation of Coordinate Attention follows https://github.com/Andrew-Qibin/CoordAttention
☆13Sep 25, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jtirana98 / MultiHop-Federeated-Split-Learning
View on GitHub
Framework that supports federated split learning with multiple hops that enables pipeline parallelism.
☆16Jun 17, 2026Updated last month
lavendelion / my-PFLD-pytorch
View on GitHub
Implementation of PFLD(Paper: "A Practical Facial Landmark Detector") by pytorch.
☆15Feb 16, 2021Updated 5 years ago
l-sf / Nanodet_openvino_quant_deploy
View on GitHub
本仓库在OpenVINO推理框架下部署Nanodet检测算法，并重写预处理和后处理部分，具有超高性能！让你在Intel CPU平台上的检测速度起飞！并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度，推理速度更快！
☆16Jun 14, 2023Updated 3 years ago
DD-DuDa / Cute-Learning
View on GitHub
Examples of CUDA implementations by Cutlass CuTe
☆279Jul 1, 2025Updated last year
behzadhaki / MonotonicGrooveTransformer
View on GitHub
☆24Mar 1, 2023Updated 3 years ago
deezer / zeroNoteSamba
View on GitHub
Repository for the IEEE/ACM TASLP 2023 Paper "Zero-Note Samba: Self-Supervised Beat Tracking".
☆29Jul 25, 2023Updated 2 years ago
IAMLAB-Ryerson / FlowReg
View on GitHub
☆14Nov 20, 2021Updated 4 years ago
junyuchen245 / TransMorph_TVF
View on GitHub
Unsupervised Learning of Diffeomorphic Image Registration via TransMorph (PyTorch)
☆17Apr 30, 2025Updated last year
spiegelib / spiegelib
View on GitHub
Automatic Synthesizer Programming Library
☆61Jul 6, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
IST-DASLab / gemm-fp8
View on GitHub
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆21Jan 24, 2025Updated last year
randerka / cNeRF
View on GitHub
A concise C++ implementation of Neural Radiance Fields (NeRF) using LibTorch.
☆55Apr 3, 2023Updated 3 years ago
PaddlePaddle / PaDiff
View on GitHub
Paddle Automatically Diff Precision Toolkits.
☆53Dec 5, 2025Updated 7 months ago
markus-k / bisenetv2-tf2
View on GitHub
BiSeNet V2 TensorFlow 2 Implementation capable of running on Edge TPUs
☆16Nov 18, 2022Updated 3 years ago
gmum / HyperNeRFGAN
View on GitHub
Generative model for 3D objects.
☆18Aug 12, 2023Updated 2 years ago
adrienchaton / neural_granular_synthesis
View on GitHub
ICMC 2020 "Neural Granular Sound Synthesis"
☆40Nov 3, 2021Updated 4 years ago
Triang-jyed-driung / i8muon
View on GitHub
Muon in Int8 Precision Made Possible
☆20Jun 18, 2026Updated last month