Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5
☆16Sep 19, 2024Updated last year
Alternatives and similar repositories for Llama-LibTorch
Users that are interested in Llama-LibTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- One Diffusion model implementation base on LibTorch☆13Mar 22, 2023Updated 3 years ago
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- PyTorch extension enabling direct access to cuDNN-accelerated C++ convolution functions.☆13Mar 14, 2021Updated 5 years ago
- Neural radiance fields(NeRF) c++ LibTorch implementation☆17Dec 30, 2025Updated 4 months ago
- Superpoint模型调用的C++实现(使用libtorch)☆18Jun 13, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This demo shows you how to build a single pose estimation algorithm in C++ using libtorch The model is trained using pytorch (Alphapose…☆15Feb 19, 2020Updated 6 years ago
- The Tensed Computer Improviser☆15May 6, 2026Updated last week
- 自然场景检测DBNet网络的tensorrt版本☆24Feb 7, 2021Updated 5 years ago
- ☆26Feb 23, 2026Updated 2 months ago
- Unofficial pytorch implementation of miSRGAN, in paper "3D Registration of pre-surgical prostate MRI and histopathology images via super-…☆11Dec 6, 2023Updated 2 years ago
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- OSC Property Wrapper☆23May 11, 2024Updated 2 years ago
- Implementation of the soft 2D Frangi filter on Pytorch☆11Mar 2, 2022Updated 4 years ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Basel morphable face model mesh and texture generator using GPU.☆14Sep 14, 2020Updated 5 years ago
- Transformer Architecture written with CUDA, C++ and LibTorch.☆10Jul 26, 2025Updated 9 months ago
- ☆14Dec 21, 2025Updated 4 months ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆16Aug 31, 2023Updated 2 years ago
- cuda 加速3D点云算法库,持续更新(含cudaicp,glfw点云可视化等)☆16Aug 24, 2022Updated 3 years ago
- Automatic Synthesizer Programming Library☆58Jul 6, 2023Updated 2 years ago
- A lightweight UNet implementation, using Keras☆14Jan 16, 2020Updated 6 years ago
- Detectron2 Libtorch C++ faster rcnn☆13Aug 6, 2021Updated 4 years ago
- A library for computing Frechet Music Distance.☆31Feb 4, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- It's a project of medical image processing.☆14Oct 16, 2022Updated 3 years ago
- Minimal PyTorch implementation of SOLOv2.☆15Jan 14, 2025Updated last year
- High Performance Int8 GEMM Kernels for SM80 and later GPUs.☆23Mar 11, 2025Updated last year
- Simulator for EYESY visualizer☆23Jun 5, 2022Updated 3 years ago
- Reproduction of MobileSAM using pytorch☆18Oct 27, 2023Updated 2 years ago
- ☆17Dec 10, 2018Updated 7 years ago
- A Keras Implementation of Coordinate Attention follows https://github.com/Andrew-Qibin/CoordAttention☆13Sep 25, 2021Updated 4 years ago
- Spatial Transformer Network YOLO Model for Agricultural Object Detection☆18Sep 18, 2024Updated last year
- Implementation of PFLD(Paper: "A Practical Facial Landmark Detector") by pytorch.☆15Feb 16, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆24Mar 1, 2023Updated 3 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆16Jun 14, 2023Updated 2 years ago
- Deploy YOLACT++ with onnxruntime and C++ API☆14Jun 1, 2021Updated 4 years ago
- Repository for the IEEE/ACM TASLP 2023 Paper "Zero-Note Samba: Self-Supervised Beat Tracking".☆29Jul 25, 2023Updated 2 years ago
- ☆46Oct 28, 2025Updated 6 months ago
- [ICML'24] Creative Text-to-Audio Generation via Synthesizer Programming☆40Sep 26, 2024Updated last year
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆153May 10, 2025Updated last year