guoguo1314 / llama3_learn.cView external linksLinks
Inference deployment of the llama3
☆11Apr 21, 2024Updated last year
Alternatives and similar repositories for llama3_learn.c
Users that are interested in llama3_learn.c are comparing it to the libraries listed below
Sorting:
- ☆20Dec 29, 2023Updated 2 years ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Updated this week
- ☆10Jul 18, 2024Updated last year
- 纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆12Jan 30, 2026Updated 2 weeks ago
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆35Jul 14, 2025Updated 7 months ago
- A one-page-only CGraph-API-liked DAG project.☆26Feb 11, 2025Updated last year
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- Flash Attention in ~100 lines of CUDA (forward pass only)☆11Jun 10, 2024Updated last year
- paper-read-notes☆13Sep 26, 2024Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- flux1非官方的量化模型(flux1 unofficial quantize model)☆12Aug 14, 2024Updated last year
- Implementation of a histogram equalization program using CUDA. Histogram equalization is a technique for adjusting image intensities to e…☆13Jan 3, 2021Updated 5 years ago
- Recording models☆12Sep 19, 2023Updated 2 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated last year
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- pure go for rwkv☆19Dec 31, 2023Updated 2 years ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- ☆17Apr 29, 2024Updated last year
- segmentation algorithm yolact use tensorrt deploy☆14May 7, 2022Updated 3 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 2 months ago
- ☆11Feb 6, 2026Updated last week
- 一个轻量化的大模型推理框架☆21May 26, 2025Updated 8 months ago
- 瑞芯微芯片的rknn推理框架部署(yolo模型)☆14Jul 17, 2025Updated 6 months ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 7 months ago
- stable diffusion using mnn☆67Sep 28, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- LLM inference in C/C++☆21Mar 22, 2025Updated 10 months ago
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- 高性能 高精度 大陆车牌、港澳车牌、台湾车牌 韩国车牌(South Korea LPR)识别 代码开源(ncnn移植)☆41Nov 5, 2025Updated 3 months ago
- minimal C implementation of speculative decoding based on llama2.c☆25Jul 15, 2024Updated last year
- qwen2 and llama3 cpp implementation☆49Jun 7, 2024Updated last year
- Awesome code, projects, books, etc. related to CUDA☆30Feb 3, 2026Updated last week
- RKNN-YOLOV5-BatchInference-MultiThreadingYOLOV5多张图片多线程C++推理☆22Nov 6, 2023Updated 2 years ago
- ☆26Nov 21, 2024Updated last year
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆26Jul 26, 2025Updated 6 months ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated last year
- RKNN模型推理部署模板☆24Aug 11, 2023Updated 2 years ago