hopef / llama3_chatView external linksLinks
Llama3 Streaming Chat Sample
☆22Apr 24, 2024Updated last year
Alternatives and similar repositories for llama3_chat
Users that are interested in llama3_chat are comparing it to the libraries listed below
Sorting:
- An onnx-based quantitation tool.☆71Jan 8, 2024Updated 2 years ago
- ☆26Nov 21, 2024Updated last year
- ☆30Nov 16, 2024Updated last year
- nerf☆41Aug 1, 2022Updated 3 years ago
- Using pattern matcher in onnx model to match and replace subgraphs.☆81Feb 7, 2024Updated 2 years ago
- 10000 fps 🚀 for 360 Surround-View CUDA Solution☆145Dec 23, 2023Updated 2 years ago
- ☆20Dec 29, 2023Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Accelerating SAHI-based inference on YOLO models using TensorRT.☆92Jan 6, 2026Updated last month
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- 跟着Tensorrt_pro学习各种知识☆40Nov 25, 2022Updated 3 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆16Jun 14, 2023Updated 2 years ago
- ☆47Mar 27, 2023Updated 2 years ago
- async inference for machine learning model☆26Sep 21, 2022Updated 3 years ago
- README.md☆48Sep 21, 2023Updated 2 years ago
- YOLOv5 on Orin DLA☆221Feb 18, 2024Updated last year
- a plugin-oriented framework for video structured. 国产程序员请加微信zhzhi78拉群交流。☆18May 28, 2024Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- ☆79May 16, 2023Updated 2 years ago
- A work assistant for 24-hour live streaming☆13May 18, 2025Updated 8 months ago
- ☆57Aug 21, 2023Updated 2 years ago
- 重构nerf代码,更加容易读懂☆13Mar 26, 2023Updated 2 years ago
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- This repository give a guidline to learn CUDA and TensorRT from the beginning.☆319Feb 17, 2025Updated 11 months ago
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- ☆13Oct 5, 2023Updated 2 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆11Jun 10, 2024Updated last year
- A simple tool that can generate TensorRT plugin code quickly.☆240Jul 11, 2023Updated 2 years ago
- 基于curl的minio cpp sdk,实现上传下载和创建bucket,查询bucket等操作。简单好用☆49Dec 14, 2021Updated 4 years ago
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated last year
- Implementation of a histogram equalization program using CUDA. Histogram equalization is a technique for adjusting image intensities to e…☆13Jan 3, 2021Updated 5 years ago
- paper-read-notes☆13Sep 26, 2024Updated last year
- yolov7-pose end2end TRT实现☆27Sep 8, 2022Updated 3 years ago
- ☆13Dec 28, 2021Updated 4 years ago
- TensorRT for RefineNet Segmentation☆12Apr 27, 2021Updated 4 years ago
- ☆14Apr 18, 2023Updated 2 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated last year
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 2 years ago
- ☆13Aug 11, 2022Updated 3 years ago