Llama3 Streaming Chat Sample
☆22Apr 24, 2024Updated last year
Alternatives and similar repositories for llama3_chat
Users that are interested in llama3_chat are comparing it to the libraries listed below.
- An onnx-based quantization tool.☆71Jan 8, 2024Updated 2 years ago
- ☆26Nov 21, 2024Updated last year
- Using pattern matcher in onnx model to match and replace subgraphs.☆81Feb 7, 2024Updated 2 years ago
- ☆30Nov 16, 2024Updated last year
- 10000 fps 🚀 for 360 Surround-View CUDA Solution☆146Dec 23, 2023Updated 2 years ago
- nerf☆41Aug 1, 2022Updated 3 years ago
- Accelerating SAHI-based inference on YOLO models using TensorRT.☆95Jan 6, 2026Updated 2 months ago
- async inference for machine learning model☆26Sep 21, 2022Updated 3 years ago
- A work assistant for 24-hour live streaming☆13May 18, 2025Updated 10 months ago
- ☆20Dec 29, 2023Updated 2 years ago
- README.md☆48Sep 21, 2023Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- YOLOv5 on Orin DLA☆223Feb 18, 2024Updated 2 years ago
- A plugin-oriented framework for video structuring. Developers in China: add WeChat zhzhi78 to join the discussion group.☆18May 28, 2024Updated last year
- Learning a range of topics by following Tensorrt_pro☆39Nov 25, 2022Updated 3 years ago
- A simple tool that can generate TensorRT plugin code quickly.☆240Jul 11, 2023Updated 2 years ago
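- This repository deploys the Nanodet detection algorithm under the OpenVINO inference framework, with rewritten pre-processing and post-processing for very high performance, so detection on Intel CPU platforms takes off. The model is also quantized (PTQ) to int8 precision with the NNCF and PPQ tools for even faster inference.☆16Jun 14, 2023Updated 2 years ago
- A collection of useful code snippets☆13Jun 6, 2023Updated 2 years ago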
- ☆118Aug 1, 2024Updated last year
- ☆48Mar 27, 2023Updated 3 years ago
- ☆79May 16, 2023Updated 2 years ago
- ☆57Aug 21, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- This repository gives a guideline to learn CUDA and TensorRT from the beginning.☆327Feb 17, 2025Updated last year
- This repository is based on shouxieai/tensorRT_Pro, with adjustments to support YOLOv8.☆406Jan 15, 2026Updated 2 months ago
- llm deploy project based onnx.☆50Oct 9, 2024Updated last year
- ☆13Dec 28, 2021Updated 4 years ago
- A curl-based minio cpp sdk that implements upload, download, bucket creation, bucket queries, and other operations. Simple and easy to use.☆49Dec 14, 2021Updated 4 years ago
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated 2 years ago
- ☆13Oct 5, 2023Updated 2 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated last year
- Refactored nerf code that is easier to read☆13Mar 26, 2023Updated 3 years ago
- ☆114Mar 11, 2024Updated 2 years ago
- ☆14Apr 18, 2023Updated 2 years ago
- This code accompanies the Bilibili video https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7.☆70Oct 7, 2023Updated 2 years ago
- A cuda runtime environment based on the CUDA Driver API☆15Jul 30, 2025Updated 7 months ago
- yolo model qat and deploy with deepstream&tensorrt☆592Sep 25, 2024Updated last year
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- A new tensorrt integration that makes it easy to integrate many tasks☆451Apr 2, 2023Updated 2 years ago