Llama3 Streaming Chat Sample
☆22Apr 24, 2024Updated 2 years ago
Alternatives and similar repositories for llama3_chat
Users that are interested in llama3_chat are comparing it to the libraries listed below.
- An onnx-based quantization tool.☆71Jan 8, 2024Updated 2 years ago
- ☆26Nov 21, 2024Updated last year
- Using pattern matcher in onnx model to match and replace subgraphs.☆81Feb 7, 2024Updated 2 years ago
- ☆30Nov 16, 2024Updated last year
- 10000 fps 🚀 for 360 Surround-View CUDA Solution☆152Dec 23, 2023Updated 2 years ago
- nerf☆41Aug 1, 2022Updated 3 years ago
- Accelerating SAHI-based inference on YOLO models using TensorRT.☆101Jan 6, 2026Updated 4 months ago
- async inference for machine learning model☆26Sep 21, 2022Updated 3 years ago
- A work assistant for 24-hour live streaming☆13May 18, 2025Updated last year
- ☆20Dec 29, 2023Updated 2 years ago
- README.md☆48Sep 21, 2023Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- YOLOv5 on Orin DLA☆225Feb 18, 2024Updated 2 years ago
- A plugin-oriented framework for video structuring. Developers in China: add WeChat zhzhi78 to join the discussion group.☆18May 28, 2024Updated 2 years ago
- Learning a variety of topics by following Tensorrt_pro.☆39Nov 25, 2022Updated 3 years ago
- A simple tool that can generate TensorRT plugin code quickly.☆241Jul 11, 2023Updated 2 years ago
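- This repository deploys the Nanodet detection algorithm under the OpenVINO inference framework, with rewritten pre-processing and post-processing for very high performance and much faster detection on Intel CPU platforms. The model is also quantized (PTQ) to int8 precision with the NNCF and PPQ tools for even faster inference.☆16Jun 14, 2023Updated 2 years ago
- A collection of saved code snippets.☆13Jun 6, 2023Updated 2 years ago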
- ☆119Aug 1, 2024Updated last year
- ☆48Mar 27, 2023Updated 3 years ago
- ☆79May 16, 2023Updated 3 years ago
- ☆57Aug 21, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated 2 years ago
- This repository gives a guideline to learn CUDA and TensorRT from the beginning.☆345Feb 17, 2025Updated last year
- This repository is based on shouxieai/tensorRT_Pro, with adjustments to support YOLOv8.☆410Updated this week
- LLM deployment project based on ONNX.☆49Oct 9, 2024Updated last year
- A curl-based MinIO C++ SDK that supports upload, download, bucket creation, bucket queries, and similar operations. Simple and easy to use.☆49Dec 14, 2021Updated 4 years ago
- ☆13Dec 28, 2021Updated 4 years ago
- This is a repository to practice multi-thread programming in C++☆30Feb 21, 2024Updated 2 years ago
- ☆13Oct 5, 2023Updated 2 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated 2 years ago
- Refactored nerf code that is easier to read.☆13Mar 26, 2023Updated 3 years ago
- ☆114Mar 11, 2024Updated 2 years ago
- ☆14Apr 18, 2023Updated 3 years ago
- This code accompanies the Bilibili video https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7.☆70Oct 7, 2023Updated 2 years ago
- A CUDA runtime environment based on the CUDA Driver API.☆16Jul 30, 2025Updated 9 months ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- yolo model qat and deploy with deepstream&tensorrt☆603Sep 25, 2024Updated last year
- A new TensorRT integration that makes it easy to integrate many tasks.☆453Apr 2, 2023Updated 3 years ago