Optimize QWen1.5 models with TensorRT-LLM
☆17May 14, 2024Updated last year
Alternatives and similar repositories for QWen1.5_TensorRT-LLM
Users that are interested in QWen1.5_TensorRT-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pose estimation code with deepstream and yolo-pose☆13Oct 14, 2022Updated 3 years ago
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 6 years ago
- ☆14Sep 6, 2024Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- A simplified implementation of RetinaNet from https://arxiv.org/pdf/1708.02002.pdf using TF2.0☆13Aug 5, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementation of Neural Style Transfer on Video☆10Nov 6, 2018Updated 7 years ago
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Mar 22, 2018Updated 8 years ago
- our first ai.☆15Jul 2, 2017Updated 8 years ago
- this repo attemps to reproduce DSOD: Learning Deeply Supervised Object Detectors from Scratch use gluon reimplementation☆14Aug 18, 2018Updated 7 years ago
- ☆12Jul 19, 2019Updated 6 years ago
- ☆621Jul 31, 2024Updated last year
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …☆16Dec 14, 2023Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.☆17Jul 24, 2025Updated 9 months ago
- This script implements the tensorflow1.x and keras model into a caffe inference model.☆14Mar 21, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- PyTorch Implementation for CS229 Course Project - "Grammatical Error Correction using Neural Networks"☆10Dec 16, 2017Updated 8 years ago
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- laravel 中国地图web Api集合☆13Apr 27, 2023Updated 3 years ago
- 如需体验TextIn文档解析,请访问 https://cc.co/16YSIy☆15Mar 4, 2025Updated last year
- LiteMind 是基于 Java 21 + Spring Boot 3 + Spring AI + RAG + Tool Calling + MCP 构建的通用 AI 智能体。支持多轮对话、记忆持久化和 RAG 知识库检索,基于 ReAct 智能体工作模式,具备自主思考能…☆52Dec 26, 2025Updated 4 months ago
- Summary of system papers/frameworks/codes/tools on training or serving large model☆57Dec 17, 2023Updated 2 years ago
- 🌳CED: Catalog Extraction from Documents☆16Jul 30, 2023Updated 2 years ago
- 利用GPT2实现的闲聊模型☆12Apr 22, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Explained QNNPACK Implementation☆21Sep 20, 2025Updated 7 months ago
- Vision Longformer For Object Detection☆34May 17, 2021Updated 4 years ago
- ☆28Nov 6, 2024Updated last year
- ☆32Apr 8, 2025Updated last year
- Your Coursera Helper☆71Jun 23, 2015Updated 10 years ago
- Convert Keras models into Caffe models (within reason)☆21Sep 15, 2017Updated 8 years ago
- Code of the VFL part proposed in the paper: Variational Representation Learning for Vehicle Re-Identification (IEEE ICIP)☆25Oct 22, 2019Updated 6 years ago
- FinRAG: Financial Retrieval Augmented Generation☆42Aug 28, 2024Updated last year
- ☆26Jul 7, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 大模型推理压测☆47Jul 31, 2025Updated 9 months ago
- ☆28Sep 7, 2018Updated 7 years ago
- ☆27Aug 5, 2022Updated 3 years ago
- 检测透视图像中的矩形文档并对其进行矫正☆31Sep 16, 2022Updated 3 years ago
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- ☆23Oct 3, 2023Updated 2 years ago
- A simple Regular expression matcher☆17Oct 6, 2011Updated 14 years ago