Optimize QWen1.5 models with TensorRT-LLM
☆17May 14, 2024Updated last year
Alternatives and similar repositories for QWen1.5_TensorRT-LLM
Users that are interested in QWen1.5_TensorRT-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 6 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- A simplified implementation of RetinaNet from https://arxiv.org/pdf/1708.02002.pdf using TF2.0☆13Aug 5, 2020Updated 5 years ago
- Implementation of Neural Style Transfer on Video☆11Nov 6, 2018Updated 7 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆10Mar 8, 2026Updated 2 weeks ago
- our first ai.☆15Jul 2, 2017Updated 8 years ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Docear: An Academic Literature Suite for Searching, Organizing and Creating Academic Literature☆13Nov 1, 2012Updated 13 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.☆17Jul 24, 2025Updated 8 months ago
- ☆14Feb 27, 2021Updated 5 years ago
- This script implements the tensorflow1.x and keras model into a caffe inference model.☆14Mar 21, 2020Updated 6 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- PyTorch Implementation for CS229 Course Project - "Grammatical Error Correction using Neural Networks"☆10Dec 16, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- 如需体验TextIn文档解析,请访问 https://cc.co/16YSIy☆15Mar 4, 2025Updated last year
- tensorflow slim Implementation crnn☆16Feb 9, 2021Updated 5 years ago
- Summary of system papers/frameworks/codes/tools on training or serving large model☆57Dec 17, 2023Updated 2 years ago
- 一个简单的,由ChatGPT主导编写的api,使用简单的请求访问ChatRWKV☆15May 19, 2023Updated 2 years ago
- 🌳CED: Catalog Extraction from Documents☆16Jul 30, 2023Updated 2 years ago
- 利用GPT2实现的闲聊模型☆12Apr 22, 2021Updated 4 years ago
- Explained QNNPACK Implementation☆21Sep 20, 2025Updated 6 months ago
- Video-to-video style transfer using convolutional neural networks☆27Dec 8, 2016Updated 9 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- FinRAG: Financial Retrieval Augmented Generation☆39Aug 28, 2024Updated last year
- ☆28Nov 6, 2024Updated last year
- ☆32Apr 8, 2025Updated 11 months ago
- Convert Keras models into Caffe models (within reason)☆21Sep 15, 2017Updated 8 years ago
- ☆26Jul 7, 2021Updated 4 years ago
- 大模型推理压测☆46Jul 31, 2025Updated 7 months ago
- Pytorch implementation of YOLO v1 from scratch☆13May 21, 2024Updated last year
- ☆27Aug 5, 2022Updated 3 years ago
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆22Oct 3, 2023Updated 2 years ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated 10 months ago
- An easy API for making Event Source requests, with all the features of fetch(), Supports browsers and node.js☆29Jan 31, 2026Updated last month
- Viscacha:通用信息抽取数据集收集☆27Feb 21, 2024Updated 2 years ago
- a sample code for utilizing torch.distributed☆21Aug 25, 2020Updated 5 years ago
- CenterNet's inference.(C++)/基于CenterNet的旋转目标检测C++版☆28Dec 28, 2020Updated 5 years ago
- YOLO-Mark is not a good tool to use,So we use the tool labelme to get the things for YOLO to train,☆21Dec 3, 2020Updated 5 years ago