Optimize QWen1.5 models with TensorRT-LLM
☆17May 14, 2024Updated last year
Alternatives and similar repositories for QWen1.5_TensorRT-LLM
Users that are interested in QWen1.5_TensorRT-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pose estimation code with deepstream and yolo-pose☆13Oct 14, 2022Updated 3 years ago
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 6 years ago
- Multi-Cluster application progressive delivery controller☆21Mar 23, 2026Updated 3 weeks ago
- ☆14Sep 6, 2024Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of Neural Style Transfer on Video☆11Nov 6, 2018Updated 7 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- ☆10Mar 8, 2026Updated last month
- A website created with Django2.0.1☆12Dec 8, 2022Updated 3 years ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- this repo attemps to reproduce DSOD: Learning Deeply Supervised Object Detectors from Scratch use gluon reimplementation☆14Aug 18, 2018Updated 7 years ago
- ☆12Jul 19, 2019Updated 6 years ago
- Docear: An Academic Literature Suite for Searching, Organizing and Creating Academic Literature☆13Nov 1, 2012Updated 13 years ago
- Original PyTorch Implementation for the EMNLP 2023 Paper "Beyond Detection: A Defend-and-Summarize Strategy for Robust and Interpretable …☆16Dec 14, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This script implements the tensorflow1.x and keras model into a caffe inference model.☆14Mar 21, 2020Updated 6 years ago
- PyTorch Implementation for CS229 Course Project - "Grammatical Error Correction using Neural Networks"☆10Dec 16, 2017Updated 8 years ago
- a finance MCP tool☆44Feb 24, 2026Updated last month
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- laravel 中国地图web Api集合☆13Apr 27, 2023Updated 2 years ago
- tensorflow slim Implementation crnn☆16Feb 9, 2021Updated 5 years ago
- 🌳CED: Catalog Extraction from Documents☆16Jul 30, 2023Updated 2 years ago
- 利用GPT2实现的闲聊模型☆12Apr 22, 2021Updated 4 years ago
- Explained QNNPACK Implementation☆21Sep 20, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- interesting application of deep learning☆17Feb 16, 2020Updated 6 years ago
- Semantics-guided Part Attention Network (ECCV 2020 Oral)☆26Jun 8, 2021Updated 4 years ago
- LangChain V0.2 官网文档中文翻译☆37Jun 1, 2024Updated last year
- Video-to-video style transfer using convolutional neural networks☆27Dec 8, 2016Updated 9 years ago
- ☆21May 22, 2023Updated 2 years ago
- ☆28Nov 6, 2024Updated last year
- ☆32Apr 8, 2025Updated last year
- Convert Keras models into Caffe models (within reason)☆21Sep 15, 2017Updated 8 years ago
- Code of the VFL part proposed in the paper: Variational Representation Learning for Vehicle Re-Identification (IEEE ICIP)☆25Oct 22, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- FinRAG: Financial Retrieval Augmented Generation☆41Aug 28, 2024Updated last year
- ☆26Jul 7, 2021Updated 4 years ago
- 大模型推理压测☆47Jul 31, 2025Updated 8 months ago
- ☆28Sep 7, 2018Updated 7 years ago
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- python等客户端和web端通过WebRTC交互☆23Sep 26, 2023Updated 2 years ago
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"☆26Feb 2, 2024Updated 2 years ago