Optimize QWen1.5 models with TensorRT-LLM
☆17May 14, 2024Updated 2 years ago
Alternatives and similar repositories for QWen1.5_TensorRT-LLM
Users that are interested in QWen1.5_TensorRT-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pose estimation code with deepstream and yolo-pose☆13Oct 14, 2022Updated 3 years ago
- Ultimate NLP Toolkit for GPUs: RAPIDS-AI, PyTorch, NeMo, Tensorboard, TensorRT, CUDA 10.1☆10Mar 19, 2020Updated 6 years ago
- Multi-Cluster application progressive delivery controller☆21May 13, 2026Updated last month
- ☆14Sep 6, 2024Updated last year
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Jun 3, 2023Updated 3 years ago
- ☆10Jun 7, 2026Updated last week
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- ☆12Jul 19, 2019Updated 6 years ago
- ☆619Jul 31, 2024Updated last year
- Docear: An Academic Literature Suite for Searching, Organizing and Creating Academic Literature☆13Nov 1, 2012Updated 13 years ago
- This script implements the tensorflow1.x and keras model into a caffe inference model.☆14Mar 21, 2020Updated 6 years ago
- 利用Weather Undegroung提供的内布拉斯加州林肯市2015年1月4日开始总计997天的气象数据,预测天气温度(多元回归)。 基于Pytorch☆19Nov 19, 2020Updated 5 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Some codes to trace KVM events using BPF☆22Mar 6, 2020Updated 6 years ago
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- laravel 中国地图web Api集合☆13Apr 27, 2023Updated 3 years ago
- tensorflow slim Implementation crnn☆16Feb 9, 2021Updated 5 years ago
- a finance MCP tool☆47Feb 24, 2026Updated 3 months ago
- Source for kusionstack.io☆18Oct 13, 2025Updated 8 months ago
- 🌳CED: Catalog Extraction from Documents☆16Jul 30, 2023Updated 2 years ago
- 利用GPT2实现的闲聊模型☆12Apr 22, 2021Updated 5 years ago
- Explained QNNPACK Implementation☆21Sep 20, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- mavlink2rest creates a REST server that provides mavlink information from a mavlink source☆16Mar 29, 2026Updated 2 months ago
- Semantics-guided Part Attention Network (ECCV 2020 Oral)☆26Jun 8, 2021Updated 5 years ago
- ☆21May 22, 2023Updated 3 years ago
- Vision Longformer For Object Detection☆34May 17, 2021Updated 5 years ago
- Crack Detection Based on Infrared thermography (IR)☆18Jun 26, 2024Updated last year
- ☆27Nov 6, 2024Updated last year
- ☆32Apr 8, 2025Updated last year
- Convert Keras models into Caffe models (within reason)☆21Sep 15, 2017Updated 8 years ago
- FinRAG: Financial Retrieval Augmented Generation☆43Aug 28, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spelling and grammatical error detection and correction using N-grams Model☆12Nov 12, 2018Updated 7 years ago
- ☆26Jul 7, 2021Updated 4 years ago
- 大模型推理压测☆48Jul 31, 2025Updated 10 months ago
- Pytorch implementation of YOLO v1 from scratch☆13May 21, 2024Updated 2 years ago
- ☆28Sep 7, 2018Updated 7 years ago
- ☆27Aug 5, 2022Updated 3 years ago
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago