Getting started with TensorRT-LLM using BLOOM as a case study
☆24Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for TensorRT-LLM-Tutorial
Users that are interested in TensorRT-LLM-Tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…☆28Mar 1, 2024Updated 2 years ago
- A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor☆35Jan 13, 2026Updated 2 months ago
- Chat language model that can interpret and execute functions/plugins☆14Oct 16, 2024Updated last year
- Code for HyperSeg and HyperSum☆16Jul 15, 2025Updated 8 months ago
- Few-shot text classification with meta learning and BERT☆11Jun 14, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A basic jupyterhub with Nvidia GPU accessibility.☆16Nov 4, 2024Updated last year
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Jul 2, 2024Updated last year
- Huggingface Implementation of AV-HuBERT on the MuAViC Dataset☆19Mar 6, 2025Updated last year
- Repo for paper: Controllable Text Generation with Language Constraints☆20Jun 20, 2023Updated 2 years ago
- ☆25Nov 27, 2023Updated 2 years ago
- Official repository for the paper "Development of a freely accessible deep learning platform for comprehensive chest X-ray reading: a ret…☆25Mar 18, 2026Updated 3 weeks ago
- LLM inference in C/C++☆111Updated this week
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations☆32Dec 21, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆26Nov 22, 2022Updated 3 years ago
- Comfy UI Workflows Created by Wonderflex☆68Mar 12, 2025Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆29Jul 24, 2023Updated 2 years ago
- This is the repository of our ACL 2024 paper "ESCoT: Towards Interpretable Emotional Support Dialogue Systems".☆39May 10, 2025Updated 11 months ago
- ☆42Oct 3, 2024Updated last year
- AAAI-2021 paper: Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders.☆39Jun 25, 2021Updated 4 years ago
- ☆76Mar 7, 2024Updated 2 years ago
- Awesome MLOps Course Outline☆36Dec 27, 2022Updated 3 years ago
- Scalable, cloud-native recommender system with end-to-end MLOps for building, training, and deploying models in research and production☆57Aug 5, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The Triton TensorRT-LLM Backend☆930Mar 17, 2026Updated 3 weeks ago
- [ACL 2024] CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling☆224May 21, 2025Updated 10 months ago
- Sample solution for MLOps Marathon 2023☆29Jun 25, 2023Updated 2 years ago
- Data and code used in the 2015 ACL paper, "Ground Truth for Grammatical Error Correction Metrics"☆55Dec 17, 2017Updated 8 years ago
- Mixed precision inference by Tensorrt-LLM☆80Oct 23, 2024Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆70Nov 17, 2025Updated 4 months ago
- Frontier self improving AI intern / coworker☆44Updated this week
- ☆12Jan 20, 2026Updated 2 months ago
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38May 23, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.☆187Mar 23, 2026Updated 2 weeks ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆21Updated this week
- MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction☆94Oct 29, 2024Updated last year
- 🍳🚀 CookFast is a free AI tool that writes essential product documents (like Requirements Docs & Application Flows) from your idea, help…☆14Dec 19, 2025Updated 3 months ago
- AI Search engine☆13Sep 24, 2025Updated 6 months ago
- ComfyUI Workflows☆10Sep 27, 2025Updated 6 months ago