Getting started with TensorRT-LLM using BLOOM as a case study
☆25 · Mar 7, 2024 · Updated 2 years ago
Alternatives and similar repositories for TensorRT-LLM-Tutorial
Users that are interested in TensorRT-LLM-Tutorial are comparing it to the libraries listed below.
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial… · ☆28 · Mar 1, 2024 · Updated 2 years ago
- Chat language model that can interpret and execute functions/plugins · ☆14 · Oct 16, 2024 · Updated last year
- WangchanX Fine-tuning Pipeline · ☆46 · Oct 4, 2024 · Updated last year
- Repo for paper: Controllable Text Generation with Language Constraints · ☆20 · Jun 20, 2023 · Updated 2 years ago
- decrypts Ableton DRM-protected *.aif files · ☆17 · Sep 26, 2019 · Updated 6 years ago
- ☆25 · Nov 27, 2023 · Updated 2 years ago
- Convert Wiktionary entries to various formats such as StarDict or DB (MariaDB/MySQL). I'm dropping the database support for this new main… · ☆17 · Oct 5, 2025 · Updated 7 months ago
- LLM inference in C/C++ · ☆119 · May 13, 2026 · Updated last week
- POC for netdata ndsudo vulnerability - CVE-2024-32019 · ☆21 · Aug 3, 2025 · Updated 9 months ago
- A telegram bot that supports last.fm, libre.fm and listenbrainz.org · ☆20 · Feb 24, 2026 · Updated 2 months ago
- This is the repository of our ACL 2024 paper "ESCoT: Towards Interpretable Emotional Support Dialogue Systems". · ☆40 · May 10, 2025 · Updated last year
- ☆13 · Nov 22, 2022 · Updated 3 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25] · ☆16 · Sep 12, 2025 · Updated 8 months ago
- ☆76 · Mar 7, 2024 · Updated 2 years ago
- Setup an MCP server in 60 seconds. · ☆13 · Dec 12, 2024 · Updated last year
- ☆21 · Dec 2, 2025 · Updated 5 months ago
- Awesome MLOps Course Outline · ☆36 · Dec 27, 2022 · Updated 3 years ago
- Extract tracks of SACD ISO image · ☆24 · Jan 29, 2023 · Updated 3 years ago
- The Triton TensorRT-LLM Backend · ☆934 · May 7, 2026 · Updated last week
- Code for the paper LaM-SLidE - Latent Space Modeling of Spatial Dynamical Systems via Linked Entities · ☆26 · May 23, 2025 · Updated 11 months ago
- This is the repository of EMNLP'2022 paper: "Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy Planning"… · ☆50 · Feb 6, 2023 · Updated 3 years ago
- additions by balake/relar/pepopo · ☆24 · Apr 17, 2026 · Updated last month
- ☆25 · Jul 4, 2024 · Updated last year
- A WoW Classic Addon that shows you upcoming trainer abilities · ☆18 · Apr 25, 2026 · Updated 3 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ · ☆70 · Nov 17, 2025 · Updated 6 months ago
- ArchQ Linux for Audiophiles · ☆21 · Updated this week
- The book: Practice exams in Quantum Computing for graduate students. · ☆22 · May 21, 2022 · Updated 3 years ago
- ☆12 · Jan 20, 2026 · Updated 3 months ago
- Copilot with deepseek and more... · ☆13 · Mar 7, 2025 · Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily. · ☆190 · Mar 23, 2026 · Updated last month
- A simple Archlinux install script · ☆21 · May 6, 2026 · Updated 2 weeks ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp · ☆26 · Updated this week
- AI Search engine · ☆13 · Sep 24, 2025 · Updated 7 months ago
- Quick access to any large language model from your browser. · ☆10 · Feb 16, 2026 · Updated 3 months ago
- an auto-sleeping and -waking framework around llama.cpp · ☆12 · Feb 8, 2025 · Updated last year
- Delete cookies either on browser exit or when tabs are closed · ☆14 · Nov 16, 2025 · Updated 6 months ago
- SPAM filter rules for Stalwart Mail Server · ☆16 · Apr 13, 2026 · Updated last month
- ☆10 · Sep 29, 2024 · Updated last year
- Simple and powerful extension for searching web and viewing website content. · ☆14 · Apr 11, 2025 · Updated last year