inferless / triton-co-pilot
Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments
☆19Updated 9 months ago
Alternatives and similar repositories for triton-co-pilot:
Users that are interested in triton-co-pilot are comparing it to the libraries listed below
- ☆13Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆15Updated 3 weeks ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 6 months ago
- NLP with Rust for Python 🦀🐍☆62Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆80Updated last month
- Basic Denoising Diffusion Probabilistic Model image generator implemented in PyTorch☆10Updated 3 months ago
- Because it's there.☆16Updated 7 months ago
- ☆12Updated 9 months ago
- First token cutoff sampling inference example☆30Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- ☆31Updated 10 months ago
- 👷 Build compute kernels☆35Updated this week
- ☆19Updated 6 months ago
- Enable moe for nanogpt.☆26Updated last year
- LLM Compression Benchmark☆21Updated 2 months ago
- ☆16Updated 3 months ago
- AI aware proxy☆18Updated 7 months ago
- ColBERT for live vector indexes☆24Updated 6 months ago
- Experimental wasm32-unknown-wasi runtime for Python code execution☆37Updated 5 months ago
- BYOeB is a tool to build a chatbot with a custom knowledge base and an expert-in-the-loop.☆24Updated 2 months ago
- A repository of PyTorch example☆9Updated 2 years ago
- ☆18Updated 7 months ago
- ☆23Updated this week
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆49Updated 3 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year