basetenlabs / truss-examples
Examples of models deployable with Truss
☆120Updated this week
Related projects: ⓘ
- Gradio UI for a Cog API☆62Updated 5 months ago
- ☆101Updated 6 months ago
- 🐳 | Dockerfiles for the RunPod container images used for our official templates.☆141Updated 3 weeks ago
- All the world is a play, we are but actors in it.☆46Updated 2 months ago
- WIP - Allows you to create DSPy pipelines using ComfyUI☆170Updated last month
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆222Updated this week
- ☆95Updated this week
- 🐍 | Python library for RunPod API and serverless worker SDK.☆172Updated this week
- Easily view and modify JSON datasets for large language models☆55Updated this week
- GPT-4 Level Conversational QA Trained In a Few Hours☆53Updated last month
- ⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol☆133Updated last week
- ☆133Updated 9 months ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆90Updated 7 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆84Updated 2 months ago
- Python client library for improving your LLM app accuracy☆94Updated this week
- ☆76Updated 6 months ago
- auto fine tune of models with synthetic data☆71Updated 7 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.☆125Updated 3 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆109Updated 3 months ago
- ☆201Updated 7 months ago
- Pipeline is an open source python SDK for building AI/ML workflows☆124Updated this week
- A curated list of amazing RunPod projects, libraries, and resources☆98Updated last month
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- automatically quant GGUF models☆119Updated this week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 4 months ago
- ☆64Updated 3 months ago
- A collection of cog models for use on Replicate☆23Updated 8 months ago
- Community ComfyUI workflows running on fal.ai☆53Updated 3 weeks ago
- ☆144Updated 2 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆52Updated 11 months ago