basetenlabs / truss-examples
Examples of models deployable with Truss
β161Updated this week
Alternatives and similar repositories for truss-examples:
Users that are interested in truss-examples are comparing it to the libraries listed below
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.β287Updated last week
- π | Python library for RunPod API and serverless worker SDK.β212Updated last month
- β111Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceβ91Updated 8 months ago
- A curated list of amazing RunPod projects, libraries, and resourcesβ107Updated 6 months ago
- β124Updated last month
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAIβ223Updated 10 months ago
- automatically quant GGUF modelsβ156Updated last week
- β152Updated 7 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β115Updated 9 months ago
- All the world is a play, we are but actors in it.β47Updated this week
- Beating the GAIA benchmark with Transformers Agents. πβ90Updated last week
- β199Updated last year
- Run inference on replit-3B code instruct model using CPUβ154Updated last year
- ASR + diarization model server with speculative decodingβ57Updated 9 months ago
- β77Updated 11 months ago
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backedβ128Updated 10 months ago
- π | A simple worker that can be used as a starting point to build your own custom RunPod Endpoint API worker.β97Updated 4 months ago
- β99Updated 6 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.β120Updated 5 months ago
- A fast batching API to serve LLM modelsβ180Updated 10 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.β226Updated 2 months ago
- Own your AI, search the web with itππβ80Updated last month
- Scripts to create your own moe models using mlxβ86Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.β120Updated 2 weeks ago
- VideoDB Python SDKβ63Updated this week
- Low-Rank adapter extraction for fine-tuned transformers modelsβ170Updated 10 months ago
- The code we currently use to fine-tune models.β113Updated 9 months ago
- An JS web client for connecting to Pipecat bots with voice and visionβ43Updated 2 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.β54Updated last year