basetenlabs / truss-examples
Examples of models deployable with Truss
β169Updated this week
Alternatives and similar repositories for truss-examples:
Users that are interested in truss-examples are comparing it to the libraries listed below
- π | Python library for RunPod API and serverless worker SDK.β221Updated 2 weeks ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.β307Updated this week
- β35Updated last year
- β153Updated 9 months ago
- β112Updated 4 months ago
- Gradio UI for a Cog APIβ67Updated last year
- Pipeline is an open source python SDK for building AI/ML workflowsβ132Updated 7 months ago
- β199Updated last year
- A curated list of amazing RunPod projects, libraries, and resourcesβ111Updated 8 months ago
- β130Updated last week
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.β132Updated last week
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β115Updated 11 months ago
- automatically quant GGUF modelsβ168Updated this week
- π³ | Dockerfiles for the RunPod container images used for our official templates.β178Updated 2 weeks ago
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β136Updated 9 months ago
- β196Updated 3 weeks ago
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backedβ128Updated 11 months ago
- Self-host LLMs with vLLM and BentoMLβ106Updated last week
- Low-Rank adapter extraction for fine-tuned transformers modelsβ171Updated 11 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAIβ223Updated 11 months ago
- Own your AI, search the web with itππβ84Updated 3 months ago
- β135Updated last year
- A simple Python sandbox for helpful LLM data agentsβ250Updated 10 months ago
- Starting point to build your own custom serverless endpointβ102Updated this week
- β28Updated last year
- β74Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attentionβ119Updated last year
- A toolkit for building computer use AI agentsβ158Updated this week
- β52Updated last week
- Cog inference for flux modelsβ343Updated last week