substratusai / helm
β18Updated 8 months ago
Alternatives and similar repositories for helm:
Users that are interested in helm are comparing it to the libraries listed below
- β59Updated last month
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ32Updated 11 months ago
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β67Updated 6 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β93Updated 2 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β67Updated this week
- β20Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated last month
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systemsβ10Updated last year
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ38Updated last year
- Self-host LLMs with vLLM and BentoMLβ109Updated this week
- β14Updated 7 months ago
- Explore the use of DSPy for extracting features from PDFs πβ39Updated last year
- πͺΆ Lightweight OpenAI drop-in replacement for Kubernetesβ144Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 6 months ago
- β12Updated 10 months ago
- Tutorial for DSPyβ23Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Largβ¦β22Updated 2 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated this week
- β10Updated 7 months ago
- β16Updated 11 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipelineβ37Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIβ45Updated 7 months ago
- β36Updated 3 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabilβ¦β29Updated last year
- β66Updated 11 months ago
- π A deep-dive into HyDE for Advanced LLM RAG + π‘ Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coveraβ¦β32Updated last year
- β23Updated 5 months ago
- Quickly and securely turn any Linux box into a build and deployment assistantβ24Updated 7 months ago
- β19Updated 6 months ago