substratusai / helm
β18Updated 5 months ago
Alternatives and similar repositories for helm:
Users that are interested in helm are comparing it to the libraries listed below
- β52Updated last month
- vLLM adapter for a TGIS-compatible gRPC server.β19Updated this week
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ32Updated 8 months ago
- Self-host LLMs with vLLM and BentoMLβ86Updated this week
- β157Updated last week
- Routing on Random Forest (RoRF)β112Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β64Updated 3 months ago
- Helm charts to deploy Weaviate to k8sβ58Updated this week
- Explore the use of DSPy for extracting features from PDFs πβ38Updated 11 months ago
- Zero-trust AI APIs for easy and private consumption of open-source LLMsβ38Updated 6 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ59Updated last month
- β65Updated 8 months ago
- Writing Blog Posts with Generative Feedback Loops!β47Updated 10 months ago
- Repository hosting Langchain helm charts.β44Updated this week
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- π Unstructured Data Connectors for Haystack 2.0β16Updated last year
- This repository is a combination of llama workflows and agents together which is a powerful concept.β17Updated 6 months ago
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β59Updated 3 months ago
- Open Weight, tool-calling LLMsβ151Updated 3 months ago
- SCIPE is a powerful tool for evaluating and diagnosing LLM (Large Language Model) graphs or chains.β21Updated 3 months ago
- This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augmβ¦β25Updated last month
- Generate Tools and Toolkits from any Python SDK -- no extra code requiredβ50Updated 3 months ago
- Chat Markup Language conversation libraryβ55Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIβ44Updated 4 months ago
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated last year
- Quickly and securely turn any Linux box into a build and deployment assistantβ24Updated 4 months ago
- The backend behind the LLM-Perf Leaderboardβ10Updated 9 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systemsβ10Updated last year