substratusai / helmLinks
β18Updated last year
Alternatives and similar repositories for helm
Users that are interested in helm are comparing it to the libraries listed below
Sorting:
- β66Updated 8 months ago
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ33Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 8 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β128Updated 10 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ62Updated 3 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ115Updated 8 months ago
- Self-host LLMs with vLLM and BentoMLβ161Updated 3 weeks ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β78Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β68Updated last month
- A high performance batching router optimises max throughput for text inference workloadβ16Updated 2 years ago
- Open Weight, tool-calling LLMsβ155Updated last year
- Simple examples using Argilla tools to build AIβ56Updated last year
- β21Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIβ46Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ38Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- β19Updated last year
- Dynamic Metadata based RAG Frameworkβ78Updated last week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)β89Updated 2 weeks ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Modelsβ22Updated last year
- The backend behind the LLM-Perf Leaderboardβ11Updated last year
- vLLM adapter for a TGIS-compatible gRPC server.β45Updated this week
- GPT-4 Level Conversational QA Trained In a Few Hoursβ66Updated last year
- ScalarLM - a unified training and inference stackβ93Updated last month
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated 2 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β88Updated last month
- β31Updated 11 months ago
- π Unstructured Data Connectors for Haystack 2.0β17Updated 2 years ago
- Routing on Random Forest (RoRF)β229Updated last year