substratusai / helmLinks
β18Updated 10 months ago
Alternatives and similar repositories for helm
Users that are interested in helm are comparing it to the libraries listed below
Sorting:
- β62Updated 2 months ago
- π A deep-dive into HyDE for Advanced LLM RAG + π‘ Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coveraβ¦β32Updated last year
- Helm charts to deploy Weaviate to k8sβ59Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIβ45Updated 8 months ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEndβ26Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- β66Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 7 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systemsβ10Updated last year
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β105Updated 4 months ago
- β77Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated last month
- π‘ Deploy AI models and apps to Kubernetes without developing a herniaβ32Updated last year
- β19Updated 8 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ106Updated 2 months ago
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ38Updated last year
- β19Updated last year
- Writing Blog Posts with Generative Feedback Loops!β48Updated last year
- Chat Markup Language conversation libraryβ55Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β70Updated 7 months ago
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- β18Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 11 months ago
- π€ Trade any tensors over the networkβ30Updated last year
- Small python package to measure OCR quality and other related metrics.β23Updated last year
- A framework for evaluating function calls made by LLMsβ37Updated 11 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Largβ¦β23Updated 3 months ago
- Self-host LLMs with vLLM and BentoMLβ123Updated this week
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- Knowledge Graph Generator appβ31Updated last year