substratusai / helm
☆18 Updated 9 months ago
Alternatives and similar repositories for helm
Users who are interested in helm are comparing it to the repositories listed below
- ☆60 Updated 2 months ago
- Deploy AI models and apps to Kubernetes without developing a hernia ☆32 Updated last year
- ☆19 Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 7 months ago
- ☆66 Updated last year
- Chat Markup Language conversation library ☆55 Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 Updated last month
- ☆19 Updated last year
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆70 Updated 7 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated last month
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated last year
- ☆12 Updated 10 months ago
- Knowledge Graph Generator app ☆31 Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 Updated 8 months ago
- Inference server benchmarking tool ☆67 Updated last month
- vLLM adapter for a TGIS-compatible gRPC server. ☆30 Updated this week
- The backend behind the LLM-Perf Leaderboard ☆10 Updated last year
- Helm charts to deploy Weaviate to k8s ☆60 Updated 3 weeks ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd ☆26 Updated last year
- ☆77 Updated 11 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 Updated 9 months ago
- A framework for evaluating function calls made by LLMs ☆37 Updated 10 months ago
- A high performance batching router optimises max throughput for text inference workload ☆16 Updated last year
- 🪶 Lightweight OpenAI drop-in replacement for Kubernetes ☆145 Updated last year
- ☆101 Updated 9 months ago
- ☆215 Updated this week
- Writing Blog Posts with Generative Feedback Loops! ☆48 Updated last year
- ☆36 Updated 3 months ago