aishwaryaprabhat / BigBerthaLinks
BigBertha is an architecture design that demonstrates how automated LLMOps (Large Language Models Operations) can be achieved on any Kubernetes cluster using open source container-native technologies π
β28Updated last year
Alternatives and similar repositories for BigBertha
Users that are interested in BigBertha are comparing it to the libraries listed below
Sorting:
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated 2 years ago
- β146Updated last week
- Chart for deploying ChromaDB in Kubernetesβ51Updated 4 months ago
- Flyte Documentation πβ83Updated 6 months ago
- Tools and utilities for operating Metaflow in productionβ62Updated last month
- Helm charts to deploy Weaviate to k8sβ63Updated last month
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector databaseβ57Updated last year
- Constrain LLM outputβ113Updated last year
- Leverage your LangChain trace data for fine tuningβ46Updated last year
- Product analytics for AI Assistantsβ153Updated 4 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profilβ¦β88Updated last year
- Examples on how to use LangChain and Rayβ229Updated 2 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- A collection of examples and tutorials for Qdrant vector search engineβ190Updated last week
- Adding NeMo Guardrails to a LlamaIndex RAG pipelineβ41Updated last year
- β50Updated last year
- Pebblo enables developers to safely load data and promote their Gen AI app to deploymentβ147Updated 3 months ago
- RAG orchestration framework β΅οΈβ201Updated 2 months ago
- Finetune LLMs on K8s by using Runbooksβ170Updated last year
- β62Updated 5 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging coβ¦β113Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β83Updated 9 months ago
- A repository that showcases how you can use ZenML with Gitβ70Updated 2 months ago
- β16Updated last year
- An LLM-powered Streamlit chatbot for data exploration and question answering on Snowflakeβ129Updated last year
- Repository hosting Langchain helm charts.β67Updated this week
- Curated examples and patterns for using Chalk. Use these to build your feature pipelines.β21Updated 2 months ago
- Chassis turns machine learning models into portable container images that can run just about anywhere.β86Updated last year
- β11Updated 2 years ago
- There are many articles that cover the principles of reducing latency optimization for LLMs, however it is often unclear how to actually β¦β30Updated last year