aishwaryaprabhat / BigBertha
BigBertha is an architecture design that demonstrates how automated LLMOps (Large Language Models Operations) can be achieved on any Kubernetes cluster using open source container-native technologies π
β27Updated last year
Alternatives and similar repositories for BigBertha:
Users that are interested in BigBertha are comparing it to the libraries listed below
- Using LlamaIndex with Ray for productionizing LLM applicationsβ71Updated last year
- JupyterLab extension to provide a Kubeflow specific left area for Notebooks deploymentβ18Updated 4 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated 11 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipelineβ36Updated last year
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystackβ142Updated this week
- β19Updated 5 months ago
- β27Updated 7 months ago
- Self-host LLMs with vLLM and BentoMLβ97Updated last week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ23Updated last year
- Leverage your LangChain trace data for fine tuningβ41Updated 8 months ago
- β76Updated 9 months ago
- SCIPE is a powerful tool for evaluating and diagnosing LLM (Large Language Model) graphs or chains.β21Updated 4 months ago
- β78Updated 10 months ago
- Build reliable, secure, and production-ready AI apps easily.β70Updated this week
- Tuning the Finetuning: An exploration of achieving success with QLoRAβ43Updated 10 months ago
- β16Updated 10 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models accessβ¦β114Updated last year
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS toolingβ129Updated 5 months ago
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-β¦β90Updated last month
- β32Updated 2 months ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelinesβ31Updated last year
- Dynamic Metadata based RAG Frameworkβ72Updated 8 months ago
- Handout for a talk I gave about LLM and CLI toolsβ62Updated 9 months ago
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.β35Updated 3 years ago
- Research notes and extra resources for all the work at explodinggradients.comβ23Updated 3 weeks ago
- β20Updated 2 years ago
- π§ͺ Experimental features for Haystackβ41Updated this week
- Repository for open inference protocol specificationβ52Updated 8 months ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unlβ¦β33Updated last month
- β69Updated 9 months ago