nguyenthai-duong / Deploying-RAG-on-Kubernetes-with-Jenkins-for-Legal-Document-RetrievalLinks
Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring
☆71Updated 5 months ago
Alternatives and similar repositories for Deploying-RAG-on-Kubernetes-with-Jenkins-for-Legal-Document-Retrieval
Users that are interested in Deploying-RAG-on-Kubernetes-with-Jenkins-for-Legal-Document-Retrieval are comparing it to the libraries listed below
Sorting:
- ☆69Updated last year
- ☆58Updated last year
- To simplify and streamline LLM operations, empowering developers and organizations to harness the full potential of large language models…☆131Updated last year
- MLOPs human pose estimation end-to-end.☆37Updated last year
- ☆71Updated 2 years ago
- ☆33Updated last year
- ☆28Updated 2 years ago
- ☆23Updated last year
- ☆27Updated last year
- Building a highly scalable Machine Learning System☆27Updated last year
- ☆107Updated 2 years ago
- ☆32Updated 2 years ago
- ☆55Updated 10 months ago
- MLOps for Image Caption Generator.☆25Updated 2 years ago
- ☆67Updated last year
- Comprehensive tools for building (Retrieval Augmented Generation) RAG chatbots.☆82Updated last year
- This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi…☆38Updated 7 months ago
- ☆17Updated 2 years ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆68Updated 2 years ago
- Scalable, cloud-native recommender system with end-to-end MLOps for building, training, and deploying models in research and production☆52Updated 6 months ago
- ☆64Updated last year
- ☆32Updated 2 years ago
- ☆49Updated last year
- ☆30Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- ☆11Updated 2 years ago
- MLOps Platform for MLOps Crash Course☆43Updated 3 years ago
- ☆46Updated 2 years ago
- A turnkey MLOps pipeline demonstrating how to go from raw events to real-time predictions at scale.☆232Updated 3 months ago
- ☆56Updated last year