tensorchord / openmodelz
One-click machine learning deployment (LLM, text-to-image and so on) at scale on any cluster (GCP, AWS, Lambda labs, your home lab, or even a single machine).
β239Updated last year
Related projects β
Alternatives and complementary repositories for openmodelz
- OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)β265Updated last year
- ChatData π π brings RAG to real applications with FREEβ¨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milliβ¦β155Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystemβ129Updated 3 weeks ago
- A diverse, simple, and secure all-in-one LLMOps platformβ85Updated last month
- https://TiDB.AI is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage and LlamaIndex. Opeβ¦β199Updated this week
- Open-source observability for your LLM application.β43Updated 2 weeks ago
- A lightweight version of Milvusβ276Updated 2 weeks ago
- EvalGPT is an code interpreter framework that utilizes large language models to automate the process of code-writing and execution, delivβ¦β250Updated last year
- Self-hosted huggingface mirror service.β76Updated this week
- Turn any OCR models into online inference API endpoint π πβ50Updated last year
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.β53Updated 7 months ago
- an MLOps/LLMOps platformβ208Updated 3 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Rayβ101Updated last week
- π This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an LLM-friendly input with a simpβ¦β46Updated last month
- Examples on how to use LangChain and Rayβ220Updated last year
- Self-host LLMs with vLLM and BentoMLβ72Updated this week
- β205Updated this week
- Octogen is an Open-Source Code Interpreter Agent Frameworkβ253Updated 3 months ago
- β32Updated 9 months ago
- π An awesome & curated list of best LLMOps tools.β29Updated 3 weeks ago
- Model Deployment at Scale on Kubernetes π¦οΈβ789Updated 6 months ago
- Open Source Text Embedding Models with OpenAI Compatible APIβ131Updated 3 months ago
- Python client library for improving your LLM app accuracyβ96Updated this week
- a local implementation of OpenAI Assistants API: myla stands for MY Local Assistantβ49Updated 2 months ago
- π§ Pod-Helper: Real-time audio transcription and repair on consumer hardwareβ78Updated 8 months ago
- Manage GPU clusters for running LLMsβ551Updated this week
- Develop, evaluate and monitor LLM applications at scaleβ93Updated this week
- LLMPerf is a library for validating and benchmarking LLMsβ636Updated 2 months ago
- Redis Vector Library (RedisVL) interfaces with Redis' vector database for realtime semantic search, RAG, and recommendation systems.β229Updated this week
- VQLite - Simple and Lightweight Vector Search Engine based on Google ScaNNβ84Updated 3 months ago