MarvinSt / ray-docker-compose
☆23Updated last year
Alternatives and similar repositories for ray-docker-compose
Users that are interested in ray-docker-compose are comparing it to the libraries listed below
Sorting:
- Simple dependency injection framework for Python☆21Updated last year
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- MLFlow Deployment Plugin for Ray Serve☆44Updated 3 years ago
- Sentence Embedding as a Service☆15Updated last year
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- Demonstration of how to perform continuous model monitoring on CML using Model Metrics and Evidently.ai dashboards☆12Updated 5 months ago
- Article about deploying machine learning models using grpc, pytorch and asyncio☆28Updated 2 years ago
- ☆8Updated 2 years ago
- Search system on top of Elasticsearch, Kubeflow and Katib☆29Updated 2 years ago
- Ray provider for Apache Airflow☆48Updated last year
- A boilerplate to use multiprocessing for your gRPC server in your Python project☆25Updated 3 years ago
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Updated 2 months ago
- ☆17Updated last year
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- ☆14Updated last year
- SQLFlow client library for Python☆29Updated 2 years ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.☆33Updated 4 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Triton backend for managing the model state tensors automatically in sequence batcher☆17Updated last year
- PyCon Talks 2022 by Antoine Toubhans☆23Updated 2 years ago
- ☆32Updated 2 years ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆19Updated 4 years ago
- ☆12Updated 2 years ago
- ☆16Updated 2 years ago
- Fast model deployment on AWS Lambda☆14Updated last year
- setup the env for vllm users☆16Updated last year
- Fine-tune Mistral 7B to generate fashion style suggestions☆34Updated last year
- Repository for the LLMOps RAG with Airflow + Weaviate Learn use case.☆33Updated last year