philschmid / serverless-machine-learningLinks
collection of serverless machine learning use cases and examples including Hugging Face transformers, timm, Gradio
☆16Updated 3 years ago
Alternatives and similar repositories for serverless-machine-learning
Users that are interested in serverless-machine-learning are comparing it to the libraries listed below
Sorting:
- Large Language Model Hosting Container☆91Updated 3 months ago
- Framework for building and maintaining self-updating prompts for LLMs☆65Updated last year
- ☆64Updated 9 months ago
- Tools and utilities for operating Metaflow in production☆68Updated 2 months ago
- Amazon SageMaker Managed Spot Training Examples☆51Updated last year
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!☆177Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆114Updated 2 years ago
- ☆29Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆65Updated last year
- ☆29Updated 2 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Updated last year
- Library of Prefect tasks and utilities.☆10Updated last year
- Render Jupyter Notebooks With Metaflow Cards☆31Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebook☆17Updated last week
- ☆48Updated 2 years ago
- Reference architecture for LLM-based applications on Google Cloud Platform with Redis Enterprise as a high-performance data layer.☆38Updated 9 months ago
- Cortex-compatible model server for Python and TensorFlow☆18Updated 3 years ago
- Run GPU inference and training jobs on serverless infrastructure that scales with you.☆102Updated last year
- ☆13Updated last year
- ☆56Updated 2 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆40Updated 2 years ago
- ☆24Updated last year
- Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in ~1 minute☆81Updated 2 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- ☆45Updated 6 months ago
- VSCode extension for ZenML☆21Updated last week
- ☆22Updated 2 years ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆41Updated last year