Containers for machine learning
☆9,252Updated this week
Alternatives and similar repositories for cog
Users that are interested in cog are comparing it to the libraries listed below
Sorting:
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, o…☆9,478Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆41,855Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,295Updated this week
- LlamaIndex is the leading document agent and OCR platform☆47,210Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆29,102Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!☆8,472Updated this week
- DSPy: The framework for programming —not prompting—language models☆32,381Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,873Updated this week
- ☁️ Build multimodal AI applications with cloud-native stack☆21,832Mar 24, 2025Updated 11 months ago
- Structured Outputs☆13,456Feb 13, 2026Updated 2 weeks ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,210Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆15,626Feb 20, 2026Updated last week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,958Sep 7, 2024Updated last year
- A guidance language for controlling large language models.☆21,319Feb 13, 2026Updated 2 weeks ago
- 🦜🔗 The platform for reliable agents.☆127,192Updated this week
- Open-source search and retrieval database for AI applications.☆26,269Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,413Updated this week
- Go ahead and axolotl questions☆11,335Updated this week
- Large Language Model Text Generation Inference☆10,774Jan 8, 2026Updated last month
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,652Updated this week
- Streamlit — A faster way to build and share data apps.☆43,634Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,444Aug 17, 2024Updated last year
- structured outputs for llms☆12,428Updated this week
- You like pytorch? You like micrograd? You love tinygrad! ❤️☆31,424Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆44,608Aug 16, 2024Updated last year
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Updated this week
- Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…☆26,786Updated this week
- 🦉 Data Versioning and ML Experiments☆15,385Feb 16, 2026Updated last week
- LLM inference in C/C++☆95,726Updated this week
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search☆42,932Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,860Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,648Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,768Updated this week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,513Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,414Jun 2, 2025Updated 8 months ago
- A library for efficient similarity search and clustering of dense vectors.☆39,195Updated this week
- Low-code framework for building custom LLMs, neural networks, and other AI models☆11,651Updated this week