replicate / cogLinks

Containers for machine learning

☆8,729

Alternatives and similar repositories for cog

Users that are interested in cog are comparing it to the libraries listed below

Sorting:

bentoml / BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
☆7,908Updated this week
huggingface / safetensors
Simple, safe way to store and distribute tensors
☆3,356Updated 3 weeks ago
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆9,731Updated 10 months ago
skypilot-org / skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…
☆8,409Updated this week
gradio-app / gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
☆39,113Updated last week
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆8,951Updated last week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,330Updated this week
eth-sri / lmql
A language for constraint-guided and efficient LLM programming.
☆4,011Updated 2 months ago
ggml-org / ggml
Tensor library for machine learning
☆12,859Updated this week
huggingface / optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…
☆2,990Updated this week
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,352Updated this week
PrefectHQ / marvin
an ambient intelligence library
☆5,825Updated this week
lancedb / lancedb
Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.
☆7,105Updated last week
huggingface / diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
☆29,863Updated this week
1rgs / jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
☆4,776Updated last year
neuml / txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
☆11,248Updated last week
activeloopai / deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre…
☆8,732Updated last week
facebookincubator / AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆4,659Updated 3 months ago
neuralmagic / deepsparse
Sparsity-aware deep learning inference runtime for CPUs
☆3,160Updated last month
deep-floyd / IF
☆7,838Updated last year
argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆4,585Updated last week
huggingface / autotrain-advanced
🤗 AutoTrain Advanced
☆4,449Updated 6 months ago
openai / tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
☆15,146Updated 4 months ago
deepset-ai / haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convert…
☆21,637Updated this week
facebookresearch / xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆9,751Updated last week
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,079Updated 3 weeks ago
triton-lang / triton
Development repository for the Triton language and compiler
☆16,245Updated this week
pgvector / pgvector
Open-source vector similarity search for Postgres
☆16,636Updated 2 weeks ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,572Updated last year
diffgram / diffgram
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI C…
☆1,874Updated 8 months ago