replicate / cog
Containers for machine learning
☆8,729 · Updated last week
Alternatives and similar repositories for cog
Users who are interested in cog are comparing it to the libraries listed below.
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! ☆7,908 · Updated this week
- Simple, safe way to store and distribute tensors ☆3,356 · Updated 3 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,731 · Updated 10 months ago
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v… ☆8,409 · Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆39,113 · Updated last week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆8,951 · Updated last week
- Accessible large language models via k-bit quantization for PyTorch. ☆7,330 · Updated this week
- A language for constraint-guided and efficient LLM programming. ☆4,011 · Updated 2 months ago
- Tensor library for machine learning ☆12,859 · Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆2,990 · Updated this week
- Large Language Model Text Generation Inference ☆10,352 · Updated this week
- an ambient intelligence library ☆5,825 · Updated this week
- Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less. ☆7,105 · Updated last week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX. ☆29,863 · Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,776 · Updated last year
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows ☆11,248 · Updated last week
- Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre… ☆8,732 · Updated last week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,659 · Updated 3 months ago
- Sparsity-aware deep learning inference runtime for CPUs ☆3,160 · Updated last month
- ☆7,838 · Updated last year
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,585 · Updated last week
- 🤗 AutoTrain Advanced ☆4,449 · Updated 6 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆15,146 · Updated 4 months ago
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convert… ☆21,637 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆9,751 · Updated last week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,079 · Updated 3 weeks ago
- Development repository for the Triton language and compiler ☆16,245 · Updated this week
- Open-source vector similarity search for Postgres ☆16,636 · Updated 2 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,572 · Updated last year
- The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI C… ☆1,874 · Updated 8 months ago