replicate / cog
Containers for machine learning
☆8,557Updated this week
Alternatives and similar repositories for cog:
Users that are interested in cog are comparing it to the libraries listed below
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,004Updated this week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆28,700Updated this week
- Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre…☆8,552Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,582Updated 7 months ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!☆7,645Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,463Updated this week
- Go ahead and axolotl questions☆9,165Updated this week
- Simple, safe way to store and distribute tensors☆3,233Updated last month
- Large Language Model Text Generation Inference☆10,031Updated this week
- DSPy: The framework for programming—not prompting—language models☆23,640Updated this week
- Tensor library for machine learning☆12,388Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,408Updated 11 months ago
- Postgres with GPUs for ML/AI apps.☆6,247Updated last week
- A language for constraint-guided and efficient LLM programming.☆3,902Updated 10 months ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,630Updated 3 weeks ago
- Development repository for the Triton language and compiler☆15,290Updated this week
- The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.☆10,483Updated 2 weeks ago
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,698Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,995Updated last week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆9,372Updated last week
- Universal LLM Deployment Engine with ML Compilation☆20,434Updated 2 weeks ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,050Updated 7 months ago
- Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.☆5,515Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆13,684Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆14,232Updated last month
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!☆13,466Updated last week
- Ongoing research training transformer models at scale☆12,118Updated this week
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,308Updated 5 months ago
- Train transformer language models with reinforcement learning.☆13,373Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,177Updated this week