replicate / cogLinks
Containers for machine learning
☆8,670Updated this week
Alternatives and similar repositories for cog
Users that are interested in cog are comparing it to the libraries listed below
Sorting:
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…☆8,261Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,675Updated 9 months ago
- A collection of libraries to optimise AI model performances☆8,372Updated 11 months ago
- A guidance language for controlling large language models.☆20,336Updated this week
- Structured Text Generation☆11,843Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆9,610Updated this week
- Large Language Model Text Generation Inference☆10,236Updated this week
- Accessible large language models via k-bit quantization for PyTorch.☆7,142Updated this week
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows☆11,110Updated last week
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆22,108Updated 10 months ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!☆7,806Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,500Updated last year
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM☆7,850Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆42,030Updated 6 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,073Updated 9 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆38,681Updated this week
- Simple, safe way to store and distribute tensors☆3,311Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,647Updated 2 months ago
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,668Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,421Updated 2 weeks ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,366Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,293Updated this week
- Postgres with GPUs for ML/AI apps.☆6,340Updated 2 months ago
- Numbers every LLM developer should know☆4,235Updated last year
- Fast and memory-efficient exact attention☆17,952Updated this week
- Repo for external large-scale work☆6,530Updated last year
- A Bulletproof Way to Generate Structured JSON from Language Models☆4,750Updated last year
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,552Updated last week
- Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022☆6,339Updated 11 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆29,437Updated this week