replicate / cogLinks
Containers for machine learning
☆8,685Updated this week
Alternatives and similar repositories for cog
Users that are interested in cog are comparing it to the libraries listed below
Sorting:
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,694Updated 9 months ago
- A collection of libraries to optimise AI model performances☆8,372Updated 11 months ago
- Simple, safe way to store and distribute tensors☆3,334Updated last week
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…☆8,301Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!☆7,814Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆9,641Updated last week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.☆29,526Updated this week
- the AI-native open-source embedding database☆20,790Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,443Updated last week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,073Updated this week
- StableLM: Stability AI Language Models☆15,828Updated last year
- Development repository for the Triton language and compiler☆15,989Updated this week
- Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.☆5,672Updated this week
- Accessible large language models via k-bit quantization for PyTorch.☆7,167Updated this week
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,670Updated last year
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM☆7,851Updated 2 months ago
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,337Updated 8 months ago
- Serve, optimize and scale PyTorch models in production☆4,340Updated 2 weeks ago
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows☆11,159Updated this week
- Structured Outputs☆11,990Updated this week
- Instruct-tune LLaMA on consumer hardware☆18,927Updated 11 months ago
- Large Language Model Text Generation Inference☆10,265Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,647Updated 3 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆8,875Updated this week
- Go ahead and axolotl questions☆9,760Updated this week
- DSPy: The framework for programming—not prompting—language models☆26,016Updated this week
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆7,744Updated 2 years ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…☆37,391Updated 10 months ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆2,953Updated this week
- 🤗 AutoTrain Advanced☆4,425Updated 5 months ago