Public repo for HF blog posts
β3,408May 19, 2026Updated this week
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train transformer language models with reinforcement learning.β18,411Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β21,138May 13, 2026Updated last week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,691Updated this week
- Large Language Model Text Generation Inferenceβ10,856Mar 21, 2026Updated 2 months ago
- Fast and memory-efficient exact attentionβ23,836Updated this week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β160,794Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β42,386Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,474May 1, 2026Updated 3 weeks ago
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,668Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,128Jan 23, 2026Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ80,418Updated this week
- Notebooks using the Hugging Face libraries π€β4,553May 15, 2026Updated last week
- Accessible large language models via k-bit quantization for PyTorch.β8,216Updated this week
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,392May 7, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,802Aug 12, 2024Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,219Sep 30, 2025Updated 7 months ago
- Inference code for Llama modelsβ59,425Jan 26, 2025Updated last year
- Ongoing research training transformer models at scaleβ16,427Updated this week
- LlamaIndex is the leading document agent and OCR platformβ49,501May 15, 2026Updated last week
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,248Jul 17, 2024Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β71,468Updated this week
- Robust recipes to align language models with human and AI preferencesβ5,602Apr 8, 2026Updated last month
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β13,544Dec 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A framework for few-shot evaluation of language models.β12,595May 11, 2026Updated last week
- Example models using DeepSpeedβ6,820Updated this week
- State-of-the-Art Embeddings, Retrieval, and Rerankingβ18,669May 12, 2026Updated last week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ33,514Mar 25, 2026Updated last month
- This repository contains demos I made with the Transformers library by HuggingFace.β11,628Apr 20, 2026Updated last month
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,462Apr 21, 2026Updated last month
- Fully open reproduction of DeepSeek-R1β26,020Apr 2, 2026Updated last month
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,908Jun 10, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β18,331Updated this week
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,221Nov 18, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,480Jun 7, 2025Updated 11 months ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Autoβ¦β17,227Updated this week
- The agent engineering platform.β136,798May 14, 2026Updated last week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.β64,485Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,634Updated this week
- Efficient few-shot learning with Sentence Transformersβ2,741Apr 17, 2026Updated last month