Public repo for HF blog posts
β3,370Apr 8, 2026Updated this week
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train transformer language models with reinforcement learning.β17,967Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,895Apr 2, 2026Updated last week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,596Apr 2, 2026Updated last week
- Large Language Model Text Generation Inferenceβ10,830Mar 21, 2026Updated 3 weeks ago
- Fast and memory-efficient exact attentionβ23,185Updated this week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β159,060Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β42,029Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,447Jun 2, 2025Updated 10 months ago
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,282Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ75,637Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,077Jan 23, 2026Updated 2 months ago
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,354Apr 2, 2026Updated last week
- Accessible large language models via k-bit quantization for PyTorch.β8,107Updated this week
- Notebooks using the Hugging Face libraries π€β4,506Apr 2, 2026Updated last week
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,652Aug 12, 2024Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,201Sep 30, 2025Updated 6 months ago
- Inference code for Llama modelsβ59,296Jan 26, 2025Updated last year
- Ongoing research training transformer models at scaleβ15,985Updated this week
- LlamaIndex is the leading document agent and OCR platformβ48,389Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,253Jul 17, 2024Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β69,794Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β13,411Dec 17, 2024Updated last year
- Robust recipes to align language models with human and AI preferencesβ5,551Apr 2, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A framework for few-shot evaluation of language models.β12,020Apr 1, 2026Updated last week
- Example models using DeepSpeedβ6,815Mar 30, 2026Updated last week
- State-of-the-Art Text Embeddingsβ18,534Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ33,104Mar 25, 2026Updated 2 weeks ago
- This repository contains demos I made with the Transformers library by HuggingFace.β11,566Mar 9, 2026Updated last month
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,411Mar 30, 2026Updated last week
- Fully open reproduction of DeepSeek-R1β25,973Apr 2, 2026Updated last week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β18,278Apr 1, 2026Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,865Jun 10, 2024Updated last year
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,192Nov 18, 2024Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,471Jun 7, 2025Updated 10 months ago
- The agent engineering platformβ133,136Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Autoβ¦β17,048Updated this week
- Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.β59,774Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,277Updated this week
- Efficient few-shot learning with Sentence Transformersβ2,710Apr 2, 2026Updated last week