Public repo for HF blog posts
☆3,344 · Mar 13, 2026 · Updated last week
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below.
- Train transformer language models with reinforcement learning. ☆17,697 · Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆20,809 · Mar 16, 2026 · Updated last week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆9,563 · Updated this week
- Large Language Model Text Generation Inference ☆10,812 · Jan 8, 2026 · Updated 2 months ago
- Fast and memory-efficient exact attention ☆22,832 · Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆158,060 · Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,869 · Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,428 · Jun 2, 2025 · Updated 9 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. ☆33,085 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆73,479 · Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆22,046 · Jan 23, 2026 · Updated 2 months ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆3,332 · Mar 13, 2026 · Updated last week
- Notebooks using the Hugging Face libraries 🤗 ☆4,487 · Mar 12, 2026 · Updated last week
- Accessible large language models via k-bit quantization for PyTorch. ☆8,052 · Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆24,578 · Aug 12, 2024 · Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,190 · Sep 30, 2025 · Updated 5 months ago
- Inference code for Llama models ☆59,221 · Jan 26, 2025 · Updated last year
- Ongoing research training transformer models at scale ☆15,744 · Updated this week
- LlamaIndex is the leading document agent and OCR platform ☆47,753 · Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,258 · Jul 17, 2024 · Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆68,728 · Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ☆13,351 · Dec 17, 2024 · Updated last year
- Robust recipes to align language models with human and AI preferences ☆5,527 · Sep 8, 2025 · Updated 6 months ago
- A framework for few-shot evaluation of language models. ☆11,704 · Mar 5, 2026 · Updated 2 weeks ago
- Example models using DeepSpeed ☆6,807 · Mar 4, 2026 · Updated 2 weeks ago
- State-of-the-Art Text Embeddings ☆18,427 · Mar 12, 2026 · Updated last week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ☆32,861 · Feb 18, 2026 · Updated last month
- This repository contains demos I made with the Transformers library by HuggingFace. ☆11,531 · Mar 9, 2026 · Updated 2 weeks ago
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆10,373 · Updated this week
- Fully open reproduction of DeepSeek-R1 ☆25,953 · Nov 24, 2025 · Updated 3 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆18,261 · Mar 3, 2026 · Updated 2 weeks ago
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆11,189 · Nov 18, 2024 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,858 · Jun 10, 2024 · Updated last year
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ☆9,478 · Jun 7, 2025 · Updated 9 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆54,096 · Updated this week
- The agent engineering platform ☆130,454 · Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆16,918 · Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆24,829 · Updated this week
- Efficient few-shot learning with Sentence Transformers ☆2,699 · Dec 11, 2025 · Updated 3 months ago