Public repo for HF blog posts
β3,446Jun 24, 2026Updated last week
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train transformer language models with reinforcement learning.β18,735Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β21,337Updated this week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,749Updated this week
- Large Language Model Text Generation Inferenceβ10,862Mar 21, 2026Updated 3 months ago
- Fast and memory-efficient exact attentionβ24,304Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β161,885Jun 25, 2026Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β42,586Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,492May 1, 2026Updated 2 months ago
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,960Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,152Jan 23, 2026Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ84,877Updated this week
- Notebooks using the Hugging Face libraries π€β4,583Updated this week
- Accessible large language models via k-bit quantization for PyTorch.β8,286Jun 22, 2026Updated last week
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,426Jun 22, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,896Aug 12, 2024Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,233Sep 30, 2025Updated 9 months ago
- Inference code for Llama modelsβ59,475Jan 26, 2025Updated last year
- Ongoing research training transformer models at scaleβ16,838Updated this week
- LlamaIndex is the leading document agent and OCR platformβ50,533Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,249Jul 17, 2024Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β72,482Jun 24, 2026Updated last week
- Robust recipes to align language models with human and AI preferencesβ5,623May 26, 2026Updated last month
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β13,614Dec 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A framework for few-shot evaluation of language models.β13,106Jun 24, 2026Updated last week
- Example models using DeepSpeedβ6,825Jun 24, 2026Updated last week
- State-of-the-Art Embeddings, Retrieval, and Rerankingβ18,853Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ33,892Mar 25, 2026Updated 3 months ago
- This repository contains demos I made with the Transformers library by HuggingFace.β11,653Apr 20, 2026Updated 2 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,513Jun 18, 2026Updated 2 weeks ago
- Fully open reproduction of DeepSeek-R1β26,345Apr 2, 2026Updated 3 months ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,940Jun 10, 2024Updated 2 years ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β18,378May 19, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,243Jun 2, 2026Updated 3 weeks ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,485Jun 7, 2025Updated last year
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Autoβ¦β17,646Updated this week
- The agent engineering platform.β140,319Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.β67,571Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,989Jun 24, 2026Updated last week
- Efficient few-shot learning with Sentence Transformersβ2,755May 26, 2026Updated last month