Public repo for HF blog posts
β3,434Jun 4, 2026Updated last week
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train transformer language models with reinforcement learning.β18,613Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β21,258Updated this week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,720Updated this week
- Large Language Model Text Generation Inferenceβ10,859Mar 21, 2026Updated 2 months ago
- Fast and memory-efficient exact attentionβ24,111Updated this week
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β161,309Jun 5, 2026Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β42,478Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,470May 1, 2026Updated last month
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,826Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,147Jan 23, 2026Updated 4 months ago
- Notebooks using the Hugging Face libraries π€β4,569Updated this week
- Accessible large language models via k-bit quantization for PyTorch.β8,258Updated this week
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,409Jun 3, 2026Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,864Aug 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,226Sep 30, 2025Updated 8 months ago
- Inference code for Llama modelsβ59,452Jan 26, 2025Updated last year
- Ongoing research training transformer models at scaleβ16,617Updated this week
- LlamaIndex is the leading document agent and OCR platformβ50,073Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,248Jul 17, 2024Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β71,937Jun 5, 2026Updated last week
- Robust recipes to align language models with human and AI preferencesβ5,608May 26, 2026Updated 2 weeks ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β13,579Dec 17, 2024Updated last year
- A framework for few-shot evaluation of language models.β12,885Jun 2, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Example models using DeepSpeedβ6,820May 20, 2026Updated 3 weeks ago
- State-of-the-Art Embeddings, Retrieval, and Rerankingβ18,780Jun 5, 2026Updated last week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ33,747Mar 25, 2026Updated 2 months ago
- This repository contains demos I made with the Transformers library by HuggingFace.β11,635Apr 20, 2026Updated last month
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,490May 21, 2026Updated 3 weeks ago
- Fully open reproduction of DeepSeek-R1β26,034Apr 2, 2026Updated 2 months ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,925Jun 10, 2024Updated 2 years ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β18,343May 19, 2026Updated 3 weeks ago
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,236Jun 2, 2026Updated last week
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,483Jun 7, 2025Updated last year
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Autoβ¦β17,335Updated this week
- The agent engineering platform.β138,777Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.β66,153Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,815Jun 5, 2026Updated last week
- Efficient few-shot learning with Sentence Transformersβ2,743May 26, 2026Updated 2 weeks ago
- Retrieval and Retrieval-augmented LLMsβ11,802Apr 22, 2026Updated last month