π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
β161,694Jun 18, 2026Updated last week
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorFlow code and pre-trained models for BERTβ40,034Jul 23, 2024Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU accelerationβ100,915Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,229Sep 30, 2025Updated 8 months ago
- The agent engineering platform.β139,780Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β42,544Updated this week
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Inference code for Llama modelsβ59,461Jan 26, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMsβ83,677Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β31,198Jun 10, 2026Updated 2 weeks ago
- π€ The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation toolsβ21,648Jun 18, 2026Updated last week
- A library for efficient similarity search and clustering of dense vectors.β40,378Updated this week
- State-of-the-Art Embeddings, Retrieval, and Rerankingβ18,833Jun 18, 2026Updated last week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β21,299Updated this week
- An Open Source Machine Learning Framework for Everyoneβ195,844Updated this week
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the moβ¦β22,955Jul 28, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,956Jun 18, 2026Updated last week
- Google Researchβ38,163Updated this week
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --β¦β36,900Updated this week
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,914Updated this week
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus oβ¦β185,132Updated this week
- LLM inference in C/C++β117,608Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,151Jan 23, 2026Updated 5 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,486May 1, 2026Updated last month
- Making large AI models cheaper, faster and more accessibleβ41,404May 25, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β72,289Jun 17, 2026Updated last week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ35,879Updated this week
- An open-source NLP research library, built on PyTorch.β11,889Nov 22, 2022Updated 3 years ago
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ33,845Mar 25, 2026Updated 2 months ago
- LlamaIndex is the leading document agent and OCR platformβ50,340Updated this week
- Fast and memory-efficient exact attentionβ24,221Updated this week
- π« Industrial-strength Natural Language Processing (NLP) in Pythonβ33,669May 19, 2026Updated last month
- Deep Learning for humansβ64,100Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,246Jul 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β42,931Updated this week
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β174,493Updated this week
- Models and examples built with TensorFlowβ77,666Updated this week
- Code for the paper "Language Models are Unsupervised Multitask Learners"β24,951Aug 14, 2024Updated last year
- π Scalable embedding, reasoning, ranking for images and sentences with CLIPβ12,829Jan 23, 2024Updated 2 years ago
- A latent text-to-image diffusion modelβ73,137Jun 18, 2024Updated 2 years ago
- Train transformer language models with reinforcement learning.β18,701Updated this week