huggingface / transformersLinks
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
β148,105Updated this week
Alternatives and similar repositories for transformers
Users that are interested in transformers are comparing it to the libraries listed below
Sorting:
- Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.β29,925Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β31,699Updated 2 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ21,597Updated last month
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β39,689Updated this week
- State-of-the-Art Text Embeddingsβ17,299Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU accelerationβ92,272Updated this week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β19,252Updated this week
- TensorFlow code and pre-trained models for BERTβ39,396Updated last year
- A library for efficient similarity search and clustering of dense vectors.β36,512Updated this week
- π₯ Fast State-of-the-Art Tokenizers optimized for Research and Productionβ9,969Updated last week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ33,033Updated this week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,010Updated this week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterβ¦β14,432Updated 11 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β38,944Updated 2 months ago
- Fast and memory-efficient exact attentionβ18,776Updated this week
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.β30,176Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ54,323Updated this week
- Making large AI models cheaper, faster and more accessibleβ41,067Updated this week
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)β7,586Updated 2 months ago
- Label Studio is a multi-type data labeling and annotation tool with standardized output formatβ24,070Updated this week
- A hyperparameter optimization frameworkβ12,449Updated this week
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoiβ¦β51,462Updated 10 months ago
- Visualizer for neural network, deep learning and machine learning modelsβ31,121Updated this week
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.β32,518Updated last month
- Ongoing research training transformer models at scaleβ13,130Updated this week
- Graph Neural Network Library for PyTorchβ22,715Updated last week
- π€ The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation toolsβ20,473Updated last week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.β43,574Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β38,394Updated this week
- π¦π Build context-aware reasoning applicationsβ113,278Updated this week