facebookresearch / fairseqLinks

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

☆31,889

Alternatives and similar repositories for fairseq

Users that are interested in fairseq are comparing it to the libraries listed below

Sorting:

huggingface / transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆151,652Updated last week
OpenNMT / OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆6,958Updated 2 weeks ago
microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆21,798Updated 4 months ago
google / sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,394Updated 3 weeks ago
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,446Updated 6 months ago
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,256Updated last week
allenai / allennlp
An open-source NLP research library, built on PyTorch.
☆11,882Updated 2 years ago
NVIDIA-NeMo / NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…
☆16,017Updated this week
google-research / bert
TensorFlow code and pre-trained models for BERT
☆39,619Updated last year
codertimo / BERT-pytorch
Google AI 2018 BERT pytorch implementation
☆6,492Updated 2 years ago
huggingface / tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
☆10,196Updated 2 weeks ago
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,959Updated this week
facebookresearch / faiss
A library for efficient similarity search and clustering of dense vectors.
☆37,735Updated this week
jessevig / bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
☆7,719Updated 5 months ago
jadore801120 / attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
☆9,475Updated last year
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆13,976Updated this week
deepspeedai / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆40,538Updated this week
harvardnlp / annotated-transformer
An annotated implementation of the Transformer paper.
☆6,648Updated last year
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,180Updated 2 years ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆20,280Updated this week
huggingface / sentence-transformers
State-of-the-Art Text Embeddings
☆17,774Updated last week
Lightning-AI / pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
☆30,349Updated this week
facebookresearch / ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
☆10,623Updated 2 years ago
stanfordnlp / stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
☆7,642Updated this week
google-research / google-research
Google Research
☆36,605Updated last week
tensorflow / tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
☆16,650Updated 2 years ago
pytorch / examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
☆23,521Updated 2 months ago
huggingface / datasets
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
☆20,801Updated this week
microsoft / nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…
☆14,287Updated last year
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆12,843Updated 10 months ago