facebookresearch/XLM

PyTorch original implementation of Cross-lingual Language Model Pretraining.

☆2,927

Alternatives and similar repositories for XLM

Users who are interested in XLM are comparing it to the libraries listed below.

facebookresearch / UnsupervisedMT
Phrase-Based & Neural Unsupervised Machine Translation
☆1,499 · Sep 15, 2021 · Updated 4 years ago
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,178 · May 28, 2023 · Updated 3 years ago
facebookresearch / MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
☆3,248 · Aug 31, 2022 · Updated 3 years ago
microsoft / MASS
MASS: Masked Sequence to Sequence Pre-training for Language Generation
☆1,117 · Nov 28, 2022 · Updated 3 years ago
facebookresearch / LASER
Language-Agnostic SEntence Representations
☆3,660 · May 2, 2024 · Updated 2 years ago
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,240 · Sep 30, 2025 · Updated 9 months ago
glample / fastBPE
Fast BPE
☆677 · Jun 18, 2024 · Updated 2 years ago
rsennrich / subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,271 · Aug 7, 2024 · Updated last year
artetxem / vecmap
A framework to learn cross-lingual word embedding mappings
☆655 · Apr 22, 2023 · Updated 3 years ago
allenai / allennlp
An open-source NLP research library, built on PyTorch.
☆11,889 · Nov 22, 2022 · Updated 3 years ago
OpenNMT / OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆7,007 · Oct 14, 2025 · Updated 9 months ago
harvardnlp / pytorch-struct
Fast, general, and tested differentiable structured prediction in PyTorch
☆1,132 · Apr 20, 2022 · Updated 4 years ago
facebookresearch / pytext
A natural language modeling framework based on PyTorch
☆6,296 · Oct 17, 2022 · Updated 3 years ago
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,537 · Jul 8, 2026 · Updated last week
google / sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,962 · Updated this week
google-research / xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆651 · Jan 4, 2023 · Updated 3 years ago
namisan / mt-dnn
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,260 · Mar 7, 2024 · Updated 2 years ago
facebookresearch / XNLI
Evaluating Cross-lingual Sentence Representations
☆463 · Aug 30, 2021 · Updated 4 years ago
sebastianruder / NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…
☆22,958 · Jul 28, 2024 · Updated last year
google-research / electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,367 · Mar 23, 2024 · Updated 2 years ago
facebookresearch / SentEval
A python tool for evaluating the quality of sentence embeddings.
☆2,110 · Mar 19, 2024 · Updated 2 years ago
google-research / bert
TensorFlow code and pre-trained models for BERT
☆40,054 · Jul 23, 2024 · Updated last year
salesforce / decaNLP
The Natural Language Decathlon: A Multitask Challenge for NLP
☆2,338 · May 1, 2025 · Updated last year
THUNLP-MT / MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
☆2,439 · Aug 9, 2024 · Updated last year
facebookresearch / LAMA
LAnguage Model Analysis
☆1,391 · Jul 7, 2024 · Updated 2 years ago
facebookresearch / mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
☆5,636 · Jul 7, 2026 · Updated last week
nyu-mll / jiant
jiant is an nlp toolkit
☆1,675 · Jul 6, 2023 · Updated 3 years ago
huggingface / pytorch-openai-transformer-lm
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
☆1,521 · Aug 9, 2021 · Updated 4 years ago
neulab / compare-mt
A tool for holistic analysis of language generations systems
☆471 · Sep 22, 2025 · Updated 9 months ago
facebookresearch / InferSent
InferSent sentence embeddings
☆2,280 · Aug 30, 2021 · Updated 4 years ago
clab / fast_align
Simple, fast unsupervised word aligner
☆769 · Jul 19, 2022 · Updated 3 years ago
pytorch / text
Models, data loaders and abstractions for language processing, powered by PyTorch
☆3,558 · Sep 10, 2025 · Updated 10 months ago
huggingface / naacl_transfer_learning_tutorial
Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA
☆723 · Oct 16, 2019 · Updated 6 years ago
facebookresearch / adaptive-span
Transformer training code for sequential tasks
☆610 · Sep 14, 2021 · Updated 4 years ago
flairNLP / flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
☆14,380 · Oct 27, 2025 · Updated 8 months ago
artetxem / undreamt
Unsupervised Neural Machine Translation
☆474 · Jul 8, 2020 · Updated 6 years ago
openai / finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
☆2,306 · Jan 25, 2019 · Updated 7 years ago
kimiyoung / transformer-xl
☆3,704 · Sep 21, 2022 · Updated 3 years ago
PetrochukM / PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
☆2,224 · Jul 4, 2023 · Updated 3 years ago