facebookresearch / XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,892Updated last year
Related projects ⓘ
Alternatives and complementary repositories for XLM
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,339Updated 7 months ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,118Updated last year
- Language-Agnostic SEntence Representations☆3,600Updated 6 months ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,197Updated 3 months ago
- ☆3,612Updated 2 years ago
- Phrase-Based & Neural Unsupervised Machine Translation☆1,506Updated 3 years ago
- jiant is an nlp toolkit☆1,647Updated last year
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,182Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,087Updated 8 months ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,239Updated 8 months ago
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,190Updated 2 years ago
- Super easy library for BERT based NLP models☆1,866Updated 3 months ago
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,511Updated 3 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,620Updated last year
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,213Updated last year
- Longformer: The Long-Document Transformer☆2,047Updated last year
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,389Updated 3 years ago
- A curated list of pretrained sentence and word embedding models☆2,233Updated 3 years ago
- InferSent sentence embeddings☆2,280Updated 3 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,187Updated last month
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,412Updated 10 months ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,132Updated 9 months ago
- Must-read Papers on pre-trained language models.☆3,330Updated 2 years ago
- BERT-related papers☆2,035Updated last year
- Models, data loaders and abstractions for language processing, powered by PyTorch☆3,517Updated this week
- 🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP☆1,191Updated last year
- Data augmentation for NLP☆4,454Updated 4 months ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,178Updated 2 months ago
- Single Headed Attention RNN - "Stop thinking with your head"☆1,178Updated 2 years ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,160Updated 5 years ago