facebookresearch / XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,872Updated last year
Related projects: ⓘ
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,321Updated 5 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,081Updated 6 months ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,223Updated 6 months ago
- BERT-related papers☆2,030Updated last year
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,115Updated last year
- jiant is an nlp toolkit☆1,637Updated last year
- ☆3,600Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,181Updated last month
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,175Updated last year
- Language-Agnostic SEntence Representations☆3,576Updated 4 months ago
- Super easy library for BERT based NLP models☆1,853Updated last month
- Phrase-Based & Neural Unsupervised Machine Translation☆1,507Updated 3 years ago
- Basic Utilities for PyTorch Natural Language Processing (NLP)☆2,209Updated last year
- Longformer: The Long-Document Transformer☆2,028Updated last year
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,506Updated 3 years ago
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,387Updated 3 years ago
- ALBERT: A Lite BERT for Self-supervised Learning of Language Representations☆3,233Updated last year
- Models, data loaders and abstractions for language processing, powered by PyTorch☆3,496Updated this week
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,176Updated 6 months ago
- InferSent sentence embeddings☆2,280Updated 3 years ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,139Updated 5 years ago
- Must-read Papers on pre-trained language models.☆3,321Updated last year
- A library for Multilingual Unsupervised or Supervised word Embeddings☆3,179Updated 2 years ago
- A curated list of pretrained sentence and word embedding models☆2,214Updated 3 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,620Updated last year
- NLP made easy☆2,554Updated 11 months ago
- Data augmentation for NLP☆4,406Updated 2 months ago
- General purpose unsupervised sentence representations☆1,193Updated 2 years ago
- Unsupervised Data Augmentation (UDA)☆2,172Updated 3 years ago
- Papers & presentation materials from Hugging Face's internal science day☆2,027Updated 3 years ago