XTREME is a benchmark for evaluating the cross-lingual generalization ability of pre-trained multilingual models. It covers 40 typologically diverse languages and includes nine tasks.
☆651 Jan 4, 2023 Updated 3 years ago
Alternatives and similar repositories for xtreme
Users who are interested in xtreme are comparing it to the libraries listed below
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,924 Feb 14, 2023 Updated 3 years ago
- New dataset ☆311 Aug 31, 2021 Updated 4 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆786 Aug 4, 2023 Updated 2 years ago
- Evaluating Cross-lingual Sentence Representations ☆464 Aug 30, 2021 Updated 4 years ago
- ☆207 Nov 12, 2021 Updated 4 years ago
- Language-Agnostic SEntence Representations ☆3,659 May 2, 2024 Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,371 Mar 23, 2024 Updated last year
- Code for using and evaluating SpanBERT. ☆904 Jul 25, 2023 Updated 2 years ago
- LAnguage Model Analysis ☆1,390 Jul 7, 2024 Updated last year
- Neural Text Generation with Unlikelihood Training ☆310 Aug 31, 2021 Updated 4 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models. ☆92 Sep 13, 2023 Updated 2 years ago
- Longformer: The Long-Document Transformer ☆2,188 Feb 8, 2023 Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆317 May 28, 2020 Updated 5 years ago
- Adversarial Natural Language Inference Benchmark ☆398 May 12, 2022 Updated 3 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,490 Jan 14, 2026 Updated last month
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering". ☆80 Jun 3, 2021 Updated 4 years ago
- ACL2020 Tutorial: Open-Domain Question Answering ☆835 Jan 1, 2021 Updated 5 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 Feb 22, 2022 Updated 4 years ago
- Library for Knowledge Intensive Language Tasks ☆967 Mar 31, 2022 Updated 3 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation ☆1,123 Nov 28, 2022 Updated 3 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning ☆41 May 5, 2021 Updated 4 years ago
- Cross-lingual GLUE ☆49 Jun 15, 2023 Updated 2 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training ☆129 Aug 4, 2021 Updated 4 years ago
- Tools to download and cleanup Common Crawl data ☆1,039 Apr 25, 2023 Updated 2 years ago
- Unicoder model for understanding and generation. ☆92 Dec 12, 2023 Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team. ☆1,752 Feb 20, 2026 Updated last week
- jiant is an nlp toolkit ☆1,674 Jul 6, 2023 Updated 2 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList ☆2,050 Jan 9, 2024 Updated 2 years ago
- ☆1,297 Dec 15, 2022 Updated 3 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Nov 7, 2021 Updated 4 years ago
- Facebook Low Resource (FLoRes) MT Benchmark ☆766 Nov 20, 2023 Updated 2 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval ☆43 Jun 12, 2023 Updated 2 years ago
- Resources for the MRQA 2019 Shared Task ☆294 Aug 5, 2021 Updated 4 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,176 May 28, 2023 Updated 2 years ago
- A library for Multilingual Unsupervised or Supervised word Embeddings ☆3,238 Aug 31, 2022 Updated 3 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model" ☆190 May 23, 2025 Updated 9 months ago
- BERT score for text generation ☆1,876 Jul 30, 2024 Updated last year
- Must-read Papers on pre-trained language models. ☆3,362 Nov 6, 2022 Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an… ☆561 Jan 4, 2022 Updated 4 years ago