google-research/xtreme

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

☆651

Alternatives and similar repositories for xtreme

Users who are interested in xtreme are comparing it to the repositories listed below.

facebookresearch / XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,925 · Feb 14, 2023 · Updated 3 years ago
facebookresearch / MLQA
New dataset
☆311 · Aug 31, 2021 · Updated 4 years ago
facebookresearch / XNLI
Evaluating Cross-lingual Sentence Representations
☆463 · Aug 30, 2021 · Updated 4 years ago
facebookresearch / LASER
Language-Agnostic SEntence Representations
☆3,661 · May 2, 2024 · Updated 2 years ago
google-deepmind / xquad
☆210 · Nov 12, 2021 · Updated 4 years ago
microsoft / XGLUE
Cross-lingual GLUE
☆49 · Jun 15, 2023 · Updated 3 years ago
shijie-wu / crosslingual-nlp
This repo supports various cross-lingual transfer learning & multilingual NLP models.
☆92 · Sep 13, 2023 · Updated 2 years ago
google-research / electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,367 · Mar 23, 2024 · Updated 2 years ago
google-research / bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
☆794 · Aug 4, 2023 · Updated 2 years ago
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆319 · May 28, 2020 · Updated 6 years ago
facebookresearch / SpanBERT
Code for using and evaluating SpanBERT.
☆908 · Jul 25, 2023 · Updated 2 years ago
facebookresearch / LAMA
LAnguage Model Analysis
☆1,391 · Jul 7, 2024 · Updated 2 years ago
microsoft / Unicoder
Unicoder model for understanding and generation.
☆91 · Dec 12, 2023 · Updated 2 years ago
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,536 · Jul 8, 2026 · Updated last week
microsoft / MetaXL
Meta Representation Transformation for Low-resource Cross-lingual Learning
☆41 · May 5, 2021 · Updated 5 years ago
facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359 · Feb 22, 2022 · Updated 4 years ago
AkariAsai / XORQA
This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".
☆80 · Jun 3, 2021 · Updated 5 years ago
facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆402 · May 12, 2022 · Updated 4 years ago
allenai / longformer
Longformer: The Long-Document Transformer
☆2,201 · Feb 8, 2023 · Updated 3 years ago
facebookresearch / unlikelihood_training
Neural Text Generation with Unlikelihood Training
☆311 · Aug 31, 2021 · Updated 4 years ago
facebookresearch / cc_net
Tools to download and cleanup Common Crawl data
☆1,046 · Apr 25, 2023 · Updated 3 years ago
danqi / acl2020-openqa-tutorial
ACL2020 Tutorial: Open-Domain Question Answering
☆835 · Jan 1, 2021 · Updated 5 years ago
microsoft / MASS
MASS: Masked Sequence to Sequence Pre-training for Language Generation
☆1,117 · Nov 28, 2022 · Updated 3 years ago
CZWin32768 / XNLG
AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training
☆128 · Aug 4, 2021 · Updated 4 years ago
google-research / language
Shared repository for open-sourced projects from the Google AI Language team.
☆1,787 · Jun 10, 2026 · Updated last month
nyu-mll / jiant
jiant is an nlp toolkit
☆1,675 · Jul 6, 2023 · Updated 3 years ago
facebookresearch / KILT
Library for Knowledge Intensive Language Tasks
☆978 · Mar 31, 2022 · Updated 4 years ago
bozheng-hit / xTune
☆50 · Jul 15, 2021 · Updated 5 years ago
google-research / multilingual-t5
☆1,294 · Dec 15, 2022 · Updated 3 years ago
marcotcr / checklist
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
☆2,052 · Jan 9, 2024 · Updated 2 years ago
ahmetustun / udapter
UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…
☆31 · Dec 5, 2022 · Updated 3 years ago
cisnlp / simalign
[EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
☆398 · Nov 7, 2023 · Updated 2 years ago
xwhan / ProQA
Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval
☆43 · Jun 12, 2023 · Updated 3 years ago
roomylee / nlp-papers-with-arxiv
Statistics and accepted paper list of NLP conferences with arXiv link
☆423 · Jul 24, 2021 · Updated 4 years ago
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,180 · May 28, 2023 · Updated 3 years ago
linzehui / mRASP
☆167 · Dec 24, 2021 · Updated 4 years ago
thunlp / PLMpapers
Must-read Papers on pre-trained language models.
☆3,360 · Nov 6, 2022 · Updated 3 years ago
facebookresearch / MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
☆3,248 · Aug 31, 2022 · Updated 3 years ago
facebookresearch / flores
Facebook Low Resource (FLoRes) MT Benchmark
☆771 · Nov 20, 2023 · Updated 2 years ago