google-research / url-nlp
☆208Updated last month
Alternatives and similar repositories for url-nlp:
Users that are interested in url-nlp are comparing it to the libraries listed below
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆100Updated 11 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆105Updated last month
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- A simple library for querying the URIEL typological database.☆89Updated last year
- a tool for calcualting character n-gram F score☆72Updated 2 years ago
- The FLORES+ Machine Translation Benchmark☆101Updated 5 months ago
- NTREX -- News Test References for MT Evaluation☆81Updated 10 months ago
- The Benchmark of Linguistic Minimal Pairs☆150Updated 2 years ago
- ☆84Updated 6 months ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆76Updated last year
- ☆34Updated 9 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆154Updated 2 weeks ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆271Updated 2 years ago
- ☆97Updated 2 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆68Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆272Updated 2 months ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆102Updated 4 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆71Updated 8 months ago
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆58Updated 4 years ago
- Multilingual Large Language Models Evaluation Benchmark☆121Updated 7 months ago
- ☆97Updated 2 years ago
- code associated with ACL 2021 DExperts paper☆114Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆93Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆361Updated last year
- A curated list of research papers and resources on Cultural LLM.☆41Updated 6 months ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆178Updated last year
- GEMBA — GPT Estimation Metric Based Assessment☆115Updated 8 months ago
- Build a dialog dataset from online books in many languages☆72Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆80Updated 7 months ago