google-research / url-nlp
☆190Updated 5 months ago
Related projects
Alternatives and complementary repositories for url-nlp
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 6 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆90Updated last month
- A simple library for querying the URIEL typological database.☆88Updated 7 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- GEMBA — GPT Estimation Metric Based Assessment☆100Updated 3 months ago
- OpusFilter - Parallel corpus processing toolkit☆102Updated 2 months ago
- NTREX -- News Test References for MT Evaluation☆75Updated 5 months ago
- ☆65Updated last year
- a tool for calculating character n-gram F score☆66Updated last year
- ☆95Updated last year
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆250Updated last month
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https://arxiv.org/abs/2212.01349)☆156Updated last year
- ☆24Updated 4 months ago
- The Benchmark of Linguistic Minimal Pairs☆141Updated last year
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆71Updated last year
- ☆78Updated last month
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆269Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆93Updated last year
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆28Updated 3 weeks ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆350Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆138Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Efficient Low-Memory Aligner☆137Updated 2 months ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆167Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆174Updated last year
- A repository with the code related to experiments around context-aware machine translation☆48Updated 2 years ago
- Yet Another Neural Machine Translation Toolkit☆174Updated 4 months ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆70Updated 3 months ago