mjpost / sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
☆1,038Updated last month
Related projects: ⓘ
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,181Updated last month
- BERT score for text generation☆1,564Updated last month
- Simple, fast unsupervised word aligner☆732Updated 2 years ago
- Fast BPE☆651Updated 3 months ago
- A full Python Implementation of the ROUGE Metric (not a wrapper)☆665Updated last year
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆685Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,176Updated 6 months ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆628Updated last year
- A Neural Framework for MT Evaluation☆485Updated last month
- Python port of Moses tokenizer, truecaser and normalizer☆486Updated 3 months ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,115Updated last year
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,872Updated last year
- A tool for holistic analysis of language generations systems☆465Updated 2 years ago
- Minimalist NMT for educational purposes☆672Updated 7 months ago
- High-accuracy NLP parser with models for 11 languages.☆858Updated 2 years ago
- ☆358Updated last year
- Tools to download and cleanup Common Crawl data☆961Updated last year
- A python tool for evaluating the quality of sentence embeddings.☆2,081Updated 6 months ago
- Code for using and evaluating SpanBERT.☆884Updated last year
- A framework to learn cross-lingual word embedding mappings☆642Updated last year
- Longformer: The Long-Document Transformer☆2,028Updated last year
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,698Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark☆685Updated 9 months ago
- Evaluation code for various unsupervised automated metrics for Natural Language Generation.☆1,338Updated 3 weeks ago
- Moses, the machine translation system☆1,575Updated 3 months ago
- jiant is an nlp toolkit☆1,637Updated last year
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆432Updated 5 months ago
- Open-Source Neural Machine Translation in Tensorflow☆797Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,086Updated 3 weeks ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆545Updated 2 years ago