MeLeLBGU / tokenizers_intrinsic_benchmarkLinks
Code for the paper "Greed is All You Need: An Evaluation of Tokenizer Inference Methods"
☆10Updated 11 months ago
Alternatives and similar repositories for tokenizers_intrinsic_benchmark
Users that are interested in tokenizers_intrinsic_benchmark are comparing it to the libraries listed below
Sorting:
- ☆230Updated 4 years ago
- A simple library for querying the URIEL typological database.☆91Updated last year
- Automated Semantic Analysis of Discourse Markers☆10Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation☆162Updated 2 years ago
- A neural word aligner based on multilingual BERT☆358Updated 3 years ago
- Repository for DISRPT2023 shared task☆17Updated last year
- Code and data for "A fine-grained comparison of pragmatic language understanding in humans and language models"☆11Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs☆155Updated 2 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆184Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆379Updated last year
- Diagnostic tests for linguistic capacities in language models☆65Updated 3 years ago
- ☆55Updated 3 years ago
- Utility for behavioral and representational analyses of Language Models☆165Updated last month
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Updated last year
- Appraise code used as part of WMT21 human evaluation campaign☆29Updated 3 weeks ago
- Natural Language Processing Research in North American Linguistics Departments☆20Updated 7 months ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated 9 months ago
- Efficient Low-Memory Aligner☆146Updated 9 months ago
- A tool for holistic analysis of language generations systems☆471Updated last month
- Find informative examples to efficiently (human)-evaluate NLG models.☆16Updated 3 weeks ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆98Updated 5 years ago
- a tool for calcualting character n-gram F score☆74Updated 2 years ago
- ☆33Updated 2 months ago
- The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).☆239Updated 6 months ago
- PENMAN notation (e.g. AMR) in Python☆146Updated last year
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se…☆19Updated 4 years ago
- Lexical Substitution Framework☆46Updated 2 years ago
- Various utility scripts useful for natural language processing, machine translation, etc.☆49Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆159Updated last month