pln-fing-udelar / jojajovaiLinks
Jojajovai Guarani-Spanish Parallel Corpus
☆15Updated 3 years ago
Alternatives and similar repositories for jojajovai
Users that are interested in jojajovai are comparing it to the libraries listed below
Sorting:
- ☆44Updated 3 years ago
- ☆64Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- spaCy + UDPipe☆162Updated 3 years ago
- A french sequence to sequence pretrained model☆62Updated 2 years ago
- ☆49Updated last year
- Easier Automatic Sentence Simplification Evaluation☆161Updated last year
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- Language Models for Zalando's flair library☆61Updated 5 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆183Updated 3 weeks ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆373Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- Appraise code used as part of WMT21 human evaluation campaign☆27Updated this week
- Transformer based translation quality estimation☆113Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.☆24Updated last year
- ☆139Updated last year
- Efficient Low-Memory Aligner☆146Updated 6 months ago
- The Benchmark of Linguistic Minimal Pairs☆151Updated 2 years ago
- Tools for compiling corpora from Common Crawl☆14Updated 8 months ago
- The FLORES+ Machine Translation Benchmark☆106Updated 9 months ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆317Updated last week
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated last week
- A neural word aligner based on multilingual BERT☆354Updated 3 years ago
- Dataset of ML and NLP papers☆34Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago