pln-fing-udelar / jojajovaiLinks
Jojajovai Guarani-Spanish Parallel Corpus
☆15Updated 3 years ago
Alternatives and similar repositories for jojajovai
Users that are interested in jojajovai are comparing it to the libraries listed below
Sorting:
- ☆44Updated 3 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- A french sequence to sequence pretrained model☆62Updated 2 years ago
- ☆64Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆162Updated last year
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- TUFS Asian Language Parallel Corpus☆50Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 2 years ago
- A multilingual lexicon of words to hurt.☆89Updated this week
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆106Updated 2 weeks ago
- Transformer based translation quality estimation☆112Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- ☆35Updated 3 years ago
- Live survey of off-the-shelf language identification tools for python☆26Updated 3 years ago
- ☆49Updated 11 months ago
- Efficient Low-Memory Aligner☆146Updated 6 months ago
- ☆109Updated last year
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆157Updated 2 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆316Updated last month
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated 11 months ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- Language Models for Zalando's flair library☆61Updated 5 years ago
- Bilingual term extractor☆54Updated last year
- Dataset of ML and NLP papers☆34Updated 2 years ago
- 🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.☆13Updated 3 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆155Updated last month