anzeyimana / kinyabert-acl2022Links
☆19Updated last year
Alternatives and similar repositories for kinyabert-acl2022
Users that are interested in kinyabert-acl2022 are comparing it to the libraries listed below
Sorting:
- NTREX -- News Test References for MT Evaluation☆83Updated last year
- DeepKIN -- A deep learning toolkit for Kinyarwanda NLP.☆11Updated 3 weeks ago
- Agile reading group that works☆13Updated 3 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆45Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆139Updated 2 years ago
- Yet Another Neural Machine Translation Toolkit☆179Updated 3 months ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated last year
- A tool that locates, downloads, and extracts machine translation corpora☆155Updated last month
- a tool for calcualting character n-gram F score☆73Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated this week
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Automated Semantic Analysis of Discourse Markers☆10Updated 3 years ago
- Natural Language Processing Research in North American Linguistics Departments☆21Updated 3 months ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆109Updated 3 months ago
- ☆17Updated 2 years ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆167Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆53Updated 4 years ago
- Code and resources for evaluating cross-lingual embedding spaces☆29Updated 5 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- ☆103Updated 3 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆52Updated 4 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization☆74Updated 3 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆280Updated 5 months ago
- How to finetune mbart using fairseq☆24Updated 4 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Updated 2 months ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆35Updated 6 months ago
- A neural word aligner based on multilingual BERT☆351Updated 3 years ago