dadelani / menyo-20k_MT
☆11Updated 3 years ago
Alternatives and similar repositories for menyo-20k_MT
Users that are interested in menyo-20k_MT are comparing it to the libraries listed below
Sorting:
- ☆22Updated 3 years ago
- Agile reading group that works☆13Updated 3 years ago
- A web interface to understand language-specific BERT-models☆18Updated last year
- Statistics on multilingual datasets☆17Updated 2 years ago
- Multilingual Open Text☆25Updated last week
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 2 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆16Updated 3 years ago
- A library for data streaming and augmentation☆20Updated last week
- ☆15Updated 4 years ago
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago
- ☆17Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆51Updated last year
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 11 months ago
- ☆24Updated 5 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated last year
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Updated 4 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10Updated 2 weeks ago
- ☆10Updated 6 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last week
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 4 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆75Updated last year
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆16Updated 5 years ago
- CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed dat…☆35Updated 4 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 7 months ago
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 years ago