sharavsambuu / mongolian-text-classificationLinks
Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP experiments are included.
☆33Updated 2 years ago
Alternatives and similar repositories for mongolian-text-classification
Users that are interested in mongolian-text-classification are comparing it to the libraries listed below
Sorting:
- Useful resources for Mongolian NLP☆190Updated 9 months ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆18Updated 2 months ago
- Pre-trained Mongolian BERT models☆47Updated 4 years ago
- Mongolian speech recognition with PyTorch☆135Updated 4 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Machine Translation Web Interface for OpenNMT-py☆25Updated 3 years ago
- Sentence Classifications with Neural Networks☆237Updated 2 years ago
- Datasets and tools for basic natural language processing.☆385Updated 4 years ago
- Bitextor generates translation memories from multilingual websites☆295Updated 10 months ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 3 years ago
- The Mongolian Wordnet (MonWN)☆18Updated 3 years ago
- ☆111Updated last year
- Arabic edition of BERT pretrained language models☆132Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Machine Translation (MT) Preparation Scripts☆33Updated 3 months ago
- multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.☆372Updated 2 years ago
- A sentence segmenter that actually works!☆305Updated 5 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆450Updated last year
- A seq2seq model that can correct spelling mistakes.☆217Updated 8 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆391Updated 4 years ago
- Machine Translation for Africa☆296Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 11 months ago
- Hotels Arabic-Reviews Dataset☆32Updated 6 years ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆346Updated 2 years ago
- ☆501Updated 5 years ago
- Punctuation restoration and spell correction experiments.☆250Updated 4 years ago
- This repository contains examples of custom components for educational purposes.☆194Updated last year
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆764Updated 2 months ago