sharavsambuu / mongolian-text-classification
Cyrillic Mongolian text classification with TensorFlow 2. Also includes fine-tuning of TugsTugi's Mongolian BERT model and other NLP experiments.
☆33 · Updated 2 years ago
Alternatives and similar repositories for mongolian-text-classification
Users who are interested in mongolian-text-classification are comparing it to the repositories listed below.
- Useful resources for Mongolian NLP ☆192 · Updated 11 months ago
- Pre-trained Mongolian BERT models ☆47 · Updated 4 years ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google… ☆18 · Updated 5 months ago
- Arabic Dialect Identification on AOC data. ☆24 · Updated 6 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 · Updated last year
- Mongolian speech recognition with PyTorch ☆137 · Updated 4 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences. ☆454 · Updated last year
- Bitextor generates translation memories from multilingual websites ☆297 · Updated last year
- A tool for converting TMX files into bilingual corpora ☆18 · Updated 5 years ago
- TUFS Asian Language Parallel Corpus ☆51 · Updated 2 years ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm) ☆24 · Updated 4 years ago
- chatbot_ner: Named Entity Recognition for chatbots. ☆331 · Updated 9 months ago
- Transformer based translation quality estimation ☆114 · Updated 2 years ago
- A sentence segmenter that actually works! ☆304 · Updated 5 years ago
- cLang-8 is a dataset for grammatical error correction. ☆110 · Updated 3 years ago
- Neural Machine Translation system for English to Vietnamese (IWSLT'15 English-Vietnamese data) ☆62 · Updated 6 years ago
- Crawler for linguistic corpora ☆211 · Updated 3 months ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys… ☆11 · Updated 3 years ago
- Machine Translation (MT) Preparation Scripts ☆33 · Updated 6 months ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data. ☆48 · Updated 6 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks. ☆41 · Updated last year
- HateEval 2019 - Task 5 ☆16 · Updated 6 years ago
- ☆115 · Updated last month
- A seq2seq model that can correct spelling mistakes. ☆217 · Updated 8 years ago
- Arabic edition of BERT pretrained language models ☆132 · Updated 4 years ago
- Named Entity Recognition in Nepali Language ☆11 · Updated 2 years ago
- Corpora for evaluating NLU services (like API.ai, RASA, Microsoft LUIS, ...) ☆147 · Updated 6 years ago
- Sentence Classifications with Neural Networks ☆237 · Updated 2 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg… ☆942 · Updated last year
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models … ☆231 · Updated 2 years ago