sharavsambuu / mongolian-text-classificationLinks
Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP experiments are included.
☆33Updated 3 years ago
Alternatives and similar repositories for mongolian-text-classification
Users that are interested in mongolian-text-classification are comparing it to the libraries listed below
Sorting:
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆18Updated 6 months ago
- Useful resources for Mongolian NLP☆194Updated last year
- Pre-trained Mongolian BERT models☆49Updated 4 years ago
- Neural Adaptive Machine Translation that adapts to context and learns from corrections.☆351Updated 3 years ago
- Bitextor generates translation memories from multilingual websites☆299Updated last year
- Crawler for linguistic corpora☆213Updated 4 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Updated last year
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- chatbot_ner: Named Entity Recognition for chatbots.☆331Updated 2 weeks ago
- Machine Translation for Africa☆303Updated 3 years ago
- A sentence segmenter that actually works!☆304Updated 5 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆313Updated 4 years ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆174Updated 2 weeks ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆457Updated last year
- Datasets and tools for basic natural language processing.☆387Updated 4 years ago
- Bilingual term extractor☆58Updated last month
- Open-Source Machine Translation Quality Estimation in PyTorch☆231Updated 3 years ago
- Facebook Low Resource (FLoRes) MT Benchmark☆757Updated 2 years ago
- Arabic edition of BERT pretrained language models☆132Updated 5 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated 2 years ago
- Improved Sentence Alignment in Linear Time and Space☆186Updated 2 years ago
- Punctuation restoration and spell correction experiments.☆252Updated 4 years ago
- TUFS Asian Language Parallel Corpus☆52Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆162Updated last year
- ☆116Updated 2 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 3 years ago
- Named Entity Recognition in Nepali Language☆11Updated 2 years ago
- An educational tool to train, inspect, evaluate and translate using neural engines☆19Updated 9 months ago
- Sentence Classifications with Neural Networks☆237Updated 2 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆85Updated 6 years ago