portmind / armenian-ocr
☆11Updated 2 years ago
Alternatives and similar repositories for armenian-ocr:
Users that are interested in armenian-ocr are comparing it to the libraries listed below
- Open language modeling toolkit based on PyTorch☆109Updated 2 weeks ago
- ☆25Updated last month
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆72Updated 2 weeks ago
- Multilingual sentence alignment using sentence embeddings☆114Updated 5 months ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆29Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆77Updated last week
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆127Updated 4 months ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆25Updated 2 years ago
- Improved Sentence Alignment in Linear Time and Space☆169Updated 2 years ago
- Terminal UI for monitoring SLURM jobs☆10Updated 3 weeks ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆25Updated 2 years ago
- ☆47Updated 8 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆154Updated 4 months ago
- The central repo for Creole based NLU and NLG work☆18Updated 10 months ago
- Library for pruning experts per language pair in NLLB-200☆33Updated last year
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆29Updated 3 months ago
- ☆9Updated 2 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated 10 months ago
- Machine Translation (MT) Preparation Scripts☆31Updated last month
- Sentence aligner☆112Updated 3 years ago
- Extracts parallel corpora from the 2 raw texts in different languages.☆35Updated 2 years ago
- ☆25Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆145Updated last year
- A neural word aligner based on multilingual BERT☆346Updated 3 years ago
- Bicleaner fork that uses neural networks☆39Updated 8 months ago
- Python Finite-State Toolkit☆54Updated last month
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆44Updated 2 years ago
- Neural based model for automatic diacritics restoration.☆25Updated 6 years ago
- A comprehensive list of Arabic NLP resources.☆31Updated 4 months ago
- Finetuning Whisper ASR model for Belarusian language☆17Updated 2 months ago