roshan-research / hazm
Persian NLP Toolkit
☆1,205Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for hazm
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆350Updated last year
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- Open neural machine translation models and web services☆616Updated last month
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,674Updated this week
- Multilingual text (NLP) processing toolkit☆2,313Updated 11 months ago
- A tool for holistic analysis of language generations systems☆466Updated 2 years ago
- Open-Source Neural Machine Translation in Tensorflow☆797Updated last year
- Moses, the machine translation system☆1,583Updated 5 months ago
- A BERT model for scientific text.☆1,520Updated 2 years ago
- Neural Adaptive Machine Translation that adapts to context and learns from corrections.☆342Updated 2 years ago
- NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on differe…☆675Updated 4 years ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,028Updated last week
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,888Updated 2 years ago
- Language-Agnostic SEntence Representations☆3,596Updated 6 months ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆828Updated 6 months ago
- A tool for extracting plain text from Wikipedia dumps☆3,746Updated 5 months ago
- Machine-Translation-based sentence alignment tool for parallel text☆299Updated 3 years ago
- Science Parse parses scientific papers (in PDF form) and returns them in structured form.☆626Updated 5 months ago
- An educational tool to train, inspect, evaluate and translate using neural engines☆18Updated last year
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆631Updated last year
- Module for automatic summarization of text documents and HTML pages.☆3,520Updated 5 months ago
- A framework to learn cross-lingual word embedding mappings☆645Updated last year
- Bitextor generates translation memories from multilingual websites☆290Updated last month
- Crawler for linguistic corpora☆192Updated 11 months ago
- Sentence aligner☆108Updated 3 years ago
- ☆22Updated 4 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,184Updated last month
- Entity Linker solution☆1,170Updated last year
- Named Entity Recognition data for Europeana Newspapers☆172Updated last year
- Minimalist NMT for educational purposes☆675Updated 9 months ago