MarsPanther / crawl-for-parallel-corpora
Simple bs4-based web crawl for a corpus in need of statistical machine translation
☆13Updated 3 years ago
Alternatives and similar repositories for crawl-for-parallel-corpora
Users that are interested in crawl-for-parallel-corpora are comparing it to the libraries listed below
- ☆15Updated 5 years ago
- Morphological processing for languages of the Horn of Africa☆46Updated this week
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 3 years ago
- Different semantic models for Amharic☆21Updated last year
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Updated 8 years ago
- Amharic English Machine Translation Corpus prepared through website crawling and custom preprocessing.☆43Updated 6 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- The set of files used for the development of the Amharic Corpus.☆11Updated 8 years ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆13Updated 3 weeks ago
- Lexical Data of Ge'ez Languages☆54Updated 2 years ago
- Best Practices in Translation Memory Management☆45Updated 6 years ago
- Amharic/Tigrinya/Oromo Dictionaries☆38Updated last year
- Resources to go with the Indic NLP Library☆73Updated 3 years ago
- ☆64Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- Transliteration module for Indian Languages☆78Updated last year
- Machine Translation for Africa☆289Updated 3 years ago
- A python code to detect emotions from text☆62Updated 8 years ago
- Hindi POS Tags and keywords using TNT model. Created Date: 28 Sept 2018☆25Updated 5 years ago
- Python API to access glottolog/glottolog☆29Updated 2 weeks ago
- How to extract sentiment from opinions without any labels☆139Updated 3 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated 10 months ago
- The NLG tool for Finnish☆23Updated last year
- Python package for indic script transliteration☆187Updated 3 weeks ago
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Crowd sourced training data for Rasa NLU models☆201Updated last year
- In this notebook, we will build an abstractive text summarizer using deep learning from scratch in Python using Keras☆210Updated 3 years ago
- A Simple Flask App to interact with your Machine Translation Model☆12Updated 5 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago