dumitrescustefan / RoWordNet
Romanian WordNet (Data + API for Python)
☆49Updated 4 years ago
Alternatives and similar repositories for RoWordNet:
Users that are interested in RoWordNet are comparing it to the libraries listed below
- Romanian Named Entity Corpus (RONEC) version 2.0☆61Updated 2 years ago
- A list of Romanian NLP Datasets☆37Updated 3 months ago
- This repo is the home of Romanian Transformers.☆98Updated 2 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆221Updated 2 years ago
- Bitextor generates translation memories from multilingual websites☆293Updated 2 months ago
- Text and Punctuation correction with Deep Learning☆128Updated 4 years ago
- Punctuation restoration and spell correction experiments.☆250Updated 3 years ago
- A sentence segmenter that actually works!☆303Updated 4 years ago
- A novel dataset for emotion detection from Romanian text.☆17Updated 2 months ago
- 📃Language Model based sentences scoring library☆307Updated 2 years ago
- A list of Natural Language Processing Tools for Romanian☆27Updated 3 years ago
- Polish morphological tagger.☆42Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆234Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated 2 years ago
- Named Entity Recognition for Romanian, based on transformer models☆12Updated 2 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆412Updated last month
- Transformer language model (GPT-2) with sentencepiece tokenizer☆163Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆153Updated 4 years ago
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- Sentiment polarity analysis for english and romanian with tensorflow and tflearn and LSTM cells☆16Updated 6 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆311Updated this week
- A tokenizer and sentence splitter for German and English web and social media texts.☆137Updated last month
- OpusFilter - Parallel corpus processing toolkit☆104Updated this week
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆159Updated 3 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆152Updated 7 months ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆83Updated 5 years ago
- Terminology EXtraction and Text Analytics (TEXTA) Toolkit☆34Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆77Updated last year
- Stanford's Alexa Prize socialbot☆133Updated last year