NNLP-IL / Hebrew-ResourcesLinks
A comprehensive list of Hebrew NLP resources.
☆283Updated 8 months ago
Alternatives and similar repositories for Hebrew-Resources
Users that are interested in Hebrew-Resources are comparing it to the libraries listed below
Sorting:
- A curated list of resources for NLP (Natural Language Processing) for Hebrew☆108Updated 2 years ago
- The Vision and goals of the Open Natural Language Processing in Hebrew Project☆108Updated 7 years ago
- HeBERT: Pre-training BERT for modern Hebrew☆80Updated 2 years ago
- Yet Another (natural language) Parser☆87Updated 3 years ago
- Neural Modeling for Named Entities and Morphology (Hebrew NER)☆32Updated 3 years ago
- An NLP pipeline for Hebrew☆40Updated 6 months ago
- ☆55Updated 3 years ago
- A national initiative for the creation of infrastructure, research and development of advanced capabilities for the advancement of the fi…☆38Updated 3 years ago
- Dump of Project Ben-Yehuda's public domain texts☆31Updated 2 months ago
- Hebrew word lists☆48Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- Neural Sentiment Analyzer for Modern Hebrew☆43Updated 5 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆86Updated 3 years ago
- Latin BERT☆69Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 3 years ago
- Segment documents into coherent parts using word embeddings.☆149Updated 3 years ago
- A Python library for calculating a large variety of metrics from text☆359Updated last year
- Hebrew oriented NER spaCy pipeline☆21Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Updated 11 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆182Updated 7 months ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆114Updated 7 years ago
- Unannotated Spanish 3 Billion Words Corpora☆104Updated 3 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 11 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆208Updated 3 years ago
- A character-wise tokenizer for morphologically rich languages☆29Updated 3 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- A multilingual parallel corpus created from translations of the Bible.☆191Updated 7 months ago
- ☆45Updated 3 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆79Updated 4 years ago