gidim / HebrewStopWords
List of hebrew stop words + script that computed them
☆20Updated 8 years ago
Related projects: ⓘ
- A maximum-strength name parser for record linkage.☆29Updated last month
- Guess gender from first name in Python 2 and 3☆129Updated 2 years ago
- Dataframe Integration with spaCy.☆100Updated 3 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- Bag of, not words, but tricks!☆68Updated 10 months ago
- Yet Another (natural language) Parser☆82Updated last year
- A browser user interface for manual labeling of record pairs.☆41Updated last year
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆62Updated 11 months ago
- Generate reports for spaCy models.☆28Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- A TextBlob sentiment analysis pipeline component for spaCy.☆54Updated 2 years ago
- ☆70Updated last year
- 📂 Additional lookup tables and data resources for spaCy☆98Updated last year
- A data package containing lexicons and dictionaries for text analysis☆110Updated 2 years ago
- ☆65Updated 2 years ago
- ☆29Updated 2 years ago
- ☆71Updated last week
- Easy PDF to text to spaCy text extraction in Python.☆33Updated 11 months ago
- General programming utilities from Pew Research Center☆69Updated 2 years ago
- Tutorials for Stance Detection: A practical guide☆21Updated last year
- ☆17Updated last year
- HeBERT: Pre-training BERT for modern Hebrew☆72Updated last year
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 5 years ago
- Python based Wikidata framework for easy dataframe extraction☆39Updated 9 months ago
- Fuzzy matches and merging of datasets in pandas using csvmatch☆74Updated 4 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 5 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆24Updated 2 weeks ago