gidim / HebrewStopWordsLinks
List of hebrew stop words + script that computed them
☆20Updated 9 years ago
Alternatives and similar repositories for HebrewStopWords
Users that are interested in HebrewStopWords are comparing it to the libraries listed below
Sorting:
- A maximum-strength name parser for record linkage.☆39Updated 3 months ago
- A TextBlob sentiment analysis pipeline component for spaCy.☆56Updated last month
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Guess gender from first name in Python 2 and 3☆138Updated 7 months ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated 3 weeks ago
- Parse numbers written in natural language☆124Updated last year
- A Python Client for collect and parse public data from the Youtube Data API☆81Updated 2 years ago
- Text and statistics utilities from Pew Research Center☆85Updated 3 years ago
- Open Source Proxy Demographic module written in Python☆35Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- Data Donation Module: A Django application to setup and manage data donation projects.☆26Updated last week
- Literature 📄 and datasets 📚 on automatic populism detection☆19Updated 9 months ago
- General programming utilities from Pew Research Center☆70Updated 3 years ago
- A Python port of the #rstats sentimentr package☆10Updated 7 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 7 years ago
- Accurately find/replace/remove emojis in text strings☆163Updated 2 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last month
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- Data on international first names and sex of people with that name☆12Updated 6 years ago
- A data package for R containing historical datasets about gender☆25Updated 3 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆74Updated last year
- Deduplicate and parse list of `dirty names'☆23Updated 5 years ago
- Predict Gender from Names Using Historical Data☆191Updated 4 years ago
- Fast, flexible name matching for large datasets☆71Updated 3 months ago
- Python client for the Genderize.io web service.☆77Updated 5 years ago
- ☆74Updated last week
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆25Updated last year
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- Efficient string matching with regular expressions☆146Updated this week