gidim / HebrewStopWords
List of Hebrew stop words + script that computed them
☆20 Updated 9 years ago
Alternatives and similar repositories for HebrewStopWords:
Users who are interested in HebrewStopWords are comparing it to the libraries listed below:
- A maximum-strength name parser for record linkage. ☆36 Updated 3 weeks ago
- Guess gender from first name in Python 2 and 3 ☆133 Updated 2 years ago
- Literature and datasets on automatic populism detection ☆18 Updated last month
- Data Donation Module: A Django application to set up and manage data donation projects. ☆23 Updated last month
- General programming utilities from Pew Research Center ☆69 Updated 3 years ago
- A Python package with methods to handle the complexities of Hebrew text, calculate Gematria, and more. ☆39 Updated 8 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 Updated last year
- Hierarchical clustering of 2011-2022 Congress Twitter ☆29 Updated 2 years ago
- Parse numbers written in natural language ☆113 Updated 6 months ago
- Text and statistics utilities from Pew Research Center ☆84 Updated 3 years ago
- A Python port of the #rstats sentimentr package ☆10 Updated 6 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 6 years ago
- idiolect: An R package for forensic authorship analysis ☆14 Updated 2 weeks ago
- A data package for R containing historical datasets about gender ☆25 Updated 3 years ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated last year
- Tools for training and evaluating word embeddings based on subtitles. Published as "subs2vec: Word embeddings from subtitles in 55 langua… ☆33 Updated 4 years ago
- The NLP Bias Identification Toolkit ☆36 Updated last year
- Hebrew oriented NER spaCy pipeline ☆16 Updated 8 months ago
- Repository for public code and data associated with the paper "Fake News on Twitter During the 2016 U.S. Presidential Election" ☆11 Updated 5 years ago
- R package for Google Document AI ☆42 Updated 5 months ago
- A helper library full of URL-related heuristics. ☆69 Updated last month
- Easy PDF to text to spaCy text extraction in Python. ☆39 Updated 6 months ago
- Additional lookup tables and data resources for spaCy ☆106 Updated 3 months ago
- QualtricsAPI is a lightweight Python Package for the Qualtrics API. ☆22 Updated 10 months ago
- Tutorials for Stance Detection: A practical guide ☆22 Updated 2 years ago
- R package associated with Benoit, Munger and Spirling (2017) paper(s) ☆43 Updated 3 years ago
- Python client for the Genderize.io web service. ☆75 Updated 5 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆112 Updated 4 months ago
- A set of Jupyter notebooks demonstrating how to use the Media Cloud API. ☆37 Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds ☆155 Updated last year