workforce-data-initiative / skills-labellerLinks
A WDI system for labelling and extracting skills within job postings. Implements an entire intelligent system utilizing a front end, pulling down job postings and online learning all under constrained system resources (e.g. EC2 micro/small) for ease of public use.
☆14Updated 7 years ago
Alternatives and similar repositories for skills-labeller
Users that are interested in skills-labeller are comparing it to the libraries listed below
Sorting:
- Data Processing and Machine learning methods for the Open Skills Project☆171Updated 7 months ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Predict Race and Ethnicity Based on the Sequence of Characters in a Name☆245Updated 2 months ago
- Package that returns a company embedding given a company name☆46Updated 5 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Simplifies use of the Dedupe library via Pandas☆136Updated 2 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- Open Source Proxy Demographic module written in Python☆35Updated last year
- Fast, flexible name matching for large datasets☆72Updated 2 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆83Updated 2 years ago
- Guess gender from first name in Python 2 and 3☆137Updated 2 months ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Python client for the Genderize.io web service.☆76Updated 5 years ago
- An introduction to using spaCy for NLP and machine learning☆191Updated 3 years ago
- A Python package for gender classification.☆86Updated 2 years ago
- Language detection extension for spaCy 2.0+☆113Updated 6 years ago
- I analysed online user comments on articles by German news publishers SPON, ZEIT, and Focus☆19Updated 7 years ago
- Tool for exploring Word Vector models☆179Updated 7 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆156Updated 2 years ago
- Geotext extracts country and city mentions from text☆139Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- 📛 Fuzzy Name Matching with Machine Learning☆264Updated last year
- A fork of boilerpipe with python 3 and small fixes, ported from source `https://pypi.python.org/pypi/boilerpipe-py3.☆45Updated 5 years ago
- An open-source implementation of the Linguistic Inquiry Word Count in Python☆15Updated 8 years ago