workforce-data-initiative / skills-labeller
A WDI system for labelling and extracting skills within job postings. Implements a complete intelligent system, comprising a front end, job-posting retrieval, and online learning, all running under constrained system resources (e.g. EC2 micro/small) for ease of public use.
☆14 Updated 7 years ago
Alternatives and similar repositories for skills-labeller
Users who are interested in skills-labeller are comparing it to the libraries listed below.
- Data Processing and Machine learning methods for the Open Skills Project ☆172 Updated 10 months ago
- Predict age and gender from a first name ☆59 Updated 7 years ago
- Predict Race and Ethnicity Based on the Sequence of Characters in a Name ☆248 Updated 3 weeks ago
- Python client for the Genderize.io web service. ☆76 Updated 5 years ago
- Extract countries, regions and cities from a URL or text ☆217 Updated 5 years ago
- An introduction to using spaCy for NLP and machine learning ☆192 Updated 3 years ago
- An open-source implementation of the Linguistic Inquiry Word Count in Python ☆16 Updated 8 years ago
- Data Server for Topic Models ☆122 Updated 2 years ago
- Trend detection algorithms for Twitter time series data ☆193 Updated 8 years ago
- White House data jam: Skill extraction from unstructured text. ☆27 Updated 10 years ago
- [development moved to termite-data-server] ☆61 Updated 11 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.… ☆83 Updated 2 years ago
- Tutorial code and data for the entity resolution workshops. ☆45 Updated 10 years ago
- Tool for exploring Word Vector models ☆180 Updated 7 years ago
- Python package for API access to news articles and events in the Event Registry ☆242 Updated 2 years ago
- Scalable String Similarity Joins in Python ☆39 Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds ☆157 Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated 2 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis. ☆21 Updated 8 years ago
- Scrapes Google Trends data over long timescales and stitches together for daily data ☆72 Updated 5 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- A fork of boilerpipe with Python 3 and small fixes, ported from source https://pypi.python.org/pypi/boilerpipe-py3. ☆45 Updated 5 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 Updated 8 years ago
- Genderizer is a language independent module which tries to detect gender by looking at given first names and/or analyzing sample texts. ☆64 Updated 11 years ago
- ☆46 Updated 2 months ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆182 Updated 2 years ago
- A Python tool for collecting tweets in MongoDB using the search API ☆80 Updated 2 years ago
- ☆59 Updated 4 years ago
- Geotext extracts country and city mentions from text ☆139 Updated 2 years ago
- Quickly extract multi-word phrases from a corpus ☆194 Updated 5 years ago