workforce-data-initiative / skills-labeller
A WDI system for labelling and extracting skills within job postings. Implements an entire intelligent system utilizing a front end, pulling down job postings and online learning all under constrained system resources (e.g. EC2 micro/small) for ease of public use.
☆13Updated 6 years ago
Alternatives and similar repositories for skills-labeller:
Users that are interested in skills-labeller are comparing it to the libraries listed below
- Predict age and gender from a first name☆60Updated 6 years ago
- Fast, flexible name matching for large datasets☆70Updated last year
- Data Processing and Machine learning methods for the Open Skills Project☆170Updated 3 months ago
- classify a job description (or noisy job title) into a ONET job title☆18Updated 8 years ago
- Package for performing Reddit-based text analysis☆20Updated 6 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated last year
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated 2 years ago
- A browser user interface for manual labeling of record pairs.☆45Updated last year
- ☆46Updated 2 weeks ago
- Skill Representations in Vector Space☆34Updated last year
- Python library providing sentiment lexicons.☆26Updated 8 years ago
- Record Linkage ToolKit (Find and link entities)☆109Updated last year
- Scrape Indeed for job listings and Indeed & Glassdoor for company reviews. Topic model the reviews.☆36Updated 9 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis.☆20Updated 7 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- White house data jam: Skill extraction from unstructured text.☆27Updated 10 years ago
- Scalable String Similarity Joins in Python☆38Updated 7 months ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 4 months ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- The Python-language successor to the TABARI event-data coding software.☆45Updated 7 years ago
- Topic modelling on financial news with Natural Language Processing☆58Updated 7 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆62Updated last year
- Stability analysis for topic models☆51Updated 8 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- An Easy to Use, Accurate Python Geolocation Library☆40Updated 2 years ago
- Package that returns a company embedding given a company name☆45Updated 4 years ago