workforce-data-initiative / skills-labeller
A WDI system for labelling and extracting skills within job postings. Implements an entire intelligent system utilizing a front end, pulling down job postings and online learning all under constrained system resources (e.g. EC2 micro/small) for ease of public use.
☆13Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for skills-labeller
- Data Processing and Machine learning methods for the Open Skills Project☆169Updated this week
- Predict age and gender from a first name☆60Updated 6 years ago
- Fast, flexible name matching for large datasets☆70Updated 11 months ago
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- Dataframe Integration with spaCy.☆102Updated 3 years ago
- Package for performing Reddit-based text analysis☆20Updated 5 years ago
- Python library providing sentiment lexicons.☆26Updated 7 years ago
- Scalable String Similarity Joins in Python☆39Updated 4 months ago
- classify a job description (or noisy job title) into a ONET job title☆17Updated 8 years ago
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- ☆28Updated 4 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 3 weeks ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- Data Server for Topic Models☆121Updated last year
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated last year
- ☆71Updated this week
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- A Python package for gender classification.☆81Updated last year
- [development moved to termite-data-server]☆61Updated 10 years ago
- Running Prodigy for a team of annotators☆53Updated 3 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆60Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds☆155Updated last year