jneidel / job-titlesLinks
Normalized dataset of 70k job titles
☆70Updated last year
Alternatives and similar repositories for job-titles
Users that are interested in job-titles are comparing it to the libraries listed below
Sorting:
- Open Source Thesaurus of Job Titles in US English☆139Updated 3 years ago
- LexPredict Legal Dictionaries☆124Updated 3 years ago
- Data Processing and Machine learning methods for the Open Skills Project☆172Updated 9 months ago
- A (smart) rule based NLP module to extract job skills from text☆191Updated last year
- Index Common Crawl archives in tabular format☆122Updated last month
- ☆16Updated 7 years ago
- A python utility for downloading Common Crawl data☆244Updated 2 years ago
- spaCy REST API, wrapped in a Docker container.☆16Updated 4 years ago
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments☆53Updated 8 months ago
- This repository contains an implementation of a US address parser built using spaCy NLP library.☆38Updated 2 years ago
- Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classifi…☆54Updated 3 years ago
- Nesta's Skills Extractor Library☆141Updated 2 months ago
- Code and data for the paper 'The impact of founder personalities on startup success'☆16Updated last year
- A dataset for pretraining language models targeted for legal tasks.☆138Updated 3 years ago
- JSON-NLP Schema for transfer of NLP output using JSON☆54Updated 5 years ago
- multi-labeled dataset of resumes☆95Updated 4 years ago
- Collection of Datasets for Legal Text Processing☆118Updated last month
- 🎀 JavaScript API for spaCy with Python REST API☆196Updated last year
- A Sample repo using the Apriori and FP Growth algorithms to produce categories for queries, and BERT for PoP change visualization.☆39Updated 3 years ago
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- A word embedding and graph-based keyword extraction tool☆17Updated 3 months ago
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆77Updated last year
- Process Common Crawl data with Python and Spark☆440Updated this week
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆219Updated 7 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated 2 years ago
- Pinecone text client library☆65Updated 3 weeks ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- LexNLP by LexPredict☆744Updated last year
- Article extraction benchmark: dataset and evaluation scripts☆321Updated last year