jneidel / job-titles
Normalized dataset of 70k job titles
☆72 Updated last year
Alternatives and similar repositories for job-titles
Users who are interested in job-titles are comparing it to the libraries listed below
- Open Source Thesaurus of Job Titles in US English ☆140 Updated 3 years ago
- Index Common Crawl archives in tabular format ☆124 Updated 2 weeks ago
- A (smart) rule based NLP module to extract job skills from text ☆201 Updated last year
- multi-labeled dataset of resumes ☆102 Updated 4 years ago
- Data Processing and Machine learning methods for the Open Skills Project ☆173 Updated last year
- ☆16 Updated 7 years ago
- Nesta's Skills Extractor Library ☆150 Updated 7 months ago
- The definitive guide to using Vector Search to solve your semantic search production workload needs. ☆270 Updated 2 years ago
- Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classifi… ☆56 Updated 4 years ago
- find any kind of occupation or job title in a text or file ☆83 Updated last year
- Process Common Crawl data with Python and Spark ☆452 Updated last month
- new skills taxonomy using TextKernel data ☆36 Updated 3 years ago
- LexPredict Legal Dictionaries ☆131 Updated 3 years ago
- 🎀 JavaScript API for spaCy with Python REST API ☆199 Updated 2 years ago
- spaCy REST API, wrapped in a Docker container. ☆16 Updated 4 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆58 Updated last year
- Extracting addresses from text ☆44 Updated 7 years ago
- A simple Python script to crawl complete list of LinkedIn skills ☆122 Updated 7 years ago
- A word embedding and graph-based keyword extraction tool ☆19 Updated 2 months ago
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments ☆55 Updated last year
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings ☆65 Updated 10 months ago
- A spaCy wrapper for GliNER ☆128 Updated 11 months ago
- A Corpus of 475,000 Industrial Occupations ☆70 Updated 5 years ago
- JSON-NLP Schema for transfer of NLP output using JSON ☆54 Updated 5 years ago
- Matches a category of Google's Taxonomy to product that is described in any kind of text data ☆63 Updated 7 years ago
- Pinecone text client library ☆66 Updated 4 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆142 Updated 2 months ago
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆76 Updated 4 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆169 Updated 3 years ago
- News crawling with StormCrawler - stores content as WARC ☆361 Updated 10 months ago