jneidel / job-titles
Normalized dataset of 70k job titles
☆64Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for job-titles
- Open Source Thesaurus of Job Titles in US English☆136Updated 2 years ago
- Index Common Crawl archives in tabular format☆106Updated this week
- ☆16Updated 6 years ago
- new skills taxonomy using TextKernel data☆30Updated 2 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆56Updated 9 months ago
- A Corpus of 475,000 Industrial Occupations☆63Updated 4 years ago
- The dataset used to evaluate JobBERT on the task of job title normalization.☆22Updated 2 years ago
- Statistics of Common Crawl monthly archives mined from URL index files☆157Updated this week
- Nesta's Skills Extractor Library☆123Updated 3 weeks ago
- find any kind of occupation or job title in a text or file☆82Updated 10 months ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser…☆40Updated last month
- multi-labeled dataset of resumes☆71Updated 3 years ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- https://duyet.github.io/related-skills-visualization/index.html☆11Updated 4 years ago
- Building a Job Dataset☆21Updated 2 years ago
- Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classifi…☆52Updated 3 years ago
- Dataset and pre-trained model for Skill2vec☆75Updated 4 months ago
- A dataset for pretraining language models targeted for legal tasks.☆122Updated 2 years ago
- A Named Entity Recognition system that extracts soft skills from text☆27Updated 3 months ago
- A python utility for downloading Common Crawl data☆225Updated last year
- LexPredict Legal Dictionaries☆111Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆105Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated 2 years ago
- Source code for the Medium article "Extracting the author of news stories with DOM-based segmentation and BERT"☆29Updated 4 years ago
- The Open Jobs Observatory public mirror repo☆20Updated last year
- ☆14Updated last year
- A natural language search microservice☆96Updated 3 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆115Updated 7 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago