junhua / IPOD
A Corpus of 475,000 Industrial Occupations
☆66Updated 4 years ago
Alternatives and similar repositories for IPOD:
Users that are interested in IPOD are comparing it to the libraries listed below
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago
- Entity Disambiguation as text extraction (ACL 2022)☆181Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- The dataset used to evaluate JobBERT on the task of job title normalization.☆26Updated 2 years ago
- Code and Dataset for the Bhola et al. (2020) Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classifi…☆53Updated 3 years ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆13Updated 8 months ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆31Updated 3 years ago
- Nesta's Skills Extractor Library☆129Updated 4 months ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆60Updated last month
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- ☆18Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 10 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆109Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆90Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- ☆61Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago
- ☆54Updated 3 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆66Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 7 months ago
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction☆81Updated 4 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated 11 months ago
- Semantically Structured Sentence Embeddings☆65Updated 5 months ago
- Robust and fast topic models with sentence-transformers.☆46Updated this week