junhua / IPOD
A Corpus of 475,000 Industrial Occupations
☆63Updated 4 years ago
Alternatives and similar repositories for IPOD:
Users that are interested in IPOD are comparing it to the libraries listed below
- ☆54Updated 3 years ago
- The dataset used to evaluate JobBERT on the task of job title normalization.☆23Updated 2 years ago
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆56Updated 11 months ago
- Find text features that are most related to an outcome, controlling for confounds.☆60Updated 5 months ago
- Sentence transformers models for SpaCy☆107Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.☆144Updated 7 months ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆50Updated 4 years ago
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆86Updated 10 months ago
- Nesta's Skills Extractor Library☆124Updated 2 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 10 months ago
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction☆80Updated 3 years ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 8 months ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆13Updated 6 months ago
- ☆70Updated 10 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆181Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆107Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆103Updated 11 months ago
- Self-Supervision for Named Entity Disambiguation at the Tail☆214Updated 2 years ago
- Entity Disambiguation as text extraction (ACL 2022)☆178Updated 2 years ago