jensjorisdecorte / Skill-Extraction-benchmark
Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.
☆13Updated 6 months ago
Alternatives and similar repositories for Skill-Extraction-benchmark:
Users who are interested in Skill-Extraction-benchmark are comparing it to the libraries listed below.
- ☆45Updated 2 years ago
- ☆38Updated last month
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆37Updated 10 months ago
- ☆57Updated 9 months ago
- ☆55Updated 2 years ago
- ☆31Updated 6 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆43Updated this week
- ☆31Updated this week
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆56Updated 5 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆31Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆104Updated 8 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆43Updated last year
- ☆67Updated 3 months ago
- ☆15Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆124Updated 10 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆26Updated 3 weeks ago
- The dataset used to evaluate JobBERT on the task of job title normalization.☆23Updated 2 years ago
- Dense hybrid representations for text retrieval☆61Updated last year
- provides a common interface to many IR measure tools☆80Updated last month
- ☆30Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆28Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆66Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held …☆40Updated last year
- ☆36Updated 2 years ago
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆109Updated 3 years ago
- Robust and fast topic models with sentence-transformers.☆42Updated last week
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆58Updated 3 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run the experiment in only 5 minutes. Experiments on a Colab Pro GPU are also supported!☆34Updated 3 years ago