jensjorisdecorte / Skill-Extraction-benchmark
Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.
☆13 Updated 11 months ago
Alternatives and similar repositories for Skill-Extraction-benchmark
Users interested in Skill-Extraction-benchmark are comparing it to the repositories listed below.
- ☆47 Updated 3 years ago
- ☆86 Updated 2 months ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings ☆43 Updated last year
- ☆48 Updated 4 months ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆129 Updated last year
- ☆60 Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 Updated last year
- Dense hybrid representations for text retrieval ☆63 Updated 2 years ago
- Semantically Structured Sentence Embeddings ☆66 Updated 8 months ago
- ☆38 Updated 6 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆40 Updated 3 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆56 Updated last week
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆76 Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset. ☆45 Updated last year
- ☆29 Updated last year
- Vespa application making an index of the CORD-19 dataset. ☆39 Updated 5 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆84 Updated 10 months ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆69 Updated 2 years ago
- A Corpus of 475,000 Industrial Occupations ☆67 Updated 4 years ago
- ☆16 Updated 6 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆109 Updated last year
- Automatically detect errors in annotated corpora. ☆47 Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆41 Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset ☆145 Updated last year
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆33 Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval ☆29 Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated 2 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run the experiment in only 5 minutes. Experiments on Colab Pro GPU are also supported! ☆34 Updated 4 years ago
- ☆54 Updated 2 years ago