jensjorisdecorte / Skill-Extraction-benchmarkLinks
Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.
☆17Updated last year
Alternatives and similar repositories for Skill-Extraction-benchmark
Users that are interested in Skill-Extraction-benchmark are comparing it to the libraries listed below
Sorting:
- provides a common interface to many IR measure tools☆94Updated last month
- ☆64Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Updated last year
- Inquisitive Parrots for Search☆199Updated 6 months ago
- ☆46Updated 3 years ago
- ☆52Updated 5 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- ☆88Updated 9 months ago
- ☆46Updated 2 years ago
- Retrieval-Augmented Generation battle!☆61Updated 5 months ago
- Unified Learned Sparse Retrieval Framework☆68Updated last year
- ☆80Updated last year
- A RAG that can scale 🧑🏻💻☆11Updated last year
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆75Updated this week
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- ☆16Updated 2 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆15Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 4 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆231Updated 5 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆94Updated 2 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Updated last year
- A multilingual version of MS MARCO passage ranking dataset☆145Updated 2 years ago
- ☆19Updated 4 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Updated last year
- Semantically Structured Sentence Embeddings☆69Updated last year
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆137Updated 5 months ago
- ☆37Updated last month
- A Corpus of 475,000 Industrial Occupations☆70Updated 5 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Updated last year