maastrichtlawtech / bsardLinks
π A statutory article retrieval dataset in French. (ACL 2022)
β40Updated last year
Alternatives and similar repositories for bsard
Users that are interested in bsard are comparing it to the libraries listed below
Sorting:
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.β79Updated 3 years ago
- A multilingual version of MS MARCO passage ranking datasetβ144Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β337Updated 2 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsβ43Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.β56Updated last year
- Inquisitive Parrots for Searchβ195Updated 2 months ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extractionβ¦β104Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)β114Updated 2 years ago
- β54Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitterβ110Updated last year
- Ensembling Hugging Face transformers made easyβ63Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β41Updated 3 years ago
- Bi-encoder entity linking architectureβ49Updated 11 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.β97Updated 2 years ago
- Dense hybrid representations for text retrievalβ63Updated 2 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in Englishβ216Updated last month
- β45Updated 2 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.β31Updated 2 years ago
- A Framework for Textual Entailment based Zero Shot text classificationβ152Updated last year
- Search Engines with Autoregressive Language modelsβ291Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarizationβ157Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β163Updated last year
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.β104Updated 2 years ago
- Easy modernBERT fine-tuning and multi-task learningβ61Updated last month
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyseriniβ350Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrievalβ29Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relationsβ¦β19Updated 2 years ago
- Efficient Attention for Long Sequence Processingβ98Updated last year
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.β34Updated 3 years ago
- The model implementations for T5 encoder decoder soft prompt tuning for text generation.β24Updated 2 years ago