maastrichtlawtech / bsardLinks
π A statutory article retrieval dataset in French. (ACL 2022)
β40Updated 2 years ago
Alternatives and similar repositories for bsard
Users that are interested in bsard are comparing it to the libraries listed below
Sorting:
- A multilingual version of MS MARCO passage ranking datasetβ145Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.β98Updated 2 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddingsβ44Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β338Updated 2 years ago
- Inquisitive Parrots for Searchβ199Updated 7 months ago
- β15Updated 3 weeks ago
- A Python Search Engine for Humans π₯Έβ243Updated 2 weeks ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β54Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β41Updated 4 years ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extractionβ¦β106Updated last year
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in Englishβ232Updated 5 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.β79Updated 3 years ago
- β88Updated 9 months ago
- Bi-encoder entity linking architectureβ51Updated last year
- Using business-level retrieval system (BM25) with Python in just a few lines.β31Updated 2 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatioβ¦β45Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarizationβ156Updated 3 years ago
- multimodal document analysisβ166Updated last month
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held β¦β41Updated 2 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.β57Updated last year
- Easy modernBERT fine-tuning and multi-task learningβ63Updated 6 months ago
- β54Updated 2 years ago
- Search Engines with Autoregressive Language modelsβ295Updated 2 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)β117Updated 3 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataβ¦β94Updated 2 years ago
- β60Updated 3 years ago
- β80Updated last year
- Long Document Summarization Papersβ154Updated 2 years ago
- Neural information retrieval / Semantic search / Bi-encodersβ174Updated 2 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyseriniβ352Updated 2 years ago