maastrichtlawtech / bsard
A statutory article retrieval dataset in French. (ACL 2022)
★40 · Updated last year
Alternatives and similar repositories for bsard
Users who are interested in bsard are comparing it to the libraries listed below.
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings ★43 · Updated last year
- A multilingual version of MS MARCO passage ranking dataset ★144 · Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ★77 · Updated 3 years ago
- Inquisitive Parrots for Search ★193 · Updated last month
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ★40 · Updated 3 years ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction… ★104 · Updated last year
- ★42 · Updated 2 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ★55 · Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022) ★114 · Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter ★108 · Updated last year
- ★54 · Updated 2 years ago
- Ensembling Hugging Face transformers made easy ★63 · Updated 2 years ago
- Long Document Summarization Papers ★148 · Updated last year
- Using business-level retrieval system (BM25) with Python in just a few lines. ★31 · Updated 2 years ago
- ★37 · Updated 7 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ★54 · Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ★336 · Updated 2 years ago
- ★86 · Updated 3 months ago
- Code to reproduce NeuralMind's submissions to COLIEE 2021 and COLIEE 2022 ★24 · Updated 3 years ago
- Dense hybrid representations for text retrieval ★63 · Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch. ★104 · Updated 2 years ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data… ★90 · Updated 2 years ago
- multimodal document analysis ★166 · Updated last year
- This is the code for our KILT leaderboard submissions (KGI + Re2G models). ★156 · Updated 2 months ago
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation ★111 · Updated 4 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ★156 · Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ★41 · Updated 2 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters" ★202 · Updated last year
- Code for EMNLP 2021 paper: "Is Everything in Order? A Simple Way to Order Sentences" ★42 · Updated last year
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English ★212 · Updated 2 years ago