maastrichtlawtech / bsard
A statutory article retrieval dataset in French. (ACL 2022)
☆39 · Updated 2 years ago
Alternatives and similar repositories for bsard
Users who are interested in bsard are comparing it to the libraries listed below.
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆79 · Updated 3 years ago
- A multilingual version of MS MARCO passage ranking dataset ☆144 · Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆338 · Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆54 · Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 · Updated 3 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings ☆44 · Updated last year
- multimodal document analysis ☆166 · Updated last year
- Inquisitive Parrots for Search ☆198 · Updated 5 months ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English ☆228 · Updated 3 months ago
- TimeLMs: Diachronic Language Models from Twitter ☆111 · Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆157 · Updated 3 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ☆56 · Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022) ☆115 · Updated 3 years ago
- ☆45 · Updated 2 years ago
- Search Engines with Autoregressive Language models ☆293 · Updated 2 years ago
- ☆39 · Updated 2 years ago
- Long Document Summarization Papers ☆152 · Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch. ☆104 · Updated 2 years ago
- ☆29 · Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆98 · Updated 2 years ago
- Dense hybrid representations for text retrieval ☆63 · Updated 2 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini ☆351 · Updated last year
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction… ☆106 · Updated last year
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio… ☆45 · Updated last year
- ☆42 · Updated 4 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa… ☆97 · Updated 3 years ago
- Efficient Attention for Long Sequence Processing ☆97 · Updated last year
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆41 · Updated 2 years ago
- ☆54 · Updated 2 years ago
- Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021 ☆37 · Updated 3 years ago