JoelNiklaus / LEXTREMELinks
This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP
β23Updated 2 years ago
Alternatives and similar repositories for LEXTREME
Users that are interested in LEXTREME are comparing it to the libraries listed below
Sorting:
- πΈοΈ A graph-augmented dense statute retriever. (EACL 2023)β24Updated 2 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)β117Updated 3 years ago
- β88Updated 8 months ago
- StAtutory Reasoning Assessmentβ15Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and softwareβ73Updated 2 years ago
- multimodal document analysisβ166Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersβ136Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β41Updated 3 years ago
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024β29Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"β87Updated last year
- We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in β¦β54Updated 2 years ago
- β37Updated last month
- β54Updated 2 years ago
- β101Updated 3 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.β104Updated 2 years ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)β40Updated 4 years ago
- Zero-shot evaluation on LEXGLUE tasks with GTP3.5β29Updated 2 years ago
- Pretraining Efficiently on S2ORC!β178Updated last year
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)β74Updated 3 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrievalβ29Updated 3 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learniβ¦β30Updated 2 years ago
- Search Engines with Autoregressive Language modelsβ295Updated 2 years ago
- A Python Commonsense Knowledge Inference Toolkitβ64Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 laβ¦β49Updated 2 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).β157Updated 3 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataβ¦β93Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β165Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.β79Updated 3 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in Englishβ231Updated 5 months ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OFβ¦β29Updated 4 years ago