coastalcph / lexlms
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
☆17 · Updated last year
Related projects
Alternatives and complementary repositories for lexlms
- Zero-shot evaluation on LexGLUE tasks with GPT-3.5 ☆27 · Updated last year
- This repository provides scripts for evaluating NLP models on the LEXTREME benchmark, a set of diverse multilingual tasks in legal NLP ☆20 · Updated 10 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆11 · Updated 2 months ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2… ☆12 · Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆31 · Updated 3 years ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆16 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆17 · Updated 3 weeks ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆31 · Updated 5 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 · Updated last year
- StAtutory Reasoning Assessment ☆11 · Updated last year
- Lightweight Non-Parametric Embedding Fine-Tuning ☆17 · Updated last month
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆44 · Updated 11 months ago
- Using short models to classify long texts ☆20 · Updated last year
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints" ☆27 · Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆22 · Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆23 · Updated 2 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain… ☆52 · Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization. ☆43 · Updated 5 months ago
- Documentation effort for the BookCorpus dataset ☆32 · Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆31 · Updated last year
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations ☆26 · Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆37 · Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval ☆27 · Updated 2 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆68 · Updated 3 weeks ago
- A dataset for pretraining language models targeted for legal tasks. ☆119 · Updated 2 years ago
- Large-scale query-focused multi-document Summarization dataset ☆10 · Updated 3 years ago