neulab / cmulab
CMU Linguistic Annotation Backend
☆14Updated 4 months ago
Alternatives and similar repositories for cmulab
Users that are interested in cmulab are comparing it to the libraries listed below
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Updated 7 months ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated last year
- ☆19Updated 4 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Updated 11 months ago
- ☆26Updated 11 months ago
- ☆10Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆27Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022☆30Updated 3 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆64Updated last year
- ☆59Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆33Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- Semantically Structured Sentence Embeddings☆71Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Efficient few-shot learning with cross-encoders.☆61Updated last year
- Easy modernBERT fine-tuning and multi-task learning☆63Updated 6 months ago
- ☆23Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆74Updated this week
- ☆22Updated 3 years ago
- ☆37Updated 2 months ago
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike).☆58Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18Updated 8 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆36Updated 7 months ago
- My NER Experiments with ModernBERT and Ettin☆26Updated 6 months ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆24Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆21Updated 3 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year