SALT-NLP / CoAnnotating
This is the official repository for "CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation"
☆14 · Updated 10 months ago
Related projects:
- ☆33 · Updated 3 weeks ago
- ☆20 · Updated 10 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆50 · Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆41 · Updated last month
- Code and data for the FACTOR paper ☆36 · Updated 10 months ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆13 · Updated 2 years ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆28 · Updated 6 months ago
- Data and code for the SciFact-Open task ☆24 · Updated 9 months ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆63 · Updated 2 years ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆37 · Updated 2 months ago
- ☆55 · Updated last year
- Code and model checkpoints for the MultiVerS model for scientific claim verification. ☆44 · Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆56 · Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆38 · Updated 9 months ago
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models" ☆22 · Updated 3 months ago
- ☆27 · Updated 9 months ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆33 · Updated last year
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆83 · Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training. ☆27 · Updated last year
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆30 · Updated last year
- Retrieval as Attention ☆77 · Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions" ☆61 · Updated 2 years ago
- ☆32 · Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆29 · Updated last year
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆63 · Updated 3 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs. ☆73 · Updated 5 months ago
- ☆31 · Updated last year
- ☆50 · Updated 2 years ago
- Codebase of ACL2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Ques… ☆13 · Updated 3 months ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval ☆29 · Updated 10 months ago