nilesc / Long-Structured-Debate-Generation-and-Evaluation
☆13Updated 2 years ago
Alternatives and similar repositories for Long-Structured-Debate-Generation-and-Evaluation:
Users who are interested in Long-Structured-Debate-Generation-and-Evaluation are comparing it to the repositories listed below
- ☆13Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated last year
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆23Updated 5 months ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Updated 2 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 6 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 3 weeks ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆17Updated 2 weeks ago
- ☆45Updated last year
- ☆45Updated 2 years ago
- This repository contains code and data for the EMNLP 2022 paper "CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about…☆10Updated 2 years ago
- ☆58Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆46Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆22Updated last year
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆18Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆17Updated 2 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆40Updated last year
- ☆55Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Updated last year
- ☆21Updated 3 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 3 years ago
- ☆18Updated last year
- Bayesian Assessment of Hypotheses☆24Updated last year
- Repository for the CODAH dataset☆22Updated 2 years ago
- Adaptation of TextWorld for materials synthesis procedures analysis using Text To Quest System☆9Updated last year
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Updated last year
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆13Updated 2 years ago