esteng / calibration_metric
☆10 Updated last year
Alternatives and similar repositories for calibration_metric
Users who are interested in calibration_metric are comparing it to the repositories listed below
- ☆66 Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 Updated 3 years ago
- Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021). ☆64 Updated last year
- ☆75 Updated 4 years ago
- Query-focused summarization data ☆42 Updated 2 years ago
- Rust library for indexing and quickly searching large pretraining corpora ☆27 Updated this week
- ☆101 Updated 2 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences". ☆70 Updated last year
- Automatic metrics for GEM tasks ☆66 Updated 2 years ago
- ☆99 Updated last year
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an… ☆276 Updated 2 years ago
- ☆54 Updated 2 years ago
- ☆58 Updated 3 years ago
- ☆45 Updated last year
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models' ☆17 Updated 3 years ago
- ☆48 Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆41 Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆58 Updated last year
- Replication code for "With Little Power Comes Great Responsibility" ☆39 Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆54 Updated 2 years ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023) ☆21 Updated last year
- ☆97 Updated 3 years ago
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443 ☆86 Updated 10 months ago
- ☆46 Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization. ☆144 Updated 2 years ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation". ☆81 Updated 3 weeks ago
- Benchmark API for Multidomain Language Modeling ☆25 Updated 2 years ago
- Hyperparameter Search for AllenNLP ☆139 Updated 5 months ago
- https://arxiv.org/abs/2404.10917 ☆14 Updated 4 months ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing ☆12 Updated 2 years ago