jzbjyb / lm-calibration
☆35Nov 17, 2021Updated 4 years ago
Alternatives and similar repositories for lm-calibration
Users who are interested in lm-calibration are comparing it to the libraries listed below.
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table☆27Jun 2, 2023Updated 2 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆61Jun 12, 2023Updated 2 years ago
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"☆14Oct 10, 2022Updated 3 years ago
- ☆18Jun 3, 2024Updated last year
- ☆21Jan 5, 2024Updated 2 years ago
- ☆17Dec 21, 2023Updated 2 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 3 years ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- A heterogeneous entity-augmented academic language model based on Open Academic Graph (OAG)☆83Oct 31, 2024Updated last year
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21May 18, 2024Updated last year
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆17Jul 16, 2024Updated last year
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆20Feb 23, 2021Updated 4 years ago
- ☆28Updated this week
- ☆22Feb 26, 2024Updated last year
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆22Jan 30, 2023Updated 3 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- ☆19Jun 4, 2020Updated 5 years ago
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design☆22Sep 23, 2022Updated 3 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- AASC: ACL Anthology Sentence Corpus☆20Oct 28, 2020Updated 5 years ago
- The official code repository for MetricMT - a reward optimization method for NMT with learned metrics☆25Apr 24, 2021Updated 4 years ago
- Awesome LLM for NLG Evaluation Papers☆25Jan 23, 2024Updated 2 years ago
- ☆30May 20, 2022Updated 3 years ago
- ☆25Oct 22, 2022Updated 3 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Sep 12, 2021Updated 4 years ago
- Code and resources for evaluating cross-lingual embedding spaces☆29Apr 7, 2020Updated 5 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Jan 4, 2023Updated 3 years ago
- NILE: Natural Language Inference with Faithful Natural Language Explanations☆30Jun 12, 2023Updated 2 years ago
- ☆36Jan 26, 2025Updated last year
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆36Jul 25, 2023Updated 2 years ago
- The Stanford Word Substitution (Swords) Benchmark☆32Mar 24, 2022Updated 3 years ago
- Code for "Variational Template Machine for Data-to-text generation"☆31Jul 17, 2020Updated 5 years ago
- [ICLR 2021] Contrastive Learning with Adversarial Perturbations for Conditional Text Generation☆86Oct 11, 2022Updated 3 years ago
- [EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning☆36Dec 26, 2024Updated last year
- COMS30017 Computational Neuroscience☆11Jan 7, 2022Updated 4 years ago