☆35Nov 17, 2021Updated 4 years ago
Alternatives and similar repositories for lm-calibration
Users who are interested in lm-calibration are comparing it to the libraries listed below.
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆60Jun 12, 2023Updated 2 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- Code for EMNLP 2022 Paper: On the Calibration of Massively Multilingual Language Models☆15Jun 12, 2023Updated 2 years ago
- ☆28Feb 11, 2026Updated 2 months ago
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table☆28Jun 2, 2023Updated 2 years ago
- ☆22Jan 5, 2024Updated 2 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 4 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design☆23Sep 23, 2022Updated 3 years ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆28Mar 19, 2024Updated 2 years ago
- ☆18Jun 3, 2024Updated last year
- ☆10Jul 23, 2021Updated 4 years ago
- ☆15Nov 17, 2020Updated 5 years ago
- ☆42Feb 2, 2024Updated 2 years ago
- ☆23Aug 10, 2022Updated 3 years ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- Code and resources for evaluating cross-lingual embedding spaces☆29Apr 7, 2020Updated 6 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- ☆20Jun 7, 2020Updated 5 years ago
- ☆39Jul 24, 2025Updated 9 months ago
- ☆25Oct 22, 2022Updated 3 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆23Dec 21, 2023Updated 2 years ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆37Jun 8, 2023Updated 2 years ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Aug 15, 2024Updated last year
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14May 15, 2024Updated last year
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆13Nov 28, 2024Updated last year
- Implementation of AdaCQR (COLING 2025)☆15Dec 30, 2024Updated last year
- Awesome LLM for NLG Evaluation Papers☆26Jan 23, 2024Updated 2 years ago
- ☆23Feb 26, 2024Updated 2 years ago
- In-BoXBART: Get Instructions into Biomedical Multi-task Learning☆15Aug 23, 2022Updated 3 years ago
- EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation☆41Oct 19, 2022Updated 3 years ago
- ☆19Jun 4, 2020Updated 5 years ago
- "A Discrete Variational Recurrent Topic Model without the Reparametrization Trick" (NeurIPS 2020)☆11Apr 26, 2021Updated 5 years ago
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling…☆12Jan 25, 2019Updated 7 years ago
- Embedding-based evaluation metrics for dialogue generation.