☆53Apr 9, 2025Updated last year
Alternatives and similar repositories for calibration-tuning
Users that are interested in calibration-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Oct 31, 2023Updated 2 years ago
- ☆18May 28, 2024Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 5 months ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024☆22Nov 20, 2024Updated last year
- ☆25Jun 10, 2025Updated 11 months ago
- ☆14Nov 4, 2022Updated 3 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆148Mar 14, 2024Updated 2 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆39Apr 4, 2022Updated 4 years ago
- ☆15Dec 2, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆52Oct 19, 2024Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆35Aug 15, 2024Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated 2 years ago
- ☆32Feb 13, 2024Updated 2 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Jan 23, 2024Updated 2 years ago
- Awesome LLM for NLG Evaluation Papers☆26Jan 23, 2024Updated 2 years ago
- A Random Matrix Approach to Extreme Learning Machine☆15Feb 23, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆18Sep 16, 2025Updated 8 months ago
- ☆22Feb 1, 2024Updated 2 years ago
- ☆10May 27, 2024Updated last year
- Bayesian low-rank adaptation for large language models☆28May 4, 2024Updated 2 years ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆65Dec 8, 2024Updated last year
- ☆185Jun 20, 2024Updated last year
- Confident Adaptive Transformers☆15Apr 18, 2021Updated 5 years ago
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated last month
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆37Jan 26, 2024Updated 2 years ago
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".☆18Jun 12, 2023Updated 2 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆46Sep 27, 2025Updated 7 months ago
- ☆52Mar 17, 2025Updated last year
- Sarcasm Detection for Sentiment Analysis☆22Mar 26, 2019Updated 7 years ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆27Mar 30, 2026Updated last month