activatedgeek / calibration-tuningView external linksLinks
☆53Apr 9, 2025Updated 10 months ago
Alternatives and similar repositories for calibration-tuning
Users that are interested in calibration-tuning are comparing it to the libraries listed below
Sorting:
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Oct 31, 2023Updated 2 years ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- ☆17May 28, 2024Updated last year
- codes for Efficient Test-Time Scaling via Self-Calibration☆19Sep 13, 2025Updated 5 months ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- A Random Matrix Approach to Extreme Learning Machine☆15Feb 23, 2018Updated 7 years ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated last year
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- ☆32Feb 13, 2024Updated 2 years ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆52Oct 19, 2024Updated last year
- ☆14Nov 4, 2022Updated 3 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- instruction-following benchmark for large reasoning models☆44Aug 9, 2025Updated 6 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- BeHonest: Benchmarking Honesty in Large Language Models☆34Aug 15, 2024Updated last year
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Sep 25, 2024Updated last year
- ☆17Apr 9, 2025Updated 10 months ago
- ☆57Jun 3, 2022Updated 3 years ago
- ☆15Dec 2, 2022Updated 3 years ago
- ☆16Mar 5, 2024Updated last year
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".☆18Jun 12, 2023Updated 2 years ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated last year
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆39Apr 4, 2022Updated 3 years ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 8 months ago
- Group-conditional DRO to alleviate spurious correlations☆15Jul 15, 2021Updated 4 years ago
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆43Dec 5, 2021Updated 4 years ago
- ☆20Feb 10, 2025Updated last year
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- ☆123Feb 21, 2025Updated 11 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆57Oct 10, 2025Updated 4 months ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆22Mar 18, 2025Updated 10 months ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien…☆27Oct 10, 2025Updated 4 months ago
- ☆20Nov 4, 2025Updated 3 months ago
- ☆17Apr 17, 2022Updated 3 years ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆144Mar 14, 2024Updated last year