☆53Apr 9, 2025Updated last year
Alternatives and similar repositories for calibration-tuning
Users that are interested in calibration-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Oct 31, 2023Updated 2 years ago
- ☆18May 28, 2024Updated 2 years ago
- codes for Efficient Test-Time Scaling via Self-Calibration☆20Sep 13, 2025Updated 8 months ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆17Dec 8, 2025Updated 6 months ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.☆13Nov 19, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated 2 years ago
- ☆25Jun 10, 2025Updated 11 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆148Mar 14, 2024Updated 2 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆39Apr 4, 2022Updated 4 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- ☆15Dec 2, 2022Updated 3 years ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆53Oct 19, 2024Updated last year
- BeHonest: Benchmarking Honesty in Large Language Models☆35Aug 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Feb 27, 2024Updated 2 years ago
- ☆32Feb 13, 2024Updated 2 years ago
- Data for EMNLP 2022 paper "arXivEdits: Understanding the Human Revision Process in Scientific Writing".☆14Sep 30, 2023Updated 2 years ago
- Awesome LLM for NLG Evaluation Papers☆26Jan 23, 2024Updated 2 years ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents☆19Sep 16, 2025Updated 8 months ago
- ☆22Feb 1, 2024Updated 2 years ago
- Bayesian low-rank adaptation for large language models☆29May 4, 2024Updated 2 years ago
- A well documented Jupyter notebook for learning how you could recognize Persian digits using CNNs.☆11Oct 4, 2018Updated 7 years ago
- ☆186Jun 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Confident Adaptive Transformers☆15Apr 18, 2021Updated 5 years ago
- ☆125Updated this week
- ☆19Apr 19, 2024Updated 2 years ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- ☆37Jan 26, 2024Updated 2 years ago
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".☆18Jun 12, 2023Updated 2 years ago
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)☆59Oct 10, 2025Updated 7 months ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆47Sep 27, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆52Mar 17, 2025Updated last year
- Sarcasm Detection for Sentiment Analysis☆22Mar 26, 2019Updated 7 years ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆62Sep 4, 2024Updated last year
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆28Dec 16, 2024Updated last year
- Time Integration Package☆11Dec 17, 2024Updated last year
- ☆17Apr 17, 2022Updated 4 years ago
- ☆105Jun 30, 2024Updated last year