LiangruXie / Calibration-Process-in-Black-Box-LLMs
☆18 Updated last year
Alternatives and similar repositories for Calibration-Process-in-Black-Box-LLMs
Users who are interested in Calibration-Process-in-Black-Box-LLMs are comparing it to the repositories listed below.
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search". ☆33 Updated 2 weeks ago
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators" ☆15 Updated last year
- ☆12 Updated last year
- ☆20 Updated 5 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models ☆24 Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆20 Updated last year
- ☆24 Updated 8 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning ☆44 Updated 5 months ago
- ☆15 Updated last year
- ☆14 Updated 10 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning ☆20 Updated 2 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science ☆46 Updated 3 months ago
- ☆16 Updated last year
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C… ☆17 Updated last year
- Source code and dataset for the CCKS2021 paper "Text-guided Legal Knowledge Graph Reasoning". ☆19 Updated 3 years ago
- ☆16 Updated 7 months ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models. ☆23 Updated last year
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197) ☆36 Updated 3 months ago
- ☆31 Updated last year
- ☆24 Updated last month
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25) ☆24 Updated last month
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models ☆17 Updated 2 years ago
- Evaluation Pipeline for medical tasks. ☆12 Updated last year
- a survey on deep research ☆40 Updated 3 months ago
- [ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference ☆24 Updated last year
- Materials for paper "Are Large Language Models Temporally Grounded?" ☆13 Updated 2 years ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025) ☆20 Updated 6 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World". ☆27 Updated 4 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆33 Updated last year
- ☆23 Updated last year