LiangruXie / Calibration-Process-in-Black-Box-LLMsLinks
☆13Updated 6 months ago
Alternatives and similar repositories for Calibration-Process-in-Black-Box-LLMs
Users that are interested in Calibration-Process-in-Black-Box-LLMs are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆22Updated last month
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆22Updated this week
- ☆12Updated 4 months ago
- ☆20Updated last month
- ☆19Updated last week
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆19Updated 7 months ago
- ☆15Updated 8 months ago
- Code for Robust Fine-tuning (RbFT)☆12Updated 4 months ago
- ☆16Updated 10 months ago
- ☆22Updated 11 months ago
- ☆15Updated 7 months ago
- KDD 2024 AQA competition 2nd place solution☆11Updated 10 months ago
- Control LLM☆14Updated 2 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation☆24Updated 3 weeks ago
- [Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation☆21Updated 8 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation☆19Updated this week
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆11Updated last week
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆27Updated 2 weeks ago
- ☆24Updated last month
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆19Updated 2 months ago
- ☆17Updated 6 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 3 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆14Updated 5 months ago
- ☆19Updated 3 months ago
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆15Updated 3 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 5 months ago
- ☆15Updated last month
- ☆57Updated 7 months ago
- Source code for EMNLP 2023 paper "Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions".☆20Updated last year