LiangruXie / Calibration-Process-in-Black-Box-LLMs
☆13 Updated 7 months ago
Alternatives and similar repositories for Calibration-Process-in-Black-Box-LLMs
Users who are interested in Calibration-Process-in-Black-Box-LLMs are comparing it to the repositories listed below
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization ☆12 Updated 5 months ago
- A comprehensive and efficient long-context model evaluation framework ☆15 Updated this week
- ☆16 Updated 11 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning ☆28 Updated 3 weeks ago
- Control LLM ☆17 Updated 3 months ago
- ☆15 Updated 10 months ago
- ☆12 Updated 5 months ago
- ☆22 Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models. ☆19 Updated 8 months ago
- ☆16 Updated 2 weeks ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆33 Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆15 Updated 7 months ago
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions ☆12 Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral] ☆24 Updated 2 weeks ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code) ☆15 Updated 2 months ago
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness" ☆15 Updated 5 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation ☆15 Updated 8 months ago
- ☆13 Updated 5 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits" ☆13 Updated 9 months ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C… ☆14 Updated last year
- DataSciBench: An LLM Agent Benchmark for Data Science ☆22 Updated 4 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation ☆28 Updated this week
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search". ☆26 Updated 2 months ago
- ☆12 Updated last year
- A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models ☆14 Updated last month
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering ☆22 Updated 8 months ago
- ☆18 Updated 4 months ago
- ☆11 Updated 3 months ago
- ☆14 Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆16 Updated 6 months ago