zz-haooo / LLMs-Preference-Optimization
☆16Updated last year
Alternatives and similar repositories for LLMs-Preference-Optimization
Users who are interested in LLMs-Preference-Optimization are comparing it to the libraries listed below.
- ☆36Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28Updated last year
- Benchmark, Toolbox, and Reflection-based Method for Clinical Agent☆17Updated last year
- MedAgentBoard: Benchmarking Multi-Agent Collaboration with Conventional Methods for Diverse Medical Tasks☆45Updated 4 months ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆103Updated last year
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Updated 2 years ago
- Collection of latest papers and materials in the area of RLVR!☆56Updated last week
- Code for CVPR 2024 paper: Positive-Unlabeled Learning by Latent Group-Aware Meta Disambiguation☆21Updated last year
- ☆29Updated last year
- A framework to empower LLMs on graph reasoning and generation. Refer to our paper: https://arxiv.org/pdf/2402.08785.pdf☆80Updated last year
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆23Updated 8 months ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆60Updated last year
- ☆69Updated last year
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆18Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 6 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆151Updated last year
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Updated this week
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆15Updated 5 months ago
- ☆75Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆38Updated 6 months ago
- ☆57Updated last year
- [NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models☆106Updated last year
- Code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆65Updated 11 months ago
- DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue☆64Updated 2 weeks ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆79Updated last month
- A comprehensive paper list of Table-based Question Answering.☆36Updated 2 years ago
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆94Updated 2 months ago
- ☆48Updated last month