zz-haooo / LLMs-Preference-OptimizationLinks
☆15Updated last year
Alternatives and similar repositories for LLMs-Preference-Optimization
Users that are interested in LLMs-Preference-Optimization are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- ☆36Updated 10 months ago
- ☆68Updated 10 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆99Updated last year
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆40Updated last year
- Benchmark, Toolbox, and Reflection-based Method for Clinical Agent☆14Updated last year
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Updated 2 years ago
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆59Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆27Updated last year
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆26Updated last year
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆77Updated this week
- ☆181Updated last year
- [ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.☆39Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆64Updated 9 months ago
- Collection of latest papers and materials in the area of RLVR!☆45Updated last month
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Updated last year
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆19Updated 11 months ago
- DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue☆51Updated 2 months ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Updated last year
- A framework to empover LLMs on graph reasoning and generation. Refer to our paper: https://arxiv.org/pdf/2402.08785.pdf☆80Updated last year
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆21Updated last year
- Accepted LLM Papers in NeurIPS 2024☆37Updated last year
- The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆52Updated 6 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆22Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 5 months ago
- A Sober Look at Language Model Reasoning☆89Updated last month
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆14Updated 4 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆37Updated 5 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆148Updated 2 years ago
- [NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models☆105Updated last year