SALT-NLP / DARG
The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
☆16Updated 8 months ago
Alternatives and similar repositories for DARG
Users who are interested in DARG are comparing it to the repositories listed below
- AbstainQA, ACL 2024☆26Updated 8 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆23Updated last year
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆19Updated 9 months ago
- Code/data for MARG (multi-agent review generation)☆44Updated 7 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated 2 months ago
- Evaluate the Quality of Critique☆35Updated last year
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations☆19Updated 3 weeks ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆25Updated 10 months ago
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆23Updated last month
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆28Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆17Updated last week
- Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https:…☆25Updated last month
- This is the code of MMOA-RAG.☆53Updated last month
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆30Updated 9 months ago
- ☆25Updated last year
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Updated last year
- ☆19Updated 4 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 8 months ago
- ☆24Updated 2 months ago
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆22Updated 4 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆60Updated 7 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆21Updated 6 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Updated 5 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆95Updated 2 weeks ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- The benchmark proposed in paper: GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability☆20Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆65Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆59Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆21Updated 9 months ago