☆16Jun 25, 2025Updated 8 months ago
Alternatives and similar repositories for LRM-FactEval
Users that are interested in LRM-FactEval are comparing it to the libraries listed below
Sorting:
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆43Oct 31, 2025Updated 4 months ago
- ☆12Mar 7, 2024Updated 2 years ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 11 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 2 weeks ago
- FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…☆13Apr 25, 2024Updated last year
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…☆31May 7, 2024Updated last year
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)☆23Oct 22, 2024Updated last year
- ☆17May 28, 2024Updated last year
- 同济大学编译原理课程设计☆11Jun 12, 2021Updated 4 years ago
- generative models on toys☆12Sep 10, 2024Updated last year
- ☆23Dec 17, 2024Updated last year
- ☆14Aug 27, 2022Updated 3 years ago
- Official repository for Robust Multimodal Large Language Models Against Modality Conflict☆20Jul 9, 2025Updated 8 months ago
- WWW2021: Interpreting and Unifying Graph Neural Networks with An Optimization Framework☆14Jun 23, 2021Updated 4 years ago
- A collection of papers in fairness of medical image analysis☆13Jun 16, 2023Updated 2 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Oct 8, 2023Updated 2 years ago
- Simulation and robot code for contact-rich household object insertion (ICRA 2023).☆17Dec 18, 2024Updated last year
- Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation(ICLR 2025)”☆25Oct 23, 2025Updated 4 months ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- ☆20Jan 27, 2026Updated last month
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆25Feb 21, 2025Updated last year
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 11 months ago
- Initially a fork of the GitHub repository for the paper "Informer" accepted by AAAI 2021. Heavily modified since then.☆15Apr 7, 2023Updated 2 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated 10 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering☆29Jul 1, 2024Updated last year
- 该项目是基于Python和数据库实现的学生信息管理系统☆13Jul 21, 2021Updated 4 years ago
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models☆24Feb 10, 2025Updated last year
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- 类C编译器,编译原理课程设计☆11Jul 30, 2021Updated 4 years ago
- ☆39May 2, 2024Updated last year
- ☆13Nov 7, 2023Updated 2 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- ☆24Oct 14, 2024Updated last year
- Parse LaTeX math expressions☆39Jan 10, 2026Updated 2 months ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆22Nov 19, 2025Updated 4 months ago
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago