wuxiyang1996/AutoHallusion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wuxiyang1996/AutoHallusion)

wuxiyang1996 / AutoHallusion

AutoHallusion Codebase (EMNLP 2024)

☆23

Alternatives and similar repositories for AutoHallusion

Users that are interested in AutoHallusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kai-wen-yang / IDAA
View on GitHub
[ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"
☆10Jul 24, 2022Updated 3 years ago
tianyi-lab / HallusionBench
View on GitHub
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…
☆342Oct 14, 2025Updated 9 months ago
tianyi-lab / RoMA
View on GitHub
Code for "Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs"
☆19Nov 6, 2025Updated 8 months ago
tianyi-lab / Mosaic-IT
View on GitHub
[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
☆20Sep 27, 2025Updated 9 months ago
zxiangx / LC-R1
View on GitHub
Code for paper: Optimizing Length Compression in Large Reasoning Models
☆29Oct 20, 2025Updated 9 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sailing-lab / sr2am
View on GitHub
SR²AM: Efficient Agentic Reasoning Through Self-Regulated Simulative Planning
☆21May 22, 2026Updated last month
Ruiyang-061X / Uncertainty-o
View on GitHub
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…
☆21Mar 13, 2025Updated last year
LinxinS97 / NLPBench
View on GitHub
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10Oct 27, 2023Updated 2 years ago
tianyi-lab / TSRBench
View on GitHub
[ICML 2026] TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models
☆25Mar 24, 2026Updated 3 months ago
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 9 months ago
tianyi-lab / FaSTAR
View on GitHub
[ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing
☆33May 30, 2026Updated last month
measure-infinity / mulan-code
View on GitHub
☆43Jul 16, 2024Updated 2 years ago
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
wuxiyang1996 / iPLAN
View on GitHub
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning
☆46Mar 3, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
melodi-lab / divfl
View on GitHub
Diverse Client Selection for Federated Learning via Submodular Maximization
☆35Aug 3, 2022Updated 3 years ago
sola-st / wasm-type-prediction
View on GitHub
☆11Mar 22, 2022Updated 4 years ago
tianyi-lab / C3PO
View on GitHub
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21Apr 9, 2025Updated last year
tianyi-lab / Moltbook_Socialization
View on GitHub
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
☆18Feb 17, 2026Updated 5 months ago
junyangwang0410 / AMBER
View on GitHub
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
☆172Jan 15, 2024Updated 2 years ago
kai-wen-yang / CD-VAE
View on GitHub
[NeurIPS 2021] "Class-Disentanglement and Applications in Adversarial Detection and Defense"
☆47Jan 18, 2022Updated 4 years ago
xirui-li / MOSSBench
View on GitHub
An implementation for MLLM oversensitivity evaluation
☆18Nov 16, 2024Updated last year
tianyi-lab / DisCL
View on GitHub
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18Sep 26, 2025Updated 9 months ago
armingh2000 / FactScoreLite
View on GitHub
FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…
☆14Apr 25, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ayiyayi / EgoExoBench
View on GitHub
☆15Nov 13, 2025Updated 8 months ago
Yangyi-Chen / CoTConsistency
View on GitHub
The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".
☆34Sep 16, 2023Updated 2 years ago
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
sugar-fly / VSFormer
View on GitHub
[AAAI 2024] VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning
☆16Apr 7, 2024Updated 2 years ago
FuxiaoLiu / LRV-Instruction
View on GitHub
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
☆297Mar 13, 2024Updated 2 years ago
pipilurj / bootstrapped-preference-optimization-BPO
View on GitHub
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆63Aug 23, 2024Updated last year
htyao89 / Textual-based_Class-aware_prompt_tuning
View on GitHub
☆33Mar 7, 2024Updated 2 years ago
panjd123 / D3QN-Snake
View on GitHub
A greedy snake AI implemented with reinforcement learning(D3QN) algorithm under PyTorch framework.一个在PyTorch框架下使用强化学习(D3QN)实现的贪吃蛇AI。
☆17Dec 21, 2022Updated 3 years ago
stevenyangyj / Emma-Alfworld
View on GitHub
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
☆63Mar 6, 2026Updated 4 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
suntea233 / DualLoRA
View on GitHub
Implementation of ACL 2024 paper "Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation".
☆15Nov 9, 2024Updated last year
AIoT-MLSys-Lab / MEDA
View on GitHub
[NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference
☆22Jun 19, 2025Updated last year
SjJ1017 / CiteLab
View on GitHub
The predecessor of CiteLab.
☆18Feb 3, 2026Updated 5 months ago
locuslab / scaling_laws_data_filtering
View on GitHub
☆64Apr 9, 2024Updated 2 years ago
MingLiiii / Layer_Gradient
View on GitHub
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆75Jun 25, 2025Updated last year
r-three / AttriBoT
View on GitHub
Code for AttriBoT from "AttriBoT: A Bag of Tricks for Efficiently Approximating Leave-One-Out Context Attribution"
☆15Apr 21, 2025Updated last year
limenlp / verl
View on GitHub
AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
☆56Jun 13, 2025Updated last year