chujiezheng / LLM-MCQ-BiasView external linksLinks
Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"
☆43May 20, 2025Updated 8 months ago
Alternatives and similar repositories for LLM-MCQ-Bias
Users that are interested in LLM-MCQ-Bias are comparing it to the libraries listed below
Sorting:
- ☆13May 21, 2024Updated last year
- RadGraph: Extracting Clinical Entities and Relations from Radiology Reports☆13Nov 22, 2022Updated 3 years ago
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows"☆14Nov 28, 2022Updated 3 years ago
- [ACCV2024 (Oral)] Official pytorch implementation of X-RGen☆19Jan 20, 2025Updated last year
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆25Nov 16, 2023Updated 2 years ago
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆42Jun 29, 2025Updated 7 months ago
- Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"☆32May 31, 2021Updated 4 years ago
- ☆26Jan 25, 2019Updated 7 years ago
- ☆38Jan 15, 2025Updated last year
- Deep learning network MEBCRN for separation of fat and water magnetic resonance images☆11Dec 29, 2020Updated 5 years ago
- Shopping MMLU: A Multi-Task Online Shopping Benchmark for LLMs.☆44Nov 4, 2024Updated last year
- Denoising of Impulsive noise in single/multichannel images☆11Dec 7, 2017Updated 8 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Mar 7, 2025Updated 11 months ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 10 months ago
- COVID-19 Risk Estimation for L.A. County using a Bayesian Time-varying SIR-model☆12Feb 17, 2023Updated 3 years ago
- Consensus Based Distributed Stochastic Gradient Descent☆11Jun 24, 2018Updated 7 years ago
- ☆13Sep 23, 2022Updated 3 years ago
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase☆12Jun 2, 2024Updated last year
- Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images☆11Jan 11, 2024Updated 2 years ago
- ☆12Feb 26, 2025Updated 11 months ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆10May 1, 2024Updated last year
- ☆11Oct 12, 2021Updated 4 years ago
- ☆12Dec 11, 2024Updated last year
- Models for the assigments of image-to-image transfer between the domains of Xray images and DRR, bones and lungs images extracted from CT…☆12Nov 21, 2021Updated 4 years ago
- ☆11Nov 27, 2022Updated 3 years ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"☆11May 23, 2023Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…☆13Sep 13, 2024Updated last year
- ☆10Jun 1, 2022Updated 3 years ago
- This repository provides an implementation of the DTi2Vec tool, to identify Drug-Target interaction using network embedding and ensemble …☆12Sep 28, 2021Updated 4 years ago
- ☆11Jun 21, 2025Updated 7 months ago
- ☆11May 17, 2024Updated last year
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- Boilerplate free class-based action creator. Following flux-standard-action spec. Built with TypeScript. Works with redux and ngrx.☆21Mar 6, 2019Updated 6 years ago
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.☆11Oct 20, 2020Updated 5 years ago
- Artificial Intelligence and Machine Learning python scripts for the Secure and Private AI Facebook Scholarship Challenge☆10Sep 27, 2019Updated 6 years ago
- How to really install tensorflow-gpu from source on a clean instance of Ubuntu☆11Sep 29, 2023Updated 2 years ago
- Efficient joint input optimization and inference with DEQ☆10Nov 25, 2021Updated 4 years ago
- CodeQUEST is a generalizable framework which leverages LLMs to iteratively evaluate and enhance code quality across multiple dimensions f…☆16Updated this week