alphadl / OOP-eval
The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs
☆21Updated 4 months ago
Related projects
Alternatives and complementary repositories for OOP-eval
- [ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"☆30Updated 4 months ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆86Updated this week
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆20Updated 8 months ago
- [ICLR 2022] Official repository for "Knowledge Removal in Sampling-based Bayesian Inference"☆19Updated 2 years ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Updated 5 months ago
- Code for "Universal Adversarial Triggers Are Not Universal."☆16Updated 6 months ago
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)☆28Updated 4 months ago
- Multilingual safety benchmark for Large Language Models☆24Updated 2 months ago
- Restore safety in fine-tuned language models through task arithmetic☆26Updated 7 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆63Updated 9 months ago
- Personality Alignment of Language Models☆18Updated 2 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆62Updated last month
- A Lightweight Visual Understanding and Reasoning Benchmark for Evaluating Large Multimodal Models through Coding Tasks☆14Updated this week
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆26Updated last month
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆24Updated last year
- [Arxiv 2024] Adversarial attacks on multimodal agents☆39Updated 4 months ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆19Updated 2 weeks ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆12Updated last week
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆62Updated 8 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆33Updated last month
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆45Updated 7 months ago