Official repository for Decentralized Arena via Collective LLM Intelligence
☆17May 19, 2025Updated 10 months ago
Alternatives and similar repositories for de-arena
Users that are interested in de-arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆25Nov 17, 2024Updated last year
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language (ACL 2022)☆13May 19, 2022Updated 3 years ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated 2 months ago
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data"☆10Apr 10, 2020Updated 5 years ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- [ICASSP 2022] Official PyTorch Implementation for "Attention Probe: Vision Transformer Distillation in the Wild" (ICASSP 2022)☆11Jan 23, 2022Updated 4 years ago
- Source Code for ACL 2020 paper, "Rationalizing Medical Relation Prediction from Corpus-level Statistics"☆11Sep 6, 2020Updated 5 years ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Nov 10, 2023Updated 2 years ago
- Implementation of SurfCon model for Synonym Discovery on Privacy-Aware Clinical Data☆12Jul 6, 2023Updated 2 years ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆37Nov 13, 2024Updated last year
- Task-Guided Pair Embedding in Heterogeneous Network (CIKM 2019)☆12Aug 19, 2021Updated 4 years ago
- ☆25Jun 28, 2024Updated last year
- ☆21Mar 25, 2023Updated 3 years ago
- Koishi's Day 2025 Paper (NeurIPS 2025): "Codifying Character Logic in Role-Playing"☆23Jan 15, 2026Updated 2 months ago
- Coherence boosting: When your pretrained language model is not paying enough attention (ACL 2022) https://arxiv.org/abs/2110.08294☆15Apr 23, 2023Updated 2 years ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆41Jul 22, 2025Updated 8 months ago
- Participant Kit for the TextGraphs-15 Shared Task on Explanation Regeneration☆19Nov 8, 2021Updated 4 years ago
- Fast spectral clustering, described in the NeurIPS'23 paper "Fast and Simple Spectral Clustering in Theory and Practice"☆17Jun 19, 2025Updated 9 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- Official Implementation of the ACL2024 Findings paper "Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attr…☆18May 18, 2024Updated last year
- Official Code for DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents (Findings of EMNL…☆22Oct 24, 2023Updated 2 years ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆25Feb 1, 2024Updated 2 years ago
- ☆21Jun 27, 2024Updated last year
- Block-Recurrent Dynamics in ViTs 🦖☆33Dec 24, 2025Updated 3 months ago
- Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard☆25Dec 14, 2024Updated last year
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆48Feb 10, 2026Updated last month
- CellPolaris☆13Jan 11, 2026Updated 2 months ago
- ☆37Feb 4, 2026Updated last month
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆35Oct 23, 2024Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆30Mar 2, 2024Updated 2 years ago
- Here is the code for experiments related to the NIS+ framework.☆16Dec 15, 2025Updated 3 months ago
- This the implementation of LeCo☆32Jan 20, 2025Updated last year
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Official repository for the paper "Flow Equivariant Recurrent Neural Networks"☆34Jul 2, 2025Updated 8 months ago
- ☆11Dec 12, 2020Updated 5 years ago
- Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models☆12Jun 21, 2024Updated last year
- ☆27May 20, 2025Updated 10 months ago