CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), based on an iterative competitive peer learning framework.
☆60Dec 25, 2025Updated 2 months ago
Alternatives and similar repositories for CATArena
Users that are interested in CATArena are comparing it to the libraries listed below
Sorting:
- Differential Evolution Algorithm which uses Non-dominated Sorting for Multi-Objective Optimization☆10Mar 11, 2020Updated 6 years ago
- Demonstrates how to formulate the n-queens problem as a QUBO, which we then solve using Leap’s hybrid solvers.☆10Mar 3, 2026Updated 2 weeks ago
- The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"☆27Sep 15, 2025Updated 6 months ago
- Introduce a novel Video Trimming (VT) task and proposes an agent-based approach (AVT) for detecting wasted footage, selecting valuable se…☆23Jan 20, 2025Updated last year
- The implementation of “Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns”☆18Jun 18, 2024Updated last year
- Counterfactual generation of tumor perturbations from multiplexed tissue images☆23May 13, 2025Updated 10 months ago
- Paper: “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY” Open-Source Code☆55Feb 27, 2026Updated 3 weeks ago
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 3 months ago
- ☆41Dec 15, 2025Updated 3 months ago
- ☆14Aug 22, 2024Updated last year
- [BMVC 2023 Oral] Boost Video Frame Interpolation via Motion Adaptation☆19Aug 22, 2024Updated last year
- Code for InstructBioMol, implementing the Nature Machine Intelligence paper "Advancing Biomolecular Understanding and Design Following Hu…☆31Aug 2, 2025Updated 7 months ago
- UnrealCV for image rendering from 3D model☆14May 21, 2020Updated 5 years ago
- A multi-objective evolutionary algorithm with interval based initialization and self-adaptive crossover operator for large-scale feature …☆13Sep 6, 2022Updated 3 years ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆56Feb 24, 2026Updated 3 weeks ago
- [TVCG 2021] Official Implementation of "FixationNet: Forecasting Eye Fixations in Task-Oriented Virtual Environments"☆11Aug 13, 2025Updated 7 months ago
- 浙江大学 ZJU 报告模板 (LaTex & Typora-Markdown)☆22Mar 11, 2025Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- ☆33Jul 15, 2025Updated 8 months ago
- Robust Principles: Architectural Design Principles for Adversarially Robust CNNs☆23Jan 13, 2024Updated 2 years ago
- Non-linear Motion Estimation for Video Frame Interpolation using Space-time Convolutions☆20Jun 23, 2022Updated 3 years ago
- Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder (NeurIPS 2023)☆10Jun 5, 2024Updated last year
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆39Mar 13, 2026Updated last week
- Flarum WeChat Login extension☆14Oct 19, 2023Updated 2 years ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆53Jul 23, 2025Updated 8 months ago
- ☆18Aug 14, 2024Updated last year
- A docker-compose.yml for flarum☆11Jan 14, 2023Updated 3 years ago
- The source code for the paper: Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction (CVPR2023)☆23May 29, 2023Updated 2 years ago
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆29Feb 19, 2025Updated last year
- Evolution strategy NSGA-II for MO-G-FJSP☆24Feb 21, 2022Updated 4 years ago
- A database with automatic dynamic imputation of missing values.☆11Nov 2, 2017Updated 8 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- ☆16Sep 4, 2023Updated 2 years ago
- An AI benchmark for Pokémon VGC with agent implementations using multi-agent reinforcement learning, behavior cloning, LLMs, and heuristi…☆34Mar 12, 2026Updated last week
- lecture notes of probability notes☆17Jul 7, 2020Updated 5 years ago
- Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.☆16Feb 14, 2024Updated 2 years ago
- RL with Experience Replay☆55Jul 27, 2025Updated 7 months ago
- ☆28Feb 13, 2026Updated last month
- Brings RSS and Atom feeds to Flarum☆15Dec 20, 2023Updated 2 years ago