CATArena is an engineering-level tournament evaluation platform for code agents driven by Large Language Models (LLMs), based on an iterative competitive peer learning framework.
☆65Dec 25, 2025Updated 5 months ago
Alternatives and similar repositories for CATArena
Users that are interested in CATArena are comparing it to the libraries listed below.
- Multi-step reasoning MLLM☆23Mar 8, 2026Updated 2 months ago
- Introduces a novel Video Trimming (VT) task and proposes an agent-based approach (AVT) for detecting wasted footage, selecting valuable se…☆26Jan 20, 2025Updated last year
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 5 months ago
- MXNet deployment version of Pseudo-3D Residual Networks (P-3D); Sports-1M and Kinetics pretrained models are supported☆13Jul 27, 2018Updated 7 years ago
- ☆43Dec 15, 2025Updated 5 months ago
- Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning☆60Feb 24, 2026Updated 3 months ago
- Self-evolve extension for openclaw. Let your claw grow continuously.☆96Apr 12, 2026Updated last month
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- Example project demonstrating the capabilities of the Segment Anything model☆11Jun 18, 2023Updated 2 years ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆55Jul 23, 2025Updated 10 months ago
- ☆31Aug 30, 2025Updated 8 months ago
- Utility for benchmarking changes in Spark using TPC-DS workloads☆16Jun 3, 2021Updated 4 years ago
- ☆121Oct 29, 2025Updated 6 months ago
- ☆92Apr 28, 2026Updated 3 weeks ago
- Some of my open-source documents☆10Feb 18, 2025Updated last year
- Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.☆16Feb 14, 2024Updated 2 years ago
- ☆27Feb 13, 2026Updated 3 months ago
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆42Jul 21, 2025Updated 10 months ago
- A repository for Chinese text normalization.☆20May 2, 2021Updated 5 years ago
- ☆20Oct 18, 2025Updated 7 months ago
- ☆75Jun 10, 2025Updated 11 months ago
- Fork of Bliss☆15Dec 13, 2025Updated 5 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- Code to reproduce the Arena environment experiments from Direct Behavior Specification via Constrained Reinforcement Learning.☆22Sep 10, 2022Updated 3 years ago
- Demonstrating the generation of an ACM paper using Emacs Org-mode and LaTeX export☆19Sep 4, 2012Updated 13 years ago
- NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.☆13Dec 19, 2024Updated last year
- PyTorch implementation of the EMNLP 2018 paper "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders…☆17Dec 11, 2018Updated 7 years ago
- Open-source project for the X152b model, equipped with a D430 stereo depth camera and an Edge2 with 6 TOPS of computing power☆17Oct 27, 2025Updated 7 months ago
- ☆10Sep 9, 2024Updated last year
- [TUD Thesis] Isaac Gym Envs with Drone Racing Tasks☆14Feb 23, 2025Updated last year
- ☆11May 4, 2022Updated 4 years ago
- ☆12Mar 15, 2022Updated 4 years ago
- ☆14Oct 12, 2024Updated last year
- Neural network backend for training and inference for animal pose estimation.☆20Updated this week
- ☆15Aug 25, 2021Updated 4 years ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆29Feb 25, 2025Updated last year
- ☆17Mar 10, 2026Updated 2 months ago
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆129Sep 2, 2024Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago