SWE Arena
☆36 · Jul 6, 2025 · Updated 11 months ago
Alternatives and similar repositories for SWE-Arena
Users who are interested in SWE-Arena are comparing it to the repositories listed below.
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ☆26 · Aug 8, 2024 · Updated last year
- Making code editing up to 7.7x faster using multi-layer speculation ☆24 · Feb 20, 2025 · Updated last year
- Small, simple agent task environments for training and evaluation ☆20 · Nov 1, 2024 · Updated last year
- PaStiX (Parallel Sparse matriX package) solver library ☆20 · Nov 20, 2018 · Updated 7 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆33 · Feb 27, 2025 · Updated last year
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI ☆510 · Jan 3, 2026 · Updated 5 months ago
- ☆48 · Jun 11, 2026 · Updated 2 weeks ago
- LLM-based mutation testing ☆16 · Feb 3, 2025 · Updated last year
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-… ☆25 · Nov 17, 2024 · Updated last year
- ☆31 · Apr 10, 2023 · Updated 3 years ago
- ☆12 · Jun 24, 2017 · Updated 9 years ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models" ☆23 · Aug 14, 2025 · Updated 10 months ago
- Replication package for evaluation of code generation metrics ☆17 · Nov 24, 2025 · Updated 7 months ago
- Code to build models that effectively predict promoter-driven gene expression ☆12 · May 15, 2025 · Updated last year
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23) ☆90 · Jul 5, 2023 · Updated 2 years ago
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022 ☆13 · Dec 10, 2022 · Updated 3 years ago
- ☆18 · Apr 19, 2023 · Updated 3 years ago
- diffusers with search engine ☆12 · Jan 13, 2026 · Updated 5 months ago
- ☆12 · Oct 10, 2021 · Updated 4 years ago
- [NeurIPS 2025] Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior ☆79 · Feb 20, 2026 · Updated 4 months ago
- [NeurIPS 2025] "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning" ☆105 · Oct 21, 2025 · Updated 8 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas. ☆25 · Mar 1, 2026 · Updated 3 months ago
- [EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code ☆79 · Jun 16, 2024 · Updated 2 years ago
- Google Ad Manager API Client Library for NodeJs. ☆12 · Jul 2, 2023 · Updated 2 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents ☆32 · Jul 31, 2025 · Updated 11 months ago
- Use genetic algorithm to optimize the backpropagation neural network. ☆17 · Aug 21, 2020 · Updated 5 years ago
- A CLI tool that fetches GitHub PR diffs, analyzes them with OpenAI, and generates a Markdown code review to streamline the review process… ☆11 · Apr 29, 2025 · Updated last year
- ☆12 · Jul 10, 2023 · Updated 2 years ago
- [AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks ☆12 · Jun 19, 2025 · Updated last year
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs ☆13 · Dec 28, 2024 · Updated last year
- ☆26 · Jun 10, 2024 · Updated 2 years ago
- A pytorch implementation of "Latent Variable Dialogue Models and their Diversity" ☆18 · Nov 30, 2017 · Updated 8 years ago
- Source Code & Datasets for "FBL: Feature-Balanced Loss for Long-Tailed Visual Recognition" ☆13 · Sep 3, 2022 · Updated 3 years ago
- [NeurIPS 2025] | DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data ☆50 · Dec 12, 2025 · Updated 6 months ago
- arXiv fragment loader plugin for https://llm.datasette.io/ ☆18 · May 17, 2025 · Updated last year
- The system enables sophisticated coordination of multiple drones through natural language commands, visual inputs, and real-time environm… ☆17 · Dec 15, 2025 · Updated 6 months ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ☆322 · Feb 24, 2025 · Updated last year
- LeetCode Training and Evaluation Dataset ☆52 · Apr 22, 2025 · Updated last year
- Awesome papers, datasets and projects about the study of large language models like GPT-3, GPT-3.5, ChatGPT, GPT-4, etc. ☆20 · Jun 10, 2023 · Updated 3 years ago