SWE Arena
☆36Jul 6, 2025Updated 11 months ago
Alternatives and similar repositories for SWE-Arena
Users that are interested in SWE-Arena are comparing it to the libraries listed below.
- BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions☆25Aug 8, 2024Updated last year
- Making code editing up to 7.7x faster using multi-layer speculation☆24Feb 20, 2025Updated last year
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Jun 30, 2025Updated 11 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆33Feb 27, 2025Updated last year
- A collection of deep reinforcement learning algorithm implementations☆11Jan 9, 2020Updated 6 years ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆29Nov 11, 2025Updated 6 months ago
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution☆61Oct 13, 2025Updated 7 months ago
- ☆10Jul 14, 2018Updated 7 years ago
- [ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI☆503Jan 3, 2026Updated 5 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated last year
- ☆47May 27, 2026Updated last week
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆25Nov 17, 2024Updated last year
- Large Language Agents Modulating Behaviour in Decentralized Autonomous Organizations☆24Jul 14, 2023Updated 2 years ago
- (AAAI 2026) OSVBench, a new benchmark for evaluating Large Language Models (LLMs) in generating complete specification code pertaining to…☆14May 13, 2025Updated last year
- A curated list of awesome multi-modal recommendation.☆10Mar 16, 2022Updated 4 years ago
- ☆15Sep 30, 2022Updated 3 years ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆23Aug 14, 2025Updated 9 months ago
- Replication package for evaluation of code generation metrics☆17Nov 24, 2025Updated 6 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 4 months ago
- Code to build models that effectively predict promoter-driven gene expression☆12May 15, 2025Updated last year
- CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure, EMNLP 2022☆13Dec 10, 2022Updated 3 years ago
- ☆18Apr 19, 2023Updated 3 years ago
- diffusers with search engine☆12Jan 13, 2026Updated 4 months ago
- ☆12Oct 10, 2021Updated 4 years ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆66Oct 4, 2024Updated last year
- musl: A C standard library☆18Apr 20, 2026Updated last month
- Domain Generation Algorithms research papers, datasets and code☆15May 17, 2020Updated 6 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated 2 years ago
- Google Ad Manager API Client Library for NodeJs.☆12Jul 2, 2023Updated 2 years ago
- ☆26Dec 29, 2023Updated 2 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆29Jul 31, 2025Updated 10 months ago
- Use genetic algorithm to optimize the backpropagation neural network.☆17Aug 21, 2020Updated 5 years ago
- A CLI tool that fetches GitHub PR diffs, analyzes them with OpenAI, and generates a Markdown code review to streamline the review process…☆11Apr 29, 2025Updated last year
- ☆10Feb 26, 2021Updated 5 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- Code implementation for paper AbsenceBench: Language Models Can't Tell What's Missing☆19Oct 23, 2025Updated 7 months ago
- A pytorch implementation of "Latent Variable Dialogue Models and their Diversity"☆18Nov 30, 2017Updated 8 years ago
- Source Code & Datasets for "FBL: Feature-Balanced Loss for Long-Tailed Visual Recognition"☆13Sep 3, 2022Updated 3 years ago
- R1-like Computer-use Agent☆91Mar 21, 2025Updated last year