a-antoniades / swe-search
☆12 · Nov 5, 2024 · Updated last year
Alternatives and similar repositories for swe-search
Users who are interested in swe-search are comparing it to the libraries listed below.
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git… ☆14 · Apr 9, 2025 · Updated 10 months ago
- ☆131 · Jun 6, 2025 · Updated 8 months ago
- ☆24 · Oct 3, 2025 · Updated 4 months ago
- ☆33 · Jan 25, 2026 · Updated 3 weeks ago
- ☆14 · May 7, 2025 · Updated 9 months ago
- Landing page + leaderboard for SWE-Bench benchmark ☆11 · Jan 26, 2026 · Updated 3 weeks ago
- small MCP server for orchestrating tasks across LLM instances ☆24 · Apr 29, 2025 · Updated 9 months ago
- ☆35 · Nov 15, 2025 · Updated 3 months ago
- ☆19 · Jun 13, 2024 · Updated last year
- Boston - AI Assistant is an iOS, iPadOS, macOS, and visionOS application that uses SiriKit and OpenAI APIs to allow users to access Chat… ☆21 · Sep 12, 2025 · Updated 5 months ago
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit… ☆54 · Feb 7, 2026 · Updated last week
- ☆28 · Nov 10, 2025 · Updated 3 months ago
- Config files for my GitHub profile. ☆20 · Jan 27, 2026 · Updated 2 weeks ago
- ☆47 · Oct 28, 2025 · Updated 3 months ago
- ☆55 · Jan 15, 2026 · Updated last month
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 · Dec 19, 2024 · Updated last year
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and… ☆11 · Oct 22, 2021 · Updated 4 years ago
- A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark ☆32 · Jan 30, 2026 · Updated 2 weeks ago
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025] ☆55 · Jan 27, 2026 · Updated 2 weeks ago
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training ☆51 · Jul 28, 2025 · Updated 6 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations. ☆24 · Updated this week
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025 ☆22 · Nov 13, 2025 · Updated 3 months ago
- Booz Allen's lean manufacturing approach for holistically designing, developing and fielding AI solutions across the engineering lifecycl… ☆42 · Oct 8, 2025 · Updated 4 months ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold ☆45 · Apr 15, 2025 · Updated 10 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆12 · Jul 14, 2025 · Updated 7 months ago
- ☆28 · Feb 3, 2026 · Updated last week
- ☆13 · Oct 21, 2024 · Updated last year
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference … ☆14 · Dec 12, 2024 · Updated last year
- ☆13 · Updated this week
- [CVPR2024] Learning from Synthetic Human Group Activities ☆14 · Feb 24, 2025 · Updated 11 months ago
- ☆12 · Jan 11, 2026 · Updated last month
- Payment rails made right. Award winning developer experience. ☆28 · Jan 27, 2026 · Updated 2 weeks ago
- Code, figure, and data repository for: Haase et al. (2023) Nature. https://doi.org/10.1038/s41586-023-06400-1 ☆11 · Aug 10, 2023 · Updated 2 years ago
- MCP server for Grok AI API integration ☆19 · Jun 2, 2025 · Updated 8 months ago
- Auction Theory Toolbox – Computer Verified Auctions ☆14 · Jul 12, 2016 · Updated 9 years ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 · Dec 30, 2025 · Updated last month
- Official implementation of Rethinking the "Heatmap + Monte Carlo Tree Search" Paradigm for Large Scale TSP. ☆11 · Nov 15, 2024 · Updated last year
- A Swedish Natural Language Understanding Benchmark ☆11 · Dec 12, 2025 · Updated 2 months ago
- Extract streaming data from text using prefix completion. ☆10 · Oct 6, 2024 · Updated last year