☆12Nov 5, 2024Updated last year
Alternatives and similar repositories for swe-search
Users that are interested in swe-search are comparing it to the libraries listed below
Sorting:
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated 11 months ago
- ☆132Jun 6, 2025Updated 9 months ago
- ☆24Oct 3, 2025Updated 5 months ago
- ☆34Jan 25, 2026Updated last month
- ☆14May 7, 2025Updated 10 months ago
- Landing page + leaderboard for SWE-Bench benchmark☆11Updated this week
- small MCP server for orchestrating tasks across LLM instances☆24Apr 29, 2025Updated 10 months ago
- ☆73Feb 8, 2026Updated last month
- ☆40Nov 15, 2025Updated 3 months ago
- ☆19Jun 13, 2024Updated last year
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆55Feb 7, 2026Updated last month
- Boston - AI Assistant is an iOS, iPadOS, macOS, and visionOS application that uses SiriKit and OpenAI API's to allow users to access Chat…☆21Sep 12, 2025Updated 5 months ago
- ☆28Nov 10, 2025Updated 3 months ago
- Config files for my GitHub profile.☆20Jan 27, 2026Updated last month
- ☆48Oct 28, 2025Updated 4 months ago
- ☆56Jan 15, 2026Updated last month
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and…☆11Oct 22, 2021Updated 4 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- A Framework for the Systematic Evaluation of Chat-Optimized Language Models as Conversational Agents and an Extensible Benchmark☆32Updated this week
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]☆57Jan 27, 2026Updated last month
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆55Jul 28, 2025Updated 7 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Booz Allen's lean manufacturing approach for holistically designing, developing and fielding AI solutions across the engineering lifecycl…☆42Oct 8, 2025Updated 5 months ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold☆45Apr 15, 2025Updated 10 months ago
- A lightweight OAuth 2.0 Authorization Server supporting Device Authorization Grant (RFC 8628) and Authorization Code Flow with PKCE (RFC …☆32Updated this week
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆27Feb 28, 2026Updated last week
- Payment rails made right. Award winning developer experience.☆27Updated this week
- ☆12Jan 11, 2026Updated last month
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- Code, figure, and data repository for: Haase et al. (2023) Nature. https://doi.org/10.1038/s41586-023-06400-1☆11Aug 10, 2023Updated 2 years ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- ☆13Updated this week
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago