☆46Jan 17, 2026Updated 2 months ago
Alternatives and similar repositories for trajectory-visualizer
Users who are interested in trajectory-visualizer are comparing it to the libraries listed below.
- A collection of scripts and tools for analyzing SWE agents.☆16May 7, 2025Updated 10 months ago
- All Hands AI OpenHands Self-hosted Cloud☆42Updated this week
- Agent computer interface for AI software engineer.☆121Mar 12, 2026Updated last week
- Landing page + leaderboard for SWE-Bench benchmark☆12Mar 4, 2026Updated 2 weeks ago
- Lightweight OpenHands CLI in a binary executable☆131Updated this week
- The theory of mind module for the SWE agent☆90Jan 13, 2026Updated 2 months ago
- MCP server for interactive debugging☆31Jun 17, 2025Updated 9 months ago
- JAX bindings for the flash-attention3 kernels☆21Jan 2, 2026Updated 2 months ago
- ☆68May 20, 2025Updated 10 months ago
- Easiest way to build custom agents, in a no-code notion style editor, using simple macros.☆34Nov 8, 2024Updated last year
- Lesson plugins for Marimo notebooks☆19Apr 9, 2025Updated 11 months ago
- The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data"☆12May 23, 2021Updated 4 years ago
- ☆14Oct 6, 2020Updated 5 years ago
- Local lightning-fast semantic code search built for agents☆39Updated this week
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation☆47Jan 28, 2026Updated last month
- ☆15Oct 4, 2024Updated last year
- A web-based interactive browser for a survey of ML4VIS studies☆12Aug 7, 2024Updated last year
- SWE Arena☆35Jul 6, 2025Updated 8 months ago
- LockManager with deadlock detection for implementing 2PL☆13Mar 13, 2019Updated 7 years ago
- Code and dataset for the EMNLP paper "Instruct and Extract: Instruction Tuning for On-Demand Information Extraction"☆54Jan 2, 2024Updated 2 years ago
- ☆26Sep 5, 2024Updated last year
- Sequence algorithms for use in Flashlight.☆14Jan 12, 2026Updated 2 months ago
- Configuration for all things CI☆16Updated this week
- Basic Funcnodes package☆32Updated this week
- An MCP server for analysing GitHub repo content with Gitingest☆20Jun 25, 2025Updated 8 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 7 years ago
- MULTIOPED: A Corpus of Multi-Perspective News Editorials.☆12Aug 25, 2021Updated 4 years ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆255Feb 27, 2026Updated 3 weeks ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆650Jul 29, 2025Updated 7 months ago
- ☆12Jul 31, 2025Updated 7 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Nov 11, 2025Updated 4 months ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- ☆17Aug 5, 2025Updated 7 months ago
- A toolkit to induce interpretable workflows from raw computer-use activities.☆42Nov 13, 2025Updated 4 months ago
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- Code for the MTEB Arena☆24Jul 2, 2025Updated 8 months ago
- A curated list of tools, projects, and resources from the Gaia community☆30Oct 6, 2025Updated 5 months ago
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆453Updated this week
- Pul(umi) schema from OpenAPI specs.☆19Mar 13, 2026Updated last week