☆122Feb 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for terminal-bench-2
Users that are interested in terminal-bench-2 are comparing it to the libraries listed below
Sorting:
- ☆87Feb 12, 2026Updated last month
- Harbor is a framework for running agent evaluations and creating and using RL environments.☆1,011Updated this week
- Implementation of Recursive Language Model paper from scratch☆38Feb 10, 2026Updated last month
- Compare your aura with someone else's X account posts. Powered by Exa AI, DeepSeek Reasoning LLMs and Vercel AI SDK☆20Feb 12, 2025Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- Deeply supervised density regression for automatic cell counting in microscopy images☆12Jan 31, 2022Updated 4 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- Submodule for Grounded-SAM☆12Apr 17, 2023Updated 2 years ago
- Official implementation of Log-linear Sparse Attention (LLSA).☆63Feb 2, 2026Updated last month
- This is the repository for the resources in TACL 2022 Paper "Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inf…☆14Aug 17, 2022Updated 3 years ago
- Mixture Density Network Demo in Pytorch☆14Sep 17, 2018Updated 7 years ago
- ☆18Apr 8, 2025Updated 11 months ago
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆32Sep 30, 2024Updated last year
- Demo of using WASM to sandbox Plotly execution☆19Mar 30, 2025Updated 11 months ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆55Jan 28, 2026Updated last month
- Source code of our ICML 2025 paper "Flowing Datasets with Wasserstein over Wasserstein Gradient Flows"☆18May 21, 2025Updated 10 months ago
- ☆32Feb 2, 2025Updated last year
- Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.☆201Updated this week
- Here is the resources and code for the LotteryCodec.☆26Nov 3, 2025Updated 4 months ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- Multi-agent synthetic data generation pipeline capable of generating and validating long horizon terminal/coding tasks for RL training☆56Jul 28, 2025Updated 7 months ago
- [ICCV 2025 Highlight] official code of paper "DLF: Extreme Image Compression with Dual-generative Latent Fusion"☆40Dec 24, 2025Updated 2 months ago
- [ICLR 2022] Understanding and Improving Graph Injection Attack by Promoting Unnoticeability☆38Nov 27, 2023Updated 2 years ago
- Code repository for Beetle the robot.☆13Sep 12, 2023Updated 2 years ago
- [TCSVT 2023] RDO-PTQ: Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression☆18Nov 1, 2023Updated 2 years ago
- The pytorch implementation of Cluster-Aware Supervised Contrastive Learning on Graphs (WWW 2022).☆11Jun 6, 2022Updated 3 years ago
- Prompt Jinja2 templates for LLMs☆35Jul 9, 2025Updated 8 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆61Oct 3, 2024Updated last year
- A boundary detection algorithm in microscopic images considering 3D information.☆13Sep 19, 2018Updated 7 years ago
- ☆11Jan 23, 2021Updated 5 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- Flash-Linear-Attention models beyond language☆21Aug 28, 2025Updated 6 months ago
- A Model Context Protocol (MCP) server providing TomTom's location services, search, routing, and traffic data to AI agents.☆42Updated this week
- 💻 SETA: Scaling Environments for Terminal Agents - Environments☆119Feb 16, 2026Updated last month
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- pytorch implementation affinity loss☆11Feb 21, 2019Updated 7 years ago
- Open-source MCP server for secure, low-latency cloud-browser automation on Kernel.☆27Mar 9, 2026Updated last week
- ☆13Jul 14, 2024Updated last year
- [NeurIPS 2024] Implementation of "Enhancing Graph Transformers with Hierarchical Distance Structural Encoding"☆16May 19, 2025Updated 10 months ago