☆161Mar 6, 2026Updated this week
Alternatives and similar repositories for terminal-bench-env
Users that are interested in terminal-bench-env are comparing it to the libraries listed below
Sorting:
- ☆56Nov 12, 2025Updated 3 months ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆38Updated this week
- From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models☆46Feb 27, 2026Updated last week
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆30Feb 24, 2026Updated last week
- ☆14Mar 2, 2026Updated last week
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆33Nov 11, 2025Updated 3 months ago
- ☆13Nov 5, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆21Jun 12, 2025Updated 8 months ago
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou…☆31Feb 1, 2026Updated last month
- The PyTorch implementation of DSM (EMNLP 2022).☆10Mar 26, 2024Updated last year
- The pytorch implementation of Cluster-Aware Supervised Contrastive Learning on Graphs (WWW 2022).☆11Jun 6, 2022Updated 3 years ago
- PyTorch code for the Neurips 2021 paper: Fairness via Representation Neutralization☆10Oct 26, 2021Updated 4 years ago
- ☆15Feb 11, 2025Updated last year
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆23Feb 11, 2026Updated 3 weeks ago
- All-in-One Safety Evaluation Framwork☆42Updated this week
- Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control☆37Feb 22, 2026Updated 2 weeks ago
- ☆11Jan 23, 2021Updated 5 years ago
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).☆14Dec 2, 2023Updated 2 years ago
- ☆36Feb 12, 2026Updated 3 weeks ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- ☆18Apr 10, 2025Updated 10 months ago
- The source code of paper: Learning Disentangled Semantic Representations for Zero-Shot Cross-Lingual Transfer in Multilingual Machine Rea…☆12Apr 6, 2022Updated 3 years ago
- A no-dependency utility to undervolt Intel CPUs on Linux systems, with user-friendly GUI☆16Apr 19, 2025Updated 10 months ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆38Feb 21, 2026Updated 2 weeks ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆12Oct 12, 2024Updated last year
- This project demonstrates a real-time delivery location tracking system similar to Zomato/Swiggy, built using Spring Boot and Apache Kafk…☆28Dec 4, 2025Updated 3 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Sep 22, 2025Updated 5 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆20Updated this week
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- AmneziaWG for Android☆18Updated this week
- Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.☆123Updated this week
- Code for Max-Margin Contrastive Learning - AAAI 2022☆17Apr 25, 2022Updated 3 years ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆30Mar 1, 2026Updated last week
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development"☆25Mar 3, 2026Updated last week
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 2 months ago
- ☆45Feb 25, 2026Updated last week