OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems
☆120Jul 13, 2025Updated 7 months ago
Alternatives and similar repositories for OPT-BENCH
Users that are interested in OPT-BENCH are comparing it to the libraries listed below
Sorting:
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 7 months ago
- Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆144Mar 2, 2025Updated last year
- [NIPS 2025 DB Oral] Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆141Feb 9, 2026Updated last month
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 9 months ago
- [TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 🧬 🤖☆10Apr 17, 2025Updated 10 months ago
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆84Feb 13, 2026Updated 3 weeks ago
- ☆30Dec 23, 2025Updated 2 months ago
- A Prompt Learning Framework for Source Code Summarization☆14Dec 26, 2023Updated 2 years ago
- ☆14Sep 6, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 4 months ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Feb 20, 2025Updated last year
- ☆12Jan 25, 2026Updated last month
- ☆11Nov 5, 2024Updated last year
- camera monitoring and alerts using deepstack☆13Jun 2, 2020Updated 5 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- This repository contains a series of 4 jupyter notebooks demonstrating how AWS AI Services like Amazon Rekognition, Amazon Transcribe and…☆13Nov 26, 2021Updated 4 years ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated last year
- R1V, trained with AI feedback, answers open-ended visual questions.☆14Apr 12, 2025Updated 10 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Jun 28, 2024Updated last year
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 4 months ago
- Simple agent framework using Ollama tool calling☆10Aug 27, 2024Updated last year
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 7 months ago
- Code for the paper "Bounce: Reliable High-Dimensional Bayesian Optimization for Combinatorial and Mixed Spaces"☆15Apr 30, 2024Updated last year
- OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil☆11Sep 24, 2021Updated 4 years ago
- 基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.☆11Dec 16, 2022Updated 3 years ago
- Hubsy is your HubSpot personal assistant.☆10Jul 17, 2017Updated 8 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 2 years ago
- 100 game demos by Crossin的编程教室☆15Jun 4, 2025Updated 9 months ago
- Pyra: Automated EM27/SUN Greenhouse Gas Measurements☆15Feb 20, 2026Updated 2 weeks ago
- ☆12Feb 23, 2023Updated 3 years ago
- GPT Librarian understands all your docs☆13Oct 18, 2023Updated 2 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- ☆20Jun 25, 2025Updated 8 months ago
- JSON schema-backed API framework for writing HTTP handlers in Go. Validation, decoding, and OpenAPI 3.1 output.☆17Dec 5, 2025Updated 3 months ago
- LMM for VQA, tcsvt version☆11Jul 19, 2024Updated last year
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆12Nov 28, 2024Updated last year
- ☆13Mar 9, 2024Updated 2 years ago
- ☆12Jan 11, 2025Updated last year