lujiaxuan0520/Test-Time-Tool-Evol

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lujiaxuan0520/Test-Time-Tool-Evol)

lujiaxuan0520 / Test-Time-Tool-Evol

Official repository for the paper "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" and the SciEvo benchmark.

☆43

Alternatives and similar repositories for Test-Time-Tool-Evol

Users that are interested in Test-Time-Tool-Evol are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xianjunhong / SeedTest
View on GitHub
🌾 Seed inspection platform with AI-powered detection, automated image acquisition, and integrated measurement systems for agricultural r…
☆46Jan 26, 2026Updated 5 months ago
RileyBear013 / code-navi-main
View on GitHub
☆10Apr 20, 2026Updated 3 months ago
yangwang-1211 / EasyDecrypt
View on GitHub
一款点点点就能实现流量自动加解密的代理工具
☆26Jul 9, 2026Updated 2 weeks ago
Epiphanyi / HAE-Agent-Security
View on GitHub
A survey on security in hierarchical autonomy evolution of AI agents
☆19Mar 10, 2026Updated 4 months ago
buyun00 / Seed-GameDev-Harness
View on GitHub
Seed 是一个专为游戏研发设计的 Claude Code 插件。一条命令描述任务，Seed 自动分析类型和领域，从五个专化 Agent 中组出最合适的组合，通过 Claude Code 原生 Team 机制启动协作。实现、调查、修复、审查、Unity Editor 操…
☆16Apr 30, 2026Updated 2 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
TIGER-AI-Lab / RationalRewards
View on GitHub
RationalRewards: a reasoning reward model for diffusion RL and test-time prompt tuning
☆56Jun 4, 2026Updated last month
LeoZhaorx / openclaw-desk-pet
View on GitHub
A native macOS desktop pet that visualizes OpenClaw agent activity.
☆30Jul 17, 2026Updated last week
lzc-shake / cs-paper-reading
View on GitHub
Deep academic paper analyzer for ML/DL research. Formula-by-formula explanation, reproducibility analysis, and research idea generation u…
☆18Mar 5, 2026Updated 4 months ago
SiChuchen / Scratchpad
View on GitHub
Description: A Windows floating scratchpad for AI coding workflows — collect text, screenshots, and files with Ctrl+V.
☆50Updated this week
slhleosun / EvoClaw
View on GitHub
Structured SOUL evolution framework for AI agents — experience, reflection, governed identity updates, and visual timelines.
☆86Mar 2, 2026Updated 4 months ago
Blackman99 / agent-feishu-channel
View on GitHub
Bridge Claude Code/Codex sessions to a Feishu (Lark) bot
☆46May 19, 2026Updated 2 months ago
yuanxiaochenAC / ASD-SpringBloom.AI
View on GitHub
ASD-SpringBloom is an intelligent support framework designed for Autism Spectrum Disorder (ASD) family scenarios. It focuses on three cor…
☆45Feb 12, 2026Updated 5 months ago
vihuela / lawnchair-sdk-sample
View on GitHub
☆32Apr 2, 2026Updated 3 months ago
kangverse / DALR
View on GitHub
The implementation of our ACL 2025 paper "DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning"
☆42May 25, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Paradox-V / Nothing-but-a-pen-pusher-
View on GitHub
☆33May 4, 2026Updated 2 months ago
Tian-ye1214 / RedLotus
View on GitHub
基于 Pydantic AI 框架构建的多Agent协作系统，实现了管理Agent与工作Agent的分工协作，支持百轮级别的工具调用，让AI真正具备处理复杂任务的能力。
☆34Jun 29, 2026Updated 3 weeks ago
Adkid-Zephyr / Financial-research-agent-QBT
View on GitHub
A futures research and report review agent pipeline.
☆55Apr 20, 2026Updated 3 months ago
lynxlangya / knowject
View on GitHub
AI-assisted project knowledge workspace for development teams.
☆56May 24, 2026Updated 2 months ago
ocy1 / TRIO
View on GitHub
Official implementation for "TRIO: Token Reduction via Inference-Objective Guidance for Efficient Vision-Language Models" https://arxiv…
☆43Jun 3, 2026Updated last month
Enchograph / HuaJuan
View on GitHub
A comprehensive Android AI client | 完善的安卓 AI 客户端
☆39Jun 10, 2026Updated last month
JYP-jjbb / ShopAgents
View on GitHub
An intelligent shopping website that combines a multi-agent framework with personalized recommendation, interactive assistance, and produ…
☆40Apr 5, 2026Updated 3 months ago
KaihuaTang / Index.skill
View on GitHub
基于Claude与Obsidian/WebUI的个人知识管理、更新、交互系统“茵蒂克丝.skill”（LLM Wiki）。同济大学工程智能研究院出品~
☆77Jun 25, 2026Updated 3 weeks ago
Mrguanglei / SlideAgent
View on GitHub
☆81Mar 20, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
vincentchen2026 / claude-code-java
View on GitHub
☆58Apr 9, 2026Updated 3 months ago
xuxuancheng0208 / SMRABooth
View on GitHub
[CVPR 2026] Official code for paper: SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
☆27Jun 14, 2026Updated last month
Haoan919 / Lattice-Su-2-
View on GitHub
☆39Mar 11, 2026Updated 4 months ago
SonicBotMan / SoloFlow
View on GitHub
Complete ETCLOVG framework for AI Agent workflows - DAG+FSM orchestration, Ebbinghaus memory, discipline routing, skill evolution, trace …
☆67May 31, 2026Updated last month
osakana373 / CodEOE
View on GitHub
☆84May 24, 2026Updated 2 months ago
dengxianghua888-ops / ecoalign-forge
View on GitHub
Multi-Agent DPO Data Synthesis Factory — 多智能体偏好训练数据自动合成框架 | 红队攻击 → 多persona审核 → 终审裁决 → DPO偏好对
☆71Apr 11, 2026Updated 3 months ago
HateCodingHateCoding / Bullying-detection-system-v2-public
View on GitHub
☆82Jun 1, 2026Updated last month
cyuQ1n / EasyVideoR1
View on GitHub
☆157Apr 27, 2026Updated 2 months ago
luoz6 / XxCode
View on GitHub
☆89Jun 11, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
henrydiaosi / dorado
View on GitHub
以协议核心的 AI 项目工作流系统，支持 skills 驱动、change 队列与 Codex/Claude Code 协作。
☆107Apr 1, 2026Updated 3 months ago
yuxumin / ViQ
View on GitHub
[ECCV2026] ViQ: Text-Aligned Visual Quantized Representations at Any Resolution
☆127Jul 1, 2026Updated 3 weeks ago
herry2059 / project-os-for-codex
View on GitHub
Open-source control plane for Codex projects: Git-backed context, visible agent progress, scoped MCP access, resumable work, and safe han…
☆102Jul 14, 2026Updated last week
ballyang747 / dingTalkMutilAgent
View on GitHub
☆71Jun 28, 2026Updated 3 weeks ago
blackhaiyu-sudo / spec2case
View on GitHub
Spec2Case 是生产级 AI 测试用例生成智能体，支持图片/文本需求理解、人工确认、LangGraph 流程编排和 Excel 用例导出。
☆108May 20, 2026Updated 2 months ago
zhanghaotian0225 / Accumulative-Decoding
View on GitHub
Mitigating Hallucinations in Large Vision-Language Models via Accumulative Decoding
☆158Mar 26, 2026Updated 3 months ago
Sun15194 / Poly-DETR
View on GitHub
Towards Instance Segmentation with Polygon Detection Transformer.
☆79Mar 10, 2026Updated 4 months ago