Official repository for the paper "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" and the SciEvo benchmark.
☆43 · Jan 13, 2026 · Updated 4 months ago
Alternatives and similar repositories for Test-Time-Tool-Evol
Users who are interested in Test-Time-Tool-Evol are comparing it to the repositories listed below.
- ☆38 · Nov 11, 2025 · Updated 7 months ago
- On-policy distillation built on top of Verl ☆69 · May 25, 2026 · Updated 2 weeks ago
- Complete ETCLOVG framework for AI Agent workflows - DAG+FSM orchestration, Ebbinghaus memory, discipline routing, skill evolution, trace … ☆130 · May 31, 2026 · Updated last week
- A Windows floating scratchpad for AI coding workflows: collect text, screenshots, and files with Ctrl+V. ☆102 · Apr 27, 2026 · Updated last month
- ☆49 · Apr 20, 2026 · Updated last month
- Source code for SWIFT, an efficient reward model. ☆21 · Jan 13, 2026 · Updated 4 months ago
- [ACL 2026] From Experience to Skill: Multi-Agent Generative Engine Optimization via Reusable Strategy Learning ☆38 · Apr 26, 2026 · Updated last month
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show… ☆64 · Feb 4, 2026 · Updated 4 months ago
- ☆17 · Mar 10, 2025 · Updated last year
- Advantage Alignment Algorithms (ICLR 2025 oral) ☆20 · Apr 7, 2025 · Updated last year
- Official implementation of our NeurIPS 2024 paper "Boundary Matters: A Bi-Level Active Finetuning Method" ☆14 · Feb 11, 2025 · Updated last year
- ☆51 · Mar 8, 2026 · Updated 3 months ago
- ☆18 · Aug 7, 2025 · Updated 10 months ago
- [CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent ☆32 · Apr 30, 2026 · Updated last month
- [RA-L 2026] Official code repository for "CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and … ☆43 · Apr 11, 2026 · Updated 2 months ago
- This folder contains some of the datasets commonly used in the field of multivariate time series forecasting. ☆10 · Jul 28, 2023 · Updated 2 years ago
- FakePartsBench: 25K+ AI-generated videos with pixel- and frame-level annotations of full and partial deepfakes. ☆25 · May 29, 2026 · Updated last week
- Source code for the paper "Frequency-aware Generative Models for Multivariate Time Series Imputation" ☆16 · May 22, 2024 · Updated 2 years ago
- ☆14 · Nov 13, 2025 · Updated 6 months ago
- Official implementation of 'All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining' published i… ☆47 · Oct 23, 2024 · Updated last year
- ☆17 · Apr 11, 2025 · Updated last year
- Using PCA, Autoencoder and Fisher linear discriminant to extract the effective representations from the face images. Do the reconstructio… ☆12 · Apr 23, 2019 · Updated 7 years ago
- ☆18 · Aug 14, 2024 · Updated last year
- ☆20 · Apr 15, 2025 · Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated 3 months ago
- Code for our AAAI 2023 paper: Efficient Embeddings of Logical Variables for Query Answering over Incomplete Knowledge Graphs (Ding… ☆10 · Dec 17, 2022 · Updated 3 years ago
- ☆20 · Dec 30, 2025 · Updated 5 months ago
- The code for MoG ☆22 · Aug 6, 2024 · Updated last year
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents ☆95 · May 5, 2026 · Updated last month
- Logical Message Passing Networks with One-hop Inference in Atomic Formulas (ICLR 2023) ☆15 · Jul 21, 2023 · Updated 2 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu. ☆17 · Sep 13, 2024 · Updated last year
- Official Implementation of Half-Hop ☆20 · Oct 10, 2023 · Updated 2 years ago
- ☆14 · Oct 29, 2020 · Updated 5 years ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection". ☆40 · Aug 4, 2025 · Updated 10 months ago
- GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment ☆18 · Sep 17, 2024 · Updated last year
- Modern utility library and TypeScript typings for building JSON Schema documents ☆14 · Nov 28, 2025 · Updated 6 months ago
- AI Agent-powered web browser: Agentic AI inside the browser. ☆22 · Jul 13, 2025 · Updated 10 months ago
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry ☆53 · Oct 9, 2025 · Updated 8 months ago
- AI Training Chip ☆13 · Jan 4, 2022 · Updated 4 years ago