Official repository for the paper "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" and the SciEvo benchmark.
☆42Jan 13, 2026Updated 3 months ago
Alternatives and similar repositories for Test-Time-Tool-Evol
Users that are interested in Test-Time-Tool-Evol are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for SWIFT, an efficient reward model.☆21Jan 13, 2026Updated 3 months ago
- Dynaseal is a dynamic API key management system designed to secure communications and identity verification for large model services. It …☆12Oct 30, 2024Updated last year
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆59Feb 4, 2026Updated 2 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 6 months ago
- Official code repository for "CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion".☆36Apr 11, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"☆14Feb 11, 2025Updated last year
- ☆135Mar 3, 2026Updated last month
- Time-RA: Towards Time Series Reasoning for Anomaly with LLM Feedback☆22Jan 10, 2026Updated 3 months ago
- ☆29Mar 10, 2026Updated last month
- ☆10Dec 15, 2023Updated 2 years ago
- The src for Paper "Frequency-aware Generative Models for Multivariate Time Series Imputation"☆15May 22, 2024Updated last year
- MCE: Clone Human Souls with LLM Native Agent Skills | 基于 LLM Agent Skills 的心智克隆工程 | Agent Skills | Mind Skills | Mind Clone☆53Dec 21, 2025Updated 4 months ago
- ☆14Nov 13, 2025Updated 5 months ago
- Official implementation of 'All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining' published i…☆47Oct 23, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆17Apr 11, 2025Updated last year
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆22Jul 18, 2025Updated 9 months ago
- Using PCA, Autoencoder and Fisher linear discriminant to extract the effective representations from the face images. Do the reconstructio…☆12Apr 23, 2019Updated 7 years ago
- ☆18Aug 14, 2024Updated last year
- Cell-Level RSRP Estimation with the Image-to-Image Wireless Propagation Model Based on Measured data.☆13Oct 10, 2023Updated 2 years ago
- ☆20Apr 15, 2025Updated last year
- ☆19Dec 30, 2025Updated 4 months ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 2 months ago
- Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and…☆18Oct 11, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023)☆13May 4, 2023Updated 2 years ago
- the code of MoG☆20Aug 6, 2024Updated last year
- Implementation of the paper: "FedTabDiff: Federated Learning of Diffusion Models for Synthetic Mixed-Type Tabular Data Generation"☆23Nov 10, 2024Updated last year
- ☆179Jan 19, 2026Updated 3 months ago
- Official Implementation of Half-Hop☆20Oct 10, 2023Updated 2 years ago
- ☆33Jul 15, 2025Updated 9 months ago
- ☆14Oct 29, 2020Updated 5 years ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆40Aug 4, 2025Updated 8 months ago
- [RSS 2026] LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion☆169Updated this week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- AI-Driven Research Systems (ADRS)☆138Dec 17, 2025Updated 4 months ago
- Modern utility library and typescript typings for building JSON Schema documents☆14Nov 28, 2025Updated 5 months ago
- AI Agent-powered web browser: Agentic AI inside the browser.☆22Jul 13, 2025Updated 9 months ago
- 💻 autodl自动续签 防止实例过期释放 支持Docker部署 AutoDL Automatic Renewal: Prevent Instance Expiry and Release☆20Apr 3, 2026Updated 3 weeks ago
- ☆16Jul 4, 2025Updated 9 months ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated last month
- The first differentially-private diffusion model for tabular data☆34Jun 5, 2024Updated last year