Official repository for the paper "Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning" and the SciEvo benchmark.
☆44Jan 13, 2026Updated 2 months ago
Alternatives and similar repositories for Test-Time-Tool-Evol
Users that are interested in Test-Time-Tool-Evol are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Nov 11, 2025Updated 4 months ago
- Source code for SWIFT, an efficient reward model.☆20Jan 13, 2026Updated 2 months ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆57Feb 4, 2026Updated 2 months ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆18Apr 7, 2025Updated last year
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of the paper "In-context Time Series Predictor" (ICLR 2025)☆15Feb 11, 2025Updated last year
- ☆10Dec 15, 2023Updated 2 years ago
- This folder contains some of the data sets commonly used in the field of multivariate time series forecasting.☆10Jul 28, 2023Updated 2 years ago
- ☆36Nov 15, 2025Updated 4 months ago
- [ICLR26] AI-based scaling law discovery☆28Jan 30, 2026Updated 2 months ago
- ☆15Nov 13, 2025Updated 4 months ago
- Official implementation of 'All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining' published i…☆47Oct 23, 2024Updated last year
- ☆17Apr 11, 2025Updated 11 months ago
- Using PCA, Autoencoder and Fisher linear discriminant to extract the effective representations from the face images. Do the reconstructio…☆12Apr 23, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AIRS-Bench: an AI Research Science benchmark for quantifying the end-to-end AI research abilities of LLM agents☆71Mar 17, 2026Updated 3 weeks ago
- ☆18Aug 14, 2024Updated last year
- ☆20Apr 15, 2025Updated 11 months ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated last month
- ☆19Dec 30, 2025Updated 3 months ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆39Aug 4, 2025Updated 8 months ago
- the code of MoG☆20Aug 6, 2024Updated last year
- Implementation of the paper: "FedTabDiff: Federated Learning of Diffusion Models for Synthetic Mixed-Type Tabular Data Generation"☆23Nov 10, 2024Updated last year
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official Implementation of Half-Hop☆20Oct 10, 2023Updated 2 years ago
- ☆33Jul 15, 2025Updated 8 months ago
- ☆14Oct 29, 2020Updated 5 years ago
- 💻 autodl自动续签 防止实例过期释放 支持Docker部署 AutoDL Automatic Renewal: Prevent Instance Expiry and Release☆19Apr 3, 2026Updated last week
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆24May 27, 2025Updated 10 months ago
- AI-Driven Research Systems (ADRS)☆133Dec 17, 2025Updated 3 months ago
- GraphAlign: Pretraining One Graph Neural Network on Multiple Graphs via Feature Alignment☆18Sep 17, 2024Updated last year
- Modern utility library and typescript typings for building JSON Schema documents☆14Nov 28, 2025Updated 4 months ago
- AI Training Chip☆13Jan 4, 2022Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs☆18Jan 12, 2024Updated 2 years ago
- PeRL: Parameter-Efficient Reinforcement Learning☆74Updated this week
- [NeurIPS2024] Attractor memory for long-term time series forecasting: A chaos perspective☆23Nov 22, 2024Updated last year
- The first differentially-private diffusion model for tabular data☆34Jun 5, 2024Updated last year
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated last week
- ☆29Apr 30, 2024Updated last year
- SEGmentation using Graphs with Inexact aNd Incomplete labels☆21Mar 25, 2022Updated 4 years ago