[ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
☆137Dec 15, 2025Updated 2 months ago
Alternatives and similar repositories for NewtonBench
Users that are interested in NewtonBench are comparing it to the libraries listed below
Sorting:
- A large-scale benchmark for detecting managerial evasion in earnings call Q&A.☆33Feb 5, 2026Updated 3 weeks ago
- Production-grade e-commerce platform for automotive accessories(or any), built with Next.js, PostgreSQL, Redis, and Stripe.☆41Dec 26, 2025Updated 2 months ago
- This project aims to analyze physical activity data from children and adolescents to predict the extent of their problematic internet use…☆54Sep 24, 2025Updated 5 months ago
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated 9 months ago
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆27Nov 1, 2025Updated 4 months ago
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- ☆15Nov 18, 2025Updated 3 months ago
- 🏅토스 NEXT ML CHALLENGE : 광고 클릭 예측(CTR) 대회 5등 모델 제출용 레포지토리🏅☆26Feb 2, 2026Updated last month
- Nvidia In-Game Inference Framework Adapt to Unity☆32Dec 24, 2025Updated 2 months ago
- ☆16Mar 10, 2024Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- 包装计算(C#版)☆14May 20, 2025Updated 9 months ago
- Agentic Virtual Lab☆19Nov 30, 2025Updated 3 months ago
- Slightly patched clone of canonical gevel project from Teodor Sigaev☆12Dec 16, 2021Updated 4 years ago
- PyCausalSim is a Python framework for discovering and validating causal relationships through simulation. Unlike traditional analytics th…☆32Dec 8, 2025Updated 2 months ago
- A golang_based streaming video website☆39Mar 7, 2023Updated 2 years ago
- 一个 vue3 ui组件库☆26Oct 27, 2025Updated 4 months ago
- ☆14Aug 10, 2023Updated 2 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆27Jan 10, 2026Updated last month
- python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation'☆12Jan 4, 2021Updated 5 years ago
- ☆31Dec 3, 2025Updated 3 months ago
- This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑Language Models via Geom…☆27Nov 7, 2025Updated 3 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 3 months ago
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation☆15Oct 9, 2025Updated 4 months ago
- Code for "MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis"☆35Jul 4, 2025Updated 8 months ago
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…☆15Jun 5, 2024Updated last year
- Face Identification using ONNX Runtime☆13Jul 4, 2024Updated last year
- ☆32Nov 11, 2025Updated 3 months ago
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆31Updated this week
- PyTorch implementation of QKAN "Quantum-inspired Kolmogorov-Arnold Network" https://arxiv.org/abs/2509.14026☆20Updated this week
- <핸즈온 LLM>(한빛미디어, 2025)의 예제 코드 저장소☆34Jan 4, 2026Updated 2 months ago
- your finance bro Agent for trading and investing☆108Nov 8, 2025Updated 3 months ago
- A modern web application for the Melbourne University Ultimate Frisbee Club, built with Next.js 15, TypeScript, and Tailwind CSS. This pl…☆101Jul 28, 2025Updated 7 months ago
- Multi-Agent,MCP,RAG,SpringAI1.0.0,RE-ACT☆113Jun 17, 2025Updated 8 months ago
- This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…☆25Sep 29, 2025Updated 5 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆53Nov 4, 2025Updated 4 months ago
- ☆10Oct 30, 2021Updated 4 years ago