[ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
☆147Feb 27, 2026Updated 2 months ago
Alternatives and similar repositories for NewtonBench
Users that are interested in NewtonBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A large-scale benchmark for detecting managerial evasion in earnings call Q&A.☆34Feb 5, 2026Updated 3 months ago
- Production-grade e-commerce platform for automotive accessories(or any), built with Next.js, PostgreSQL, Redis, and Stripe.☆41Dec 26, 2025Updated 5 months ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- This project aims to analyze physical activity data from children and adolescents to predict the extent of their problematic internet use…☆55Sep 24, 2025Updated 8 months ago
- ☆32Nov 11, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Aug 10, 2023Updated 2 years ago
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated last year
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…☆15Jun 5, 2024Updated last year
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation☆15Oct 9, 2025Updated 7 months ago
- Implementations of Influential Recommender System☆12Oct 29, 2024Updated last year
- Slightly patched clone of canonical gevel project from Teodor Sigaev☆12Dec 16, 2021Updated 4 years ago
- A minimal example of Abductive Learning☆19Dec 6, 2023Updated 2 years ago
- WWW 2024: New Frontiers of Knowledge Graph Reasoning: Recent Advances and Future Trends☆18Mar 24, 2026Updated 2 months ago
- Benchmark for Answering Existential First Order Queries with Single Free Variable (NeurIPS dataset and benchmark 2021)☆20May 3, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods☆118May 15, 2026Updated last week
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆12Apr 27, 2024Updated 2 years ago
- Deep Reinforcement Learning trading strategies: Double DQN with Transformer Attention + Multi-Factor Model (Fama-French inspired). Featur…☆75May 12, 2026Updated last week
- Nvidia In-Game Inference Framework Adapt to Unity☆33Dec 24, 2025Updated 5 months ago
- 极不平衡样本下的预测☆40Oct 28, 2025Updated 6 months ago
- 基于GO语 言的客户端☆23Mar 3, 2025Updated last year
- python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation'☆13Jan 4, 2021Updated 5 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆68Jan 28, 2026Updated 3 months ago
- 基于GO语言的服务端☆24Mar 8, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🎬 灵果短剧AI - 基于AI的一站式短剧/漫剧生成平台 《一句话生成完整短剧/漫剧,从剧本到成 片短剧/漫剧全自动化》Lingguo-Drama AI-An AI-powered one-stop generation platform for mini-dramas a…☆791Mar 28, 2026Updated last month
- 一个 vue3 ui组件库☆26Oct 27, 2025Updated 6 months ago
- Multimodal Document Intelligence Platform☆41Apr 10, 2026Updated last month
- General Bionic Social Program Architecture☆51May 15, 2026Updated last week
- 🎁 Modern e-commerce system built with Go (Gin + Gorm + Redis + JWT). Enhanced version of yshop-gin with improved UI, performance and fea…☆37Oct 17, 2025Updated 7 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆129Apr 26, 2026Updated 3 weeks ago
- ☆16Mar 10, 2024Updated 2 years ago
- ☆39Oct 22, 2025Updated 7 months ago
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Personal-Model First Self Evolving AI Agent 🐘☆466Updated this week
- A golang_based streaming video website☆39Mar 7, 2023Updated 3 years ago
- your finance bro Agent for trading and investing☆109Nov 8, 2025Updated 6 months ago
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)☆30Feb 19, 2021Updated 5 years ago
- Multi-Agent,MCP,RAG,SpringAI1.0.0,RE-ACT☆116Jun 17, 2025Updated 11 months ago
- AGS02MA full-featured driver for general-purpose MCU and Linux.☆12Oct 25, 2025Updated 7 months ago
- [CVPR 2026] SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation☆42Mar 18, 2026Updated 2 months ago