[ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
☆146Feb 27, 2026Updated last month
Alternatives and similar repositories for NewtonBench
Users that are interested in NewtonBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A large-scale benchmark for detecting managerial evasion in earnings call Q&A.☆34Feb 5, 2026Updated 2 months ago
- Production-grade e-commerce platform for automotive accessories(or any), built with Next.js, PostgreSQL, Redis, and Stripe.☆41Dec 26, 2025Updated 3 months ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- This project aims to analyze physical activity data from children and adolescents to predict the extent of their problematic internet use…☆55Sep 24, 2025Updated 6 months ago
- ☆32Nov 11, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆162Mar 29, 2026Updated 2 weeks ago
- ☆14Aug 10, 2023Updated 2 years ago
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated 11 months ago
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation☆15Oct 9, 2025Updated 6 months ago
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- Slightly patched clone of canonical gevel project from Teodor Sigaev☆12Dec 16, 2021Updated 4 years ago
- A minimal example of Abductive Learning☆19Dec 6, 2023Updated 2 years ago
- WWW 2024: New Frontiers of Knowledge Graph Reasoning: Recent Advances and Future Trends☆18Mar 24, 2026Updated 3 weeks ago
- Benchmark for Answering Existential First Order Queries with Single Free Variable (NeurIPS dataset and benchmark 2021)☆20May 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- Deep Reinforcement Learning trading strategies: Double DQN with Transformer Attention + Multi-Factor Model (Fama-French inspired). Featur…☆70Mar 1, 2026Updated last month
- Nvidia In-Game Inference Framework Adapt to Unity☆32Dec 24, 2025Updated 3 months ago
- 极不平衡样本下的预测☆40Oct 28, 2025Updated 5 months ago
- 基于GO语言的客户端☆23Mar 3, 2025Updated last year
- python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation'☆12Jan 4, 2021Updated 5 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆65Jan 28, 2026Updated 2 months ago
- 基于GO语言的服务端☆24Mar 8, 2025Updated last year
- 🎁 Modern e-commerce system built with Go (Gin + Gorm + Redis + JWT). Enhanced version of yshop-gin with improved UI, performance and fea…☆37Oct 17, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 一个 vue3 ui组件库☆26Oct 27, 2025Updated 5 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 4 months ago
- ☆39Oct 22, 2025Updated 5 months ago
- ☆16Mar 10, 2024Updated 2 years ago
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- A golang_based streaming video website☆39Mar 7, 2023Updated 3 years ago
- your finance bro Agent for trading and investing☆109Nov 8, 2025Updated 5 months ago
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)☆30Feb 19, 2021Updated 5 years ago
- Multi-Agent,MCP,RAG,SpringAI1.0.0,RE-ACT☆113Jun 17, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AGS02MA full-featured driver for general-purpose MCU and Linux.☆12Oct 25, 2025Updated 5 months ago
- 🚀 2026机场推荐 | 长期、稳定的机场VPN平台。性价比机场,一元机场,翻墙机场,机场评测,科学上网,梯子。免费VPN,机场节点订阅,节点分享,Clash节点,V2ray节点,小火箭等代理软件🚀☆45Updated this week
- [CVPR 2026] SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation☆41Mar 18, 2026Updated 3 weeks ago
- This is small instance of blockChain using javascript.☆16Aug 26, 2022Updated 3 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆29Mar 18, 2026Updated 3 weeks ago
- ☆10Oct 30, 2021Updated 4 years ago
- 给新生用的 Introduction☆14Apr 8, 2026Updated last week