[ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
☆141Feb 27, 2026Updated 3 weeks ago
Alternatives and similar repositories for NewtonBench
Users that are interested in NewtonBench are comparing it to the libraries listed below.
- A large-scale benchmark for detecting managerial evasion in earnings call Q&A.☆33Feb 5, 2026Updated last month
- Production-grade e-commerce platform for automotive accessories (or any), built with Next.js, PostgreSQL, Redis, and Stripe.☆41Dec 26, 2025Updated 2 months ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- This project aims to analyze physical activity data from children and adolescents to predict the extent of their problematic internet use…☆54Sep 24, 2025Updated 6 months ago
- ☆32Nov 11, 2025Updated 4 months ago
- ☆14Aug 10, 2023Updated 2 years ago
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated 10 months ago
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…☆15Jun 5, 2024Updated last year
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation☆15Oct 9, 2025Updated 5 months ago
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- Slightly patched clone of canonical gevel project from Teodor Sigaev☆12Dec 16, 2021Updated 4 years ago
- A minimal example of Abductive Learning☆18Dec 6, 2023Updated 2 years ago
- WWW 2024: New Frontiers of Knowledge Graph Reasoning: Recent Advances and Future Trends☆18May 14, 2024Updated last year
- Benchmark for Answering Existential First Order Queries with Single Free Variable (NeurIPS dataset and benchmark 2021)☆20May 3, 2023Updated 2 years ago
- Deep Reinforcement Learning trading strategies: Double DQN with Transformer Attention + Multi-Factor Model (Fama-French inspired). Featur…☆61Mar 1, 2026Updated 3 weeks ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- Prediction under extremely imbalanced samples☆40Oct 28, 2025Updated 4 months ago
- Nvidia In-Game Inference Framework Adapt to Unity☆32Dec 24, 2025Updated 3 months ago
- Go-based client☆23Mar 3, 2025Updated last year
- Python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation'☆12Jan 4, 2021Updated 5 years ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆64Jan 28, 2026Updated last month
- Go-based server☆24Mar 8, 2025Updated last year
- 🎁 Modern e-commerce system built with Go (Gin + Gorm + Redis + JWT). Enhanced version of yshop-gin with improved UI, performance and fea…☆37Oct 17, 2025Updated 5 months ago
- A Vue 3 UI component library☆26Oct 27, 2025Updated 4 months ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆127Nov 19, 2025Updated 4 months ago
- ☆39Oct 22, 2025Updated 5 months ago
- ☆16Mar 10, 2024Updated 2 years ago
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- A Golang-based streaming video website☆39Mar 7, 2023Updated 3 years ago
- your finance bro Agent for trading and investing☆109Nov 8, 2025Updated 4 months ago
- [CVPR 2026] SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation☆39Updated this week
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)☆30Feb 19, 2021Updated 5 years ago
- Multi-Agent, MCP, RAG, SpringAI 1.0.0, RE-ACT☆113Jun 17, 2025Updated 9 months ago
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆28Nov 1, 2025Updated 4 months ago
- Full-featured AGS02MA driver for general-purpose MCUs and Linux.☆12Oct 25, 2025Updated 4 months ago
- This is a small instance of a blockchain using JavaScript.☆16Aug 26, 2022Updated 3 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆28Updated this week
- ☆10Oct 30, 2021Updated 4 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month