[ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
☆147Feb 27, 2026Updated 2 months ago
Alternatives and similar repositories for NewtonBench
Users that are interested in NewtonBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A large-scale benchmark for detecting managerial evasion in earnings call Q&A.☆34Feb 5, 2026Updated 3 months ago
- Production-grade e-commerce platform for automotive accessories(or any), built with Next.js, PostgreSQL, Redis, and Stripe.☆41Dec 26, 2025Updated 4 months ago
- Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…☆13Oct 30, 2024Updated last year
- This project aims to analyze physical activity data from children and adolescents to predict the extent of their problematic internet use…☆55Sep 24, 2025Updated 7 months ago
- ☆32Nov 11, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆162Mar 29, 2026Updated last month
- ☆14Aug 10, 2023Updated 2 years ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆19Jun 25, 2024Updated last year
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47May 15, 2025Updated 11 months ago
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…☆15Jun 5, 2024Updated last year
- [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation☆15Oct 9, 2025Updated 6 months ago
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- Slightly patched clone of canonical gevel project from Teodor Sigaev☆12Dec 16, 2021Updated 4 years ago
- A minimal example of Abductive Learning☆19Dec 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- WWW 2024: New Frontiers of Knowledge Graph Reasoning: Recent Advances and Future Trends☆18Mar 24, 2026Updated last month
- Benchmark for Answering Existential First Order Queries with Single Free Variable (NeurIPS dataset and benchmark 2021)☆20May 3, 2023Updated 3 years ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆12Apr 27, 2024Updated 2 years ago
- 🎬 灵果短剧AI - 基于AI的一站式短剧/漫剧生成平台 《一句话生成完整短剧/漫剧,从剧本到成片短剧/漫剧全自动化》Lingguo-Drama AI-An AI-powered one-stop generation platform for mini-dramas a…☆576Mar 28, 2026Updated last month
- Nvidia In-Game Inference Framework Adapt to Unity☆33Dec 24, 2025Updated 4 months ago
- Deep Reinforcement Learning trading strategies: Double DQN with Transformer Attention + Multi-Factor Model (Fama-French inspired). Featur…☆75Apr 17, 2026Updated 2 weeks ago
- 极不平衡样本下的预测☆40Oct 28, 2025Updated 6 months ago
- 基于GO语言的客户端☆23Mar 3, 2025Updated last year
- python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation'☆12Jan 4, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆66Jan 28, 2026Updated 3 months ago
- 基于GO语言的服务端☆24Mar 8, 2025Updated last year
- 一个 vue3 ui组件库☆26Oct 27, 2025Updated 6 months ago
- 🎁 Modern e-commerce system built with Go (Gin + Gorm + Redis + JWT). Enhanced version of yshop-gin with improved UI, performance and fea…☆37Oct 17, 2025Updated 6 months ago
- Multimodal Document Intelligence Platform☆41Apr 10, 2026Updated 3 weeks ago
- Predicting brain activity from word embeddings during natural language comprehension☆24Feb 20, 2024Updated 2 years ago
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆128Apr 26, 2026Updated last week
- ☆16Mar 10, 2024Updated 2 years ago
- ☆39Oct 22, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a niche collection of research papers which are proven to be gradients pushing the field of Natural Language Processing, Deep Lea…☆25Nov 19, 2024Updated last year
- A golang_based streaming video website☆39Mar 7, 2023Updated 3 years ago
- your finance bro Agent for trading and investing☆109Nov 8, 2025Updated 5 months ago
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)☆30Feb 19, 2021Updated 5 years ago
- Multi-Agent,MCP,RAG,SpringAI1.0.0,RE-ACT☆113Jun 17, 2025Updated 10 months ago
- AGS02MA full-featured driver for general-purpose MCU and Linux.☆12Oct 25, 2025Updated 6 months ago
- [CVPR 2026] SeeU: Seeing the Unseen World via 4D Dynamics-aware Generation☆41Mar 18, 2026Updated last month