HKUST-KnowComp / NewtonBenchLinks
NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
☆131Updated last week
Alternatives and similar repositories for NewtonBench
Users that are interested in NewtonBench are comparing it to the libraries listed below
Sorting:
- Awesome Literature Graph Learning Challenges☆99Updated 2 months ago
- Official repository of DARE: dLLM Alignment and Reinforcement Executor☆117Updated this week
- Code implementation of the paper accepted by IEEE TKDE2024: "Make Heterophilic Graphs Better Fit GNN: A Graph Rewiring Approach"☆110Updated last year
- The codes for the paper One-bit Deep Hashing: Towards a Resource-Efficient Hashing Model with Binary Neural Networks (ACMMM24)☆45Updated 9 months ago
- [NeurIPS 2025] Official implementation of "STRAP: Spatio-Temporal Pattern Retrieval for Out-of-Distribution Generalization"☆75Updated last month
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆142Updated last week
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆176Updated 5 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆236Updated 4 months ago
- A benchmark suite for evaluating LLM-based interactive scientific reasoning.☆91Updated last month
- Official implementation of CIKM2024 paper titled "PROSPECT: Learn MLPs on Graphs Robust against Adversarial Structure Attacks"☆22Updated 10 months ago
- Spring项目:支持设置时间、价格、距离权重的个性化导航服务,并支持根据大量用户行驶状态更新道路情况和预计到达时间☆22Updated 7 months ago
- ☆356Updated 5 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆164Updated last month
- Repository for the paper:☆69Updated last year
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://openreview.net/pdf?id=SlRtFwBdzP☆164Updated 3 months ago
- A light-weight framework for building llm agentic systems with additional supports for program synthesis and neural-symbolic research.☆87Updated 3 weeks ago
- A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…☆343Updated last week
- ☆86Updated 9 months ago
- An Integrated Library for Tuning, Deploying and Interpreting Genomic Models☆119Updated 3 months ago
- (EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam☆95Updated 4 months ago
- The code for paper "Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review" accepted by ACL 2025.☆103Updated 7 months ago
- The code for Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models (Finding of ACL2025)☆83Updated 5 months ago
- ☆114Updated this week
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆171Updated last year
- ☆79Updated 10 months ago
- ☆73Updated 3 weeks ago
- This is a pytorch project for the paper Universal Adaptive Data Augmentation (IJCAI2023).☆86Updated 4 months ago
- MAX31855 full-featured driver library for general-purpose MCU and Linux.☆70Updated last month
- Openai API Cost Tracker☆21Updated last year
- PCF8563 full-featured driver library for general-purpose MCU and Linux.☆30Updated last month