Reproducible and flexible LLM evaluations for scientific reasoning.
☆27Jul 23, 2025Updated 8 months ago
Alternatives and similar repositories for lm-open-science-evaluation
Users that are interested in lm-open-science-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆22Feb 17, 2025Updated last year
- ☆18Mar 2, 2026Updated 3 weeks ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆115Feb 2, 2026Updated last month
- ☆13Nov 11, 2022Updated 3 years ago
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆24May 27, 2025Updated 10 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆24Nov 25, 2025Updated 4 months ago
- Subgraph Based Learning of Contextual Embedding☆29Nov 5, 2021Updated 4 years ago
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆18Dec 24, 2024Updated last year
- ☆13Jun 16, 2021Updated 4 years ago
- ☆16Sep 4, 2025Updated 6 months ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Aug 5, 2025Updated 7 months ago
- A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models☆22Feb 16, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Repository for storing codes about experiments of computer graphics lessons.☆16Nov 2, 2022Updated 3 years ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 8 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated 11 months ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆243Jan 26, 2026Updated 2 months ago
- ☆11Jun 4, 2021Updated 4 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Repository containing the group project Wind Power Forecasting for DTU's 02456 Deep Learning.☆13Apr 7, 2022Updated 3 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 6 months ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation"☆24Jul 19, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 6 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Dec 25, 2025Updated 3 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆81Apr 10, 2023Updated 2 years ago
- ☆10Dec 20, 2023Updated 2 years ago
- ESL-Note 是阅读 ESL中文版 的笔记。笔记中对书中出现的公式进行了详细的推导,习题也进行了求解,与中文版中的做法有所差异并且加入了知识补充和扩展部分。☆25Mar 7, 2022Updated 4 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆16Dec 12, 2024Updated last year
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- contains quantum neural network and quantum transformer repos☆11Apr 11, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Encoder-decoders for translating different chemical formats.☆19Sep 17, 2025Updated 6 months ago
- RetroDFM-R: Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning☆20Nov 22, 2025Updated 4 months ago
- ☆10Sep 27, 2021Updated 4 years ago
- ☆15Jan 8, 2020Updated 6 years ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- ☆36Feb 15, 2024Updated 2 years ago
- ☆13May 23, 2025Updated 10 months ago