Reproducible and flexible LLM evaluations for scientific reasoning.
☆28Jul 23, 2025Updated 10 months ago
Alternatives and similar repositories for lm-open-science-evaluation
Users that are interested in lm-open-science-evaluation are comparing it to the libraries listed below.
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆23Feb 17, 2025Updated last year
- ☆18Mar 2, 2026Updated 3 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated last year
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆121Feb 2, 2026Updated 4 months ago
- ☆13Nov 11, 2022Updated 3 years ago
- ☆79May 22, 2024Updated 2 years ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆25Oct 22, 2025Updated 7 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated last year
- Reformatted Alignment☆111Sep 23, 2024Updated last year
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- A library for open domain query facet extraction and generation☆16Apr 24, 2024Updated 2 years ago
- ☆16Sep 6, 2024Updated last year
- ☆12Jun 13, 2025Updated last year
- ☆108Oct 7, 2025Updated 8 months ago
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆21Dec 24, 2024Updated last year
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆20Aug 5, 2025Updated 10 months ago
- ☆16Sep 4, 2025Updated 9 months ago
- SIGIR 2022: GERE: Generative Evidence Retrieval for Fact Verification☆20Jul 19, 2022Updated 3 years ago
- A project that can generate ancient poems based on pictures, including CLIP, T5, GPT2 models☆21Feb 16, 2025Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- ☆14Apr 16, 2024Updated 2 years ago
- Tsinghua University Computer Graphics course project, Spring 2022 semester☆12Mar 4, 2023Updated 3 years ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆253Jan 26, 2026Updated 4 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- Repository containing the group project Wind Power Forecasting for DTU's 02456 Deep Learning.☆13Apr 7, 2022Updated 4 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 8 months ago
- [ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback☆69Jun 3, 2026Updated 2 weeks ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆82Dec 25, 2025Updated 5 months ago
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Oct 9, 2023Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆84Apr 10, 2023Updated 3 years ago
- ☆10Dec 20, 2023Updated 2 years ago
- NLP stuff with quantum computing☆17Nov 9, 2020Updated 5 years ago
- Solution of KDD cup 2021☆11Jun 16, 2021Updated 5 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- Entity extraction built on BERT and pointer networks☆14Aug 2, 2020Updated 5 years ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆19Dec 12, 2024Updated last year
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago