Reproducible and flexible LLM evaluations for scientific reasoning.
☆28Jul 23, 2025Updated 9 months ago
Alternatives and similar repositories for lm-open-science-evaluation
Users that are interested in lm-open-science-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"☆23Feb 17, 2025Updated last year
- ☆18Mar 2, 2026Updated 2 months ago
- ☆79May 22, 2024Updated last year
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Oct 22, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆25May 27, 2025Updated 11 months ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆27Jul 13, 2025Updated 9 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- A library for open domain query facet extraction and generation☆16Apr 24, 2024Updated 2 years ago
- This is a repo consisting of papers about LLMs' perception of their knowledge boundaries; Uncertainty Quantification; Honesty Alignment; …☆25Nov 25, 2025Updated 5 months ago
- 识别工厂中托盘和托盘上的孔☆14Sep 11, 2023Updated 2 years ago
- ☆27Oct 7, 2025Updated 7 months ago
- Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.☆52Mar 8, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Subgraph Based Learning of Contextual Embedding☆29Nov 5, 2021Updated 4 years ago
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆19Dec 24, 2024Updated last year
- ☆13Jun 16, 2021Updated 4 years ago
- ☆16Sep 4, 2025Updated 8 months ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Aug 5, 2025Updated 9 months ago
- SIGIR 2022: GERE: Generative Evidence Retrieval for Fact Verification☆20Jul 19, 2022Updated 3 years ago
- Repository for storing codes about experiments of computer graphics lessons.☆16Nov 2, 2022Updated 3 years ago
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 10 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14Apr 16, 2024Updated 2 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- 2022春季学期清华大学计算机图形学大作业☆12Mar 4, 2023Updated 3 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery☆20Sep 24, 2025Updated 7 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆82Dec 25, 2025Updated 4 months ago
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection☆30Oct 9, 2023Updated 2 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆84Apr 10, 2023Updated 3 years ago
- ESL-Note 是阅读 ESL中文版 的笔记。笔记中对书中出现的公式进行了详细的推导,习题也进行了求解,与中文版中的做法有所差异并且加入了知识补充和扩展部分。☆25Mar 7, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Oct 17, 2021Updated 4 years ago
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- Encoder-decoders for translating different chemical formats.☆20Sep 17, 2025Updated 7 months ago
- RetroDFM-R: Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning☆21Nov 22, 2025Updated 5 months ago
- ☆10Sep 27, 2021Updated 4 years ago
- Using tensorflow/serving to deploy kashgari model for time training and predicting.☆13Sep 16, 2019Updated 6 years ago
- ☆16Jan 8, 2020Updated 6 years ago