SciMT / SciMT-benchmark
☆11 · Updated last year
Alternatives and similar repositories for SciMT-benchmark
Users interested in SciMT-benchmark are comparing it to the repositories listed below:
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models ☆15 · Updated 2 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024] ☆22 · Updated 2 months ago
- Structured Chemistry Reasoning with Large Language Models ☆31 · Updated 8 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆43 · Updated 2 months ago
- ☆42 · Updated 9 months ago
- Pre-trained Language Model for Scientific Text ☆44 · Updated 10 months ago
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA ☆23 · Updated 9 months ago
- A trainable user simulator ☆32 · Updated 4 months ago
- ☆12 · Updated 5 months ago
- Exploring whether LLMs perform case-based or rule-based reasoning ☆28 · Updated 10 months ago
- Call for participation in the impact of LLM for scientific discovery ☆62 · Updated 9 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆37 · Updated last year
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025) ☆42 · Updated last month
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆35 · Updated last month
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆16 · Updated 2 months ago
- Part of official implementation of "Natural language-informed learning of molecule graphs" ☆15 · Updated last year
- What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks ☆133 · Updated 5 months ago
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs. ☆25 · Updated 3 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆75 · Updated 3 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence ☆8 · Updated 3 months ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries ☆41 · Updated 10 months ago
- ☆25 · Updated 7 months ago
- ☆15 · Updated 2 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024 ☆21 · Updated 6 months ago
- Retrieved Sequence Augmentation for Protein Representation Learning ☆49 · Updated last year
- SciAssess is a comprehensive benchmark for evaluating Large Language Models' proficiency in scientific literature analysis across various… ☆67 · Updated 3 months ago
- Code and Data Repo for [NeurIPS 2024] Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆21 · Updated 7 months ago
- Official implementation of paper "General Preference Modeling with Preference Representations for Aligning Language Models" (https://arxi… ☆21 · Updated last month
- This repository contains information on the creation, evaluation, and benchmark models for the L+M-24 Dataset. L+M-24 will be featured as… ☆26 · Updated last month
- [EMNLP 2023] ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction. ☆18 · Updated 11 months ago