SciMT / SciMT-benchmark
☆11Updated last year
Alternatives and similar repositories for SciMT-benchmark:
Users that are interested in SciMT-benchmark are comparing it to the libraries listed below
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆15Updated 5 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆25Updated 4 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆38Updated last year
- ☆20Updated 4 years ago
- Pre-trained Language Model for Scientific Text☆44Updated last year
- Structured Chemistry Reasoning with Large Language Models☆35Updated 10 months ago
- Official implementation of paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.or…☆22Updated last month
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated this week
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated 11 months ago
- implementation of dualformer☆13Updated last month
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆40Updated 3 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆22Updated 9 months ago
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion☆16Updated last week
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆45Updated 3 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆49Updated 4 months ago
- A trainable user simulator☆34Updated 6 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated 10 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆26Updated last month
- exploring whether LLMs perform case-based or rule-based reasoning☆28Updated last year
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆13Updated 7 months ago
- ☆43Updated 11 months ago
- ☆10Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆16Updated 10 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆37Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆24Updated 10 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆74Updated 3 months ago
- ☆21Updated 8 months ago
- ☆20Updated 4 months ago
- ☆20Updated 8 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Updated last year