[AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research
☆30Aug 6, 2024Updated last year
Alternatives and similar repositories for SciEval
Users that are interested in SciEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆133Jul 8, 2024Updated last year
- ☆14Apr 16, 2024Updated 2 years ago
- ☆10Dec 20, 2023Updated 2 years ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆17Dec 12, 2024Updated last year
- ☆15Dec 4, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆18Jan 2, 2026Updated 4 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆86Feb 25, 2024Updated 2 years ago
- [EMNLP 2022] The baseline code for META-GUI dataset☆15Jul 9, 2024Updated last year
- CDQA: Chinese Dynamic Question Answering Benchmark☆18Dec 13, 2024Updated last year
- ☆10Apr 20, 2022Updated 4 years ago
- ☆16Jan 5, 2021Updated 5 years ago
- A quantitative benchmark and analysis of molecular large language models.☆19Jun 3, 2025Updated 11 months ago
- Paper to Reviewer Assignment is a tedious but a very crucial job for conference organizers. Till date the Toronto Paper Matching System (…☆10Nov 30, 2017Updated 8 years ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Nov 18, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Jan 6, 2024Updated 2 years ago
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆45Nov 19, 2025Updated 5 months ago
- ☆23Feb 3, 2026Updated 3 months ago
- ☆11Dec 22, 2024Updated last year
- Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design☆20Jul 26, 2023Updated 2 years ago
- The code of MultiSPANS.☆12Oct 20, 2023Updated 2 years ago
- [JBHI 2024] CareSleepNet: A Hybrid Deep Learning Network for Automatic Sleep Staging☆16Jun 6, 2025Updated 11 months ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆173Jul 26, 2024Updated last year
- Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics☆13Mar 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository for ''Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale''☆14Apr 30, 2024Updated 2 years ago
- ☆11Apr 10, 2025Updated last year
- MMER☆18Jan 8, 2026Updated 3 months ago
- ☆33May 10, 2025Updated 11 months ago
- Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" (ICLR2026)☆36Mar 1, 2026Updated 2 months ago
- ☆11Oct 26, 2023Updated 2 years ago
- ☆16Feb 17, 2025Updated last year
- ☆53Apr 8, 2026Updated 3 weeks ago
- 针对最经典的表格型Q learning算法进行了复现,能够支持gym中大多数的离散动作和状态空间的环境,譬如CliffWalking-v0。☆10Jan 2, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official code for the paper "Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network"☆16Aug 9, 2023Updated 2 years ago
- ☆15Aug 5, 2025Updated 9 months ago
- a survey on deep research☆48Sep 9, 2025Updated 7 months ago
- ☆44Jun 20, 2025Updated 10 months ago
- ☆11Oct 5, 2024Updated last year
- ICLR 2025 paper: 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery☆28Apr 25, 2025Updated last year
- ☆25Feb 21, 2026Updated 2 months ago