[AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research
☆30Aug 6, 2024Updated last year
Alternatives and similar repositories for SciEval
Users that are interested in SciEval are comparing it to the libraries listed below.
- ☆14Apr 16, 2024Updated 2 years ago
- ☆10Dec 20, 2023Updated 2 years ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆17Dec 12, 2024Updated last year
- ☆15Dec 4, 2023Updated 2 years ago
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆19Jan 2, 2026Updated 4 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆87Feb 25, 2024Updated 2 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆26Mar 4, 2025Updated last year
- [EMNLP 2022] The baseline code for META-GUI dataset☆15Jul 9, 2024Updated last year
- ☆10Apr 20, 2022Updated 4 years ago
- ☆16Jan 5, 2021Updated 5 years ago
- A quantitative benchmark and analysis of molecular large language models.☆19Jun 3, 2025Updated 11 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Nov 18, 2024Updated last year
- ☆11Jan 6, 2024Updated 2 years ago
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆45Nov 19, 2025Updated 6 months ago
- ☆23Feb 3, 2026Updated 3 months ago
- ☆11Dec 22, 2024Updated last year
- Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design☆20Jul 26, 2023Updated 2 years ago
- Crawl traffic data from PEMS☆10Jul 19, 2021Updated 4 years ago
- The code of MultiSPANS.☆12Oct 20, 2023Updated 2 years ago
- [JBHI 2024] CareSleepNet: A Hybrid Deep Learning Network for Automatic Sleep Staging☆17Jun 6, 2025Updated 11 months ago
- source codes of TASSGN☆12Mar 11, 2024Updated 2 years ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆174Jul 26, 2024Updated last year
- Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics☆13Mar 6, 2025Updated last year
- ☆11Apr 10, 2025Updated last year
- ☆35Oct 4, 2025Updated 7 months ago
- MMER☆18Jan 8, 2026Updated 4 months ago
- ☆33May 10, 2025Updated last year
- Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" (ICLR2026)☆36Mar 1, 2026Updated 2 months ago
- Online BaseHangul Encoder And Decoder☆13Jan 30, 2023Updated 3 years ago
- ☆11Oct 26, 2023Updated 2 years ago
- ☆16Feb 17, 2025Updated last year
- A reproduction of the classic tabular Q-learning algorithm that supports most gym environments with discrete action and state spaces, such as CliffWalking-v0.☆10Jan 2, 2021Updated 5 years ago
- ☆56Apr 8, 2026Updated last month
- ☆15Aug 5, 2025Updated 9 months ago
- ☆44Jun 20, 2025Updated 11 months ago
- ☆11Oct 5, 2024Updated last year
- ICLR 2025 paper: 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery☆28Apr 25, 2025Updated last year
- MSTGAN is an innovative method designed for multi-station urban air quality prediction, which fully considers the individual, global, and…☆20Jul 13, 2024Updated last year
- ☆12Jun 13, 2025Updated 11 months ago