[AAAI 2024] SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research
☆30Aug 6, 2024Updated last year
Alternatives and similar repositories for SciEval
Users that are interested in SciEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆131Jul 8, 2024Updated last year
- ☆14Apr 16, 2024Updated 2 years ago
- ☆10Dec 20, 2023Updated 2 years ago
- ☆15Dec 4, 2023Updated 2 years ago
- [NeurIPS 24] Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation☆18Jan 2, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆86Feb 25, 2024Updated 2 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆26Mar 4, 2025Updated last year
- ☆10Apr 20, 2022Updated 3 years ago
- ☆16Jan 5, 2021Updated 5 years ago
- A quantitative benchmark and analysis of molecular large language models.☆18Jun 3, 2025Updated 10 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Nov 18, 2024Updated last year
- ☆11Jan 6, 2024Updated 2 years ago
- ☆23Feb 3, 2026Updated 2 months ago
- ☆11Dec 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A clean no-jargon mathematical definition of transforrmer language model with a Python implementation that focuses on clarity rather than…☆11Jul 23, 2022Updated 3 years ago
- Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design☆20Jul 26, 2023Updated 2 years ago
- Crawl traffic data from PEMS☆10Jul 19, 2021Updated 4 years ago
- The code of MultiSPANS.☆12Oct 20, 2023Updated 2 years ago
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆171Jul 26, 2024Updated last year
- source codes of TASSGN☆12Mar 11, 2024Updated 2 years ago
- Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics☆13Mar 6, 2025Updated last year
- Repository for ''Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale''☆14Apr 30, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Apr 10, 2025Updated last year
- MMER☆17Jan 8, 2026Updated 3 months ago
- ☆32May 10, 2025Updated 11 months ago
- Official code for "From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation" (ICLR2026)☆32Mar 1, 2026Updated last month
- ☆11Oct 26, 2023Updated 2 years ago
- ☆16Feb 17, 2025Updated last year
- 针对最经典的表格型Q learning算法进行了复现,能够支持gym中大多数的离散动作和状态空间的环境,譬如CliffWalking-v0。☆10Jan 2, 2021Updated 5 years ago
- Official code for the paper "Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network"☆16Aug 9, 2023Updated 2 years ago
- ☆15Aug 5, 2025Updated 8 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- a survey on deep research☆48Sep 9, 2025Updated 7 months ago
- ☆11Oct 5, 2024Updated last year
- ICLR 2025 paper: 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery☆28Apr 25, 2025Updated 11 months ago
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- ☆22Feb 21, 2026Updated last month
- Official Implementation of "GRIFFIN: Effective Token Alignment for Faster Speculative Decoding"[NeurIPS 2025]☆18May 12, 2025Updated 11 months ago
- [IJCAI'24] Official code for our paper "Make Graph Neural Networks Great Again: A Generic Integration Paradigm of Topology-Free Patterns …☆15Jul 3, 2025Updated 9 months ago