overview of datasets for ML in chemistry
☆402Oct 22, 2025Updated 6 months ago
Alternatives and similar repositories for awesome-chemistry-datasets
Users that are interested in awesome-chemistry-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ChemNLP project☆175Updated this week
- ☆256May 17, 2024Updated last year
- How good are LLMs at chemistry?☆137Jan 26, 2026Updated 3 months ago
- EPFL CH-457 "AI for chemistry"☆262Apr 3, 2026Updated last month
- molfeat - the hub for all your molecular featurizers☆226May 27, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Curated list of known efforts in collecting and/or curating of chemical/materials data☆23Dec 8, 2020Updated 5 years ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆173Jul 26, 2024Updated last year
- a curated list of resources for everyone interested in learning about digital chemistry☆42Jan 25, 2026Updated 3 months ago
- Repository of Jupyter Notebooks on Colab, Binder and Huggingface for Bio, Chemistry and Physics☆13Jul 29, 2023Updated 2 years ago
- DeepMol: A Machine and Deep Learning Framework for Computational Chemistry☆175Mar 26, 2026Updated last month
- A curated list of Python packages related to chemistry☆1,391Sep 21, 2025Updated 7 months ago
- Descriptor computation(chemistry) and (optional) storage for machine learning☆278Oct 26, 2024Updated last year
- A curated list of resources for machine learning for small-molecule drug discovery☆240Nov 25, 2023Updated 2 years ago
- ☆24Nov 24, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A curated list of Cheminformatics libraries and software.☆863Mar 15, 2024Updated 2 years ago
- Molecular Processing Made Easy.☆534Jun 10, 2024Updated last year
- pythonic interface to virtual screening software☆92Sep 4, 2025Updated 8 months ago
- Molecular Set Representation Learning☆51Jul 16, 2025Updated 9 months ago
- Extract structure-functions from data using XAI and LLMs☆27Jan 20, 2025Updated last year
- ☆11Jan 5, 2022Updated 4 years ago
- A quick tutorial for modern materials science, should the reader be not familiar with it and just wishing to crack the data☆16Aug 24, 2025Updated 8 months ago
- Synthesis generative model☆47Apr 24, 2025Updated last year
- Message Passing Neural Networks for Molecule Property Prediction☆2,354Apr 24, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RXNMapper: Unsupervised attention-guided atom-mapping. Code complementing our Science Advances publication on "Extraction of organic chem…☆366Feb 13, 2026Updated 2 months ago
- Official data repository for the Open Reaction Database☆331Updated this week
- Extracting medicinal chemistry intuition via preference machine learning☆119Oct 31, 2023Updated 2 years ago
- Package for Retrosynthetic Planning☆196Apr 30, 2026Updated last week
- A Sequence Generation Model for Reaction Diagram Parsing☆111Sep 18, 2023Updated 2 years ago
- Practical Cheminformatics Tutorials☆1,240May 2, 2026Updated last week
- A molecular identifier and descriptor for all domains of chemistry.☆26Dec 23, 2025Updated 4 months ago
- ☆25Jan 22, 2025Updated last year
- Transformer-based model for chemical reactions☆94Jan 14, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models☆538Jun 17, 2023Updated 2 years ago
- Utilities for working with datasets of chemical reactions, reaction templates and template extraction.☆93Apr 13, 2026Updated 3 weeks ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆292Oct 28, 2024Updated last year
- Robust representation of semantically constrained graphs, in particular for molecules in chemistry☆842May 17, 2025Updated 11 months ago
- Curated list of known efforts in materials informatics, i.e. in modern materials science☆507Mar 24, 2026Updated last month
- Example implementations of common machine learning projects in chemistry.☆184Feb 17, 2026Updated 2 months ago
- ☆19Aug 4, 2024Updated last year