overview of datasets for ML in chemistry
☆399 · Oct 22, 2025 · Updated 5 months ago
Alternatives and similar repositories for awesome-chemistry-datasets
Users who are interested in awesome-chemistry-datasets are comparing it to the libraries listed below.
- ChemNLP project ☆175 · Updated this week
- ☆258 · May 17, 2024 · Updated last year
- How good are LLMs at chemistry? ☆135 · Jan 26, 2026 · Updated 2 months ago
- EPFL CH-457 "AI for chemistry" ☆259 · Apr 3, 2026 · Updated 2 weeks ago
- molfeat - the hub for all your molecular featurizers ☆224 · May 27, 2025 · Updated 10 months ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023) ☆171 · Jul 26, 2024 · Updated last year
- Curated list of known efforts in collecting and/or curating of chemical/materials data ☆23 · Dec 8, 2020 · Updated 5 years ago
- a curated list of resources for everyone interested in learning about digital chemistry ☆41 · Jan 25, 2026 · Updated 2 months ago
- Repository of Jupyter Notebooks on Colab, Binder and Huggingface for Bio, Chemistry and Physics ☆13 · Jul 29, 2023 · Updated 2 years ago
- DeepMol: A Machine and Deep Learning Framework for Computational Chemistry ☆174 · Mar 26, 2026 · Updated 3 weeks ago
- A curated list of Python packages related to chemistry ☆1,380 · Sep 21, 2025 · Updated 6 months ago
- Descriptor computation (chemistry) and (optional) storage for machine learning ☆278 · Oct 26, 2024 · Updated last year
- A curated list of resources for machine learning for small-molecule drug discovery ☆240 · Nov 25, 2023 · Updated 2 years ago
- ☆24 · Nov 24, 2024 · Updated last year
- A curated list of Cheminformatics libraries and software. ☆854 · Mar 15, 2024 · Updated 2 years ago
- Molecular Processing Made Easy. ☆533 · Jun 10, 2024 · Updated last year
- pythonic interface to virtual screening software ☆92 · Sep 4, 2025 · Updated 7 months ago
- Molecular Set Representation Learning ☆51 · Jul 16, 2025 · Updated 9 months ago
- Extract structure-functions from data using XAI and LLMs ☆27 · Jan 20, 2025 · Updated last year
- ☆11 · Jan 5, 2022 · Updated 4 years ago
- A quick tutorial for modern materials science, should the reader be not familiar with it and just wishing to crack the data ☆16 · Aug 24, 2025 · Updated 7 months ago
- Message Passing Neural Networks for Molecule Property Prediction ☆2,331 · Apr 7, 2026 · Updated last week
- Synthesis generative model ☆47 · Apr 24, 2025 · Updated 11 months ago
- Official data repository for the Open Reaction Database ☆329 · Apr 10, 2026 · Updated last week
- RXNMapper: Unsupervised attention-guided atom-mapping. Code complementing our Science Advances publication on "Extraction of organic chem… ☆365 · Feb 13, 2026 · Updated 2 months ago
- Extracting medicinal chemistry intuition via preference machine learning ☆118 · Oct 31, 2023 · Updated 2 years ago
- Package for Retrosynthetic Planning ☆193 · Apr 10, 2026 · Updated last week
- Practical Cheminformatics Tutorials ☆1,221 · Mar 31, 2026 · Updated 2 weeks ago
- A molecular identifier and descriptor for all domains of chemistry. ☆26 · Dec 23, 2025 · Updated 3 months ago
- ☆25 · Jan 22, 2025 · Updated last year
- Transformer-based model for chemical reactions ☆94 · Jan 14, 2026 · Updated 3 months ago
- [IJCAI 2023 survey track] A curated list of resources for chemical pre-trained models ☆537 · Jun 17, 2023 · Updated 2 years ago
- Utilities for working with datasets of chemical reactions, reaction templates and template extraction. ☆92 · Feb 9, 2026 · Updated 2 months ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models ☆293 · Oct 28, 2024 · Updated last year
- Robust representation of semantically constrained graphs, in particular for molecules in chemistry ☆838 · May 17, 2025 · Updated 11 months ago
- Curated list of known efforts in materials informatics, i.e. in modern materials science ☆503 · Mar 24, 2026 · Updated 3 weeks ago
- Example implementations of common machine learning projects in chemistry. ☆184 · Feb 17, 2026 · Updated 2 months ago
- ☆19 · Aug 4, 2024 · Updated last year
- open data sets for machine learning pertaining to porous materials ☆28 · Nov 28, 2023 · Updated 2 years ago