overview of datasets for ML in chemistry
☆411Oct 22, 2025Updated 7 months ago
Alternatives and similar repositories for awesome-chemistry-datasets
Users that are interested in awesome-chemistry-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ChemNLP project☆176Updated this week
- ☆259May 17, 2024Updated 2 years ago
- How good are LLMs at chemistry?☆141Jan 26, 2026Updated 4 months ago
- EPFL CH-457 "AI for chemistry"☆278Apr 3, 2026Updated 2 months ago
- molfeat - the hub for all your molecular featurizers☆230May 27, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Curated list of known efforts in collecting and/or curating of chemical/materials data☆24Dec 8, 2020Updated 5 years ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆173Jul 26, 2024Updated last year
- a curated list of resources for everyone interested in learning about digital chemistry☆48Jan 25, 2026Updated 4 months ago
- Repository of Jupyter Notebooks on Colab, Binder and Huggingface for Bio, Chemistry and Physics☆13Jul 29, 2023Updated 2 years ago
- A curated list of Python packages related to chemistry☆1,401Sep 21, 2025Updated 8 months ago
- DeepMol: A Machine and Deep Learning Framework for Computational Chemistry☆178Mar 26, 2026Updated 2 months ago
- Descriptor computation(chemistry) and (optional) storage for machine learning☆281Oct 26, 2024Updated last year
- A curated list of resources for machine learning for small-molecule drug discovery☆241Nov 25, 2023Updated 2 years ago
- ☆24Nov 24, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A curated list of Cheminformatics libraries and software.☆869Mar 15, 2024Updated 2 years ago
- Molecular Processing Made Easy.☆538May 20, 2026Updated 3 weeks ago
- pythonic interface to virtual screening software☆92Sep 4, 2025Updated 9 months ago
- Molecular Set Representation Learning☆51Jul 16, 2025Updated 11 months ago
- Extract structure-functions from data using XAI and LLMs☆27Jan 20, 2025Updated last year
- ☆11Jan 5, 2022Updated 4 years ago
- A quick tutorial for modern materials science, should the reader be not familiar with it and just wishing to crack the data☆16Aug 24, 2025Updated 9 months ago
- Synthesis generative model☆47Apr 24, 2025Updated last year
- Message Passing Neural Networks for Molecule Property Prediction☆2,390Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- RXNMapper: Unsupervised attention-guided atom-mapping. Code complementing our Science Advances publication on "Extraction of organic chem…☆376Feb 13, 2026Updated 4 months ago
- Official data repository for the Open Reaction Database☆334Jun 12, 2026Updated last week
- Extracting medicinal chemistry intuition via preference machine learning☆121Jun 12, 2026Updated last week
- A Sequence Generation Model for Reaction Diagram Parsing☆114Sep 18, 2023Updated 2 years ago
- Practical Cheminformatics Tutorials☆1,267May 2, 2026Updated last month
- Package for Retrosynthetic Planning☆198Jun 10, 2026Updated last week
- A molecular identifier and descriptor for all domains of chemistry.☆26Dec 23, 2025Updated 5 months ago
- ☆25Jan 22, 2025Updated last year
- Transformer-based model for chemical reactions☆94Jan 14, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models☆539Jun 17, 2023Updated 3 years ago
- Utilities for working with datasets of chemical reactions, reaction templates and template extraction.☆96Apr 13, 2026Updated 2 months ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆294Oct 28, 2024Updated last year
- Robust representation of semantically constrained graphs, in particular for molecules in chemistry☆850May 17, 2025Updated last year
- Curated list of known efforts in materials informatics, i.e. in modern materials science☆518Mar 24, 2026Updated 2 months ago
- Example implementations of common machine learning projects in chemistry.☆184Feb 17, 2026Updated 4 months ago
- ☆19Aug 4, 2024Updated last year