overview of datasets for ML in chemistry
☆395Oct 22, 2025Updated 5 months ago
Alternatives and similar repositories for awesome-chemistry-datasets
Users that are interested in awesome-chemistry-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ChemNLP project☆172Updated this week
- ☆258May 17, 2024Updated last year
- How good are LLMs at chemistry?☆135Jan 26, 2026Updated 2 months ago
- EPFL CH-457 "AI for chemistry"☆255Mar 16, 2026Updated last week
- molfeat - the hub for all your molecular featurizers☆223May 27, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆171Jul 26, 2024Updated last year
- Curated list of known efforts in collecting and/or curating of chemical/materials data☆23Dec 8, 2020Updated 5 years ago
- a curated list of resources for everyone interested in learning about digital chemistry☆41Jan 25, 2026Updated 2 months ago
- Repository of Jupyter Notebooks on Colab, Binder and Huggingface for Bio, Chemistry and Physics☆13Jul 29, 2023Updated 2 years ago
- DeepMol: A Machine and Deep Learning Framework for Computational Chemistry☆171Updated this week
- A curated list of Python packages related to chemistry☆1,370Sep 21, 2025Updated 6 months ago
- Descriptor computation(chemistry) and (optional) storage for machine learning☆277Oct 26, 2024Updated last year
- A curated list of resources for machine learning for small-molecule drug discovery☆240Nov 25, 2023Updated 2 years ago
- ☆24Nov 24, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A curated list of Cheminformatics libraries and software.☆846Mar 15, 2024Updated 2 years ago
- Molecular Processing Made Easy.☆532Jun 10, 2024Updated last year
- pythonic interface to virtual screening software☆92Sep 4, 2025Updated 6 months ago
- Molecular Set Representation Learning☆51Jul 16, 2025Updated 8 months ago
- Extract structure-functions from data using XAI and LLMs☆27Jan 20, 2025Updated last year
- ☆11Jan 5, 2022Updated 4 years ago
- A quick tutorial for modern materials science, should the reader be not familiar with it and just wishing to crack the data☆16Aug 24, 2025Updated 7 months ago
- Message Passing Neural Networks for Molecule Property Prediction☆2,302Mar 17, 2026Updated last week
- Synthesis generative model☆48Apr 24, 2025Updated 11 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official data repository for the Open Reaction Database☆328Updated this week
- RXNMapper: Unsupervised attention-guided atom-mapping. Code complementing our Science Advances publication on "Extraction of organic chem…☆362Feb 13, 2026Updated last month
- Extracting medicinal chemistry intuition via preference machine learning☆117Oct 31, 2023Updated 2 years ago
- Package for Retrosynthetic Planning☆190Updated this week
- Practical Cheminformatics Tutorials☆1,205Mar 22, 2026Updated last week
- A Sequence Generation Model for Reaction Diagram Parsing☆109Sep 18, 2023Updated 2 years ago
- A molecular identifier and descriptor for all domains of chemistry.☆26Dec 23, 2025Updated 3 months ago
- ☆25Jan 22, 2025Updated last year
- Transformer-based model for chemical reactions☆94Jan 14, 2026Updated 2 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models☆537Jun 17, 2023Updated 2 years ago
- Utilities for working with datasets of chemical reactions, reaction templates and template extraction.☆90Feb 9, 2026Updated last month
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆294Oct 28, 2024Updated last year
- Robust representation of semantically constrained graphs, in particular for molecules in chemistry☆837May 17, 2025Updated 10 months ago
- Curated list of known efforts in materials informatics, i.e. in modern materials science☆498Updated this week
- Example implementations of common machine learning projects in chemistry.☆185Feb 17, 2026Updated last month
- ☆19Aug 4, 2024Updated last year