kjappelbaum / awesome-chemistry-datasetsView external linksLinks
overview of datasets for ML in chemistry
☆384Oct 22, 2025Updated 3 months ago
Alternatives and similar repositories for awesome-chemistry-datasets
Users that are interested in awesome-chemistry-datasets are comparing it to the libraries listed below
Sorting:
- ChemNLP project☆171Feb 9, 2026Updated last week
- ☆258May 17, 2024Updated last year
- How good are LLMs at chemistry?☆132Jan 26, 2026Updated 3 weeks ago
- EPFL CH-457 "AI for chemistry"☆241Apr 30, 2025Updated 9 months ago
- molfeat - the hub for all your molecular featurizers☆221May 27, 2025Updated 8 months ago
- Official Code for What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks (In NeurIPS 2023)☆169Jul 26, 2024Updated last year
- Curated list of known efforts in collecting and/or curating of chemical/materials data☆23Dec 8, 2020Updated 5 years ago
- A curated list of Cheminformatics libraries and software.☆833Mar 15, 2024Updated last year
- A curated list of resources for machine learning for small-molecule drug discovery☆234Nov 25, 2023Updated 2 years ago
- DeepMol: A Machine and Deep Learning Framework for Computational Chemistry☆169Oct 10, 2025Updated 4 months ago
- A curated list of Python packages related to chemistry☆1,350Sep 21, 2025Updated 4 months ago
- Repository of Jupyter Notebooks on Colab, Binder and Huggingface for Bio, Chemistry and Physics☆13Jul 29, 2023Updated 2 years ago
- Descriptor computation(chemistry) and (optional) storage for machine learning☆275Oct 26, 2024Updated last year
- a curated list of resources for everyone interested in learning about digital chemistry☆38Jan 25, 2026Updated 3 weeks ago
- Molecular Processing Made Easy.☆528Jun 10, 2024Updated last year
- Message Passing Neural Networks for Molecule Property Prediction☆2,262Feb 3, 2026Updated 2 weeks ago
- A Sequence Generation Model for Reaction Diagram Parsing☆102Sep 18, 2023Updated 2 years ago
- A quick tutorial for modern materials science, should the reader be not familiar with it and just wishing to crack the data☆16Aug 24, 2025Updated 5 months ago
- Extracting medicinal chemistry intuition via preference machine learning☆115Oct 31, 2023Updated 2 years ago
- Synthesis generative model☆48Apr 24, 2025Updated 9 months ago
- Official data repository for the Open Reaction Database☆320Jul 30, 2025Updated 6 months ago
- Extract structure-functions from data using XAI and LLMs☆27Jan 20, 2025Updated last year
- pythonic interface to virtual screening software☆92Sep 4, 2025Updated 5 months ago
- RXNMapper: Unsupervised attention-guided atom-mapping. Code complementing our Science Advances publication on "Extraction of organic chem…☆354Updated this week
- [IJCAI 2023 survey track]A curated list of resources for chemical pre-trained models☆536Jun 17, 2023Updated 2 years ago
- ☆19Aug 4, 2024Updated last year
- ☆25Jan 22, 2025Updated last year
- A molecular identifier and descriptor for all domains of chemistry.☆25Dec 23, 2025Updated last month
- open data sets for machine learning pertaining to porous materials☆27Nov 28, 2023Updated 2 years ago
- add-on to plotly which show molecule images on mouseover!☆259Apr 10, 2024Updated last year
- Example implementations of common machine learning projects in chemistry.☆186Sep 1, 2024Updated last year
- Practical Cheminformatics Tutorials☆1,180Updated this week
- ☆11Jan 5, 2022Updated 4 years ago
- Extraction of action sequences from experimental procedures☆43Oct 13, 2023Updated 2 years ago
- Simple Python interface to OPSIN: Open Parser for Systematic IUPAC nomenclature☆71Jan 22, 2026Updated 3 weeks ago
- A python package for chemical space visualization.☆150Dec 17, 2024Updated last year
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆292Oct 28, 2024Updated last year
- Package for Retrosynthetic Planning☆186Jan 30, 2026Updated 2 weeks ago
- Python wrapper for the PubChem PUG REST API.☆491Sep 8, 2025Updated 5 months ago