SteshinSS / lohi_splitter
Lo-Hi Splitter for Modern Splits of Molecular Datasets
☆53 Updated 5 months ago
Alternatives and similar repositories for lohi_splitter:
Users who are interested in lohi_splitter are comparing it to the libraries listed below.
- MULAN: Multimodal Protein Language Model for Sequence and Structure Encoding ☆17 Updated 5 months ago
- Lo-Hi: Practical ML Drug Discovery Benchmark paper ☆11 Updated last year
- Spatial Epitope Modeling with Artificial intelligence (SEMA) ☆36 Updated last year
- nablaDFT: Large-Scale Conformational Energy and Hamiltonian Prediction benchmark and dataset ☆201 Updated 2 weeks ago
- ☆42 Updated 9 months ago
- A foundational package for molecular predictive modelling ☆93 Updated 5 months ago
- ☆29 Updated last year
- DataSAIL is a tool to split datasets while reducing information leakage. ☆21 Updated last week
- Recursion's molecular foundation model ☆41 Updated 5 months ago
- Machine Learning dataset splitting for life sciences. ☆27 Updated 9 months ago
- SELFormer: Molecular Representation Learning via SELFIES Language Models ☆89 Updated 4 months ago
- pyPept: a python library to generate atomistic 2D and 3D representations of peptides ☆68 Updated 7 months ago
- Materials for my presentation on molecular standardization as part of the RSC OpenScience workshop series ☆47 Updated 3 years ago
- Foster the development of impactful AI models in drug discovery. ☆119 Updated this week
- MaSIF-neosurf: surface-based protein design for ternary complexes. ☆108 Updated last week
- An unofficial re-implementation of AntiBERTy, an antibody-specific protein language model, in PyTorch. ☆24 Updated last year
- Awesome list of the data and AI/ML related projects with direct Life Science Companies participation ☆34 Updated 6 months ago
- ☆84 Updated last year
- Python for chemoinformatics ☆110 Updated 4 years ago
- Learning to design protein-protein interactions with enhanced generalization (ICLR 2024) ☆45 Updated 2 months ago
- 🔥 PyTorch implementation of GNINA scoring function for molecular docking ☆61 Updated 3 weeks ago
- pre-training BERT with molecular data ☆42 Updated 3 years ago
- Dataset and package for working with protein-protein interactions in 3D ☆90 Updated last month
- ☆84 Updated 5 months ago
- PINDER: The Protein INteraction Dataset and Evaluation Resource ☆110 Updated 4 months ago
- Multi-domain Distribution Learning for De Novo Drug Design ☆58 Updated 2 weeks ago
- Practical Cheminformatics Blog Posts ☆59 Updated last week
- Official implementation of "Equivariant Shape-Conditioned Generation of 3D Molecules for Ligand-Based Drug Design" ☆62 Updated 3 months ago
- Community-Maintained Version of mordred ☆67 Updated this week
- ☆48 Updated last year