TencentAI4S / CD-GPT
A Biological Foundation Model Bridging the Gap between Molecular Sequences Through Central Dogma
☆13 Updated 2 weeks ago
Related projects:
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp… ☆46 Updated 3 weeks ago
- Code for "LangCell: Language-Cell Pre-training for Cell Identity Understanding". ☆36 Updated 3 months ago
- ☆129 Updated last month
- ☆197 Updated last month
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M… ☆140 Updated 2 weeks ago
- ☆13 Updated 3 months ago
- ☆30 Updated last year
- ☆230 Updated last month
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts ☆83 Updated 11 months ago
- Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells. ☆61 Updated 5 months ago
- Scientific Large Language Models: A Survey on Biological & Chemical Domains ☆211 Updated last month
- ☆18 Updated this week
- ☆25 Updated 7 months ago
- ☆267 Updated 9 months ago
- [ICLR'24 spotlight] Saprot: Protein Language Model with Structural Alphabet ☆318 Updated this week
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models ☆231 Updated 4 months ago
- Contextualizing protein representations using deep learning on protein networks and single-cell data ☆67 Updated last week
- [ICLR 2022] OntoProtein: Protein Pretraining With Gene Ontology Embedding ☆141 Updated last year
- Generative Pretraining from Transcriptomes ☆11 Updated last year
- ☆25 Updated 9 months ago
- A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on. ☆108 Updated this week
- GEARS is a geometric deep learning model that predicts outcomes of novel multi-gene perturbations ☆192 Updated last month
- RNA foundation model ☆189 Updated 5 months ago
- [NeurIPS 2023] Official codes of "MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data" ☆22 Updated 3 months ago
- Cell2Sentence turns scRNA-seq data into text for LLM training. ☆82 Updated 2 weeks ago
- ☆44 Updated 7 months ago
- GPT language model for DNA sequence ☆15 Updated last year
- Codes for paper: Evaluating the Utilities of Large Language Models in Single-cell Data Analysis. ☆45 Updated 3 weeks ago
- ☆76 Updated this week
- ☆166 Updated 6 months ago