SciMT / SciMT-benchmark
☆11Updated 8 months ago
Related projects: ⓘ
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆13Updated last month
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes☆16Updated 3 weeks ago
- Structured Chemistry Reasoning with Large Language Models☆29Updated 4 months ago
- ☆37Updated 5 months ago
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆23Updated last month
- 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆52Updated 3 weeks ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆34Updated 7 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆20Updated 2 months ago
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules"☆27Updated 6 months ago
- Pre-trained Language Model for Scientific Text☆40Updated 6 months ago
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆21Updated 5 months ago
- ☆24Updated 3 months ago
- Structured Denoising Diffusion Models in Discrete State-Spaces☆13Updated last year
- ☆20Updated 4 months ago
- Official PyTorch implementation for "Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data"☆11Updated last week
- The code for GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning☆49Updated 6 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆36Updated 11 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆11Updated 8 months ago
- [EMNLP 2023] ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction.☆16Updated 7 months ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆13Updated 8 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆89Updated this week
- exploring whether LLMs perform case-based or rule-based reasoning☆20Updated 6 months ago
- Official PyTorch implementation for "Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations"☆27Updated 4 months ago
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…☆19Updated 4 months ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆14Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning☆41Updated 10 months ago
- ☆20Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆59Updated 7 months ago
- Code and data for the benchmark "Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Lan…☆30Updated 2 months ago
- ☆31Updated 11 months ago