IDEA-XL / PRESTO
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes
☆19Updated this week
Related projects
Alternatives and complementary repositories for PRESTO
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆36Updated last year
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆21Updated 4 months ago
- Pre-trained Language Model for Scientific Text☆42Updated 9 months ago
- Codes for Merging Large Language Models☆25Updated 3 months ago
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery☆41Updated 3 weeks ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆16Updated 5 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆63Updated 9 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆36Updated last week
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆21Updated 7 months ago
- The code for GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning☆52Updated 9 months ago
- Automatically fetches diffusion NLP papers from arXiv. More paper information can be found in another repository, "Diffusion_NLP_Papers".☆65Updated this week
- Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation [NAACL 2024]☆92Updated last year
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆44Updated last year
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Updated 9 months ago
- Repository for Text2Mol: Cross-Modal Molecular Retrieval with Natural Language Queries☆37Updated 9 months ago
- Structured Chemistry Reasoning with Large Language Models☆31Updated 6 months ago
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View☆29Updated last month
- The official codebase for "Empowering Diffusion Models on the Embedding Space for Text Generation" (NAACL 2024)☆52Updated 6 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆32Updated last year
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆79Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆69Updated last month
- Reparameterized Discrete Diffusion Models for Text Generation☆90Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)"☆35Updated 7 months ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated last year