IDEA-XL / PRESTO
PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes
☆16 · Updated 3 weeks ago
Related projects:
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024 ☆20 · Updated 2 months ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling ☆78 · Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆36 · Updated 11 months ago
- Codes for Merging Large Language Models ☆16 · Updated last month
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023. ☆32 · Updated last year
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA ☆21 · Updated 5 months ago
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models ☆64 · Updated 10 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio… ☆16 · Updated 3 months ago
- ☆37 · Updated 5 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" ☆38 · Updated last year
- ☆11 · Updated 8 months ago
- Official PyTorch implementation for "Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations" ☆27 · Updated 4 months ago
- ☆15 · Updated 3 months ago
- ☆11 · Updated 2 months ago
- A list of diffusion papers in the NLP domain that I have read, mainly on text generation; the table will continue to be updated. ☆23 · Updated last month
- [ACL 2023] Delving into the Openness of CLIP ☆22 · Updated last year
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ☆42 · Updated last year
- [ICML 2024] Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation ☆11 · Updated 2 weeks ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts" ☆20 · Updated last year
- ☆16 · Updated last week
- Code and data for the benchmark "Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Lan… ☆30 · Updated 2 months ago
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022) ☆37 · Updated 2 years ago
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w… ☆11 · Updated 4 months ago
- ☆14 · Updated 2 months ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation" ☆22 · Updated last year
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B ☆20 · Updated 6 months ago
- A simple Diffusion LM approach. ☆20 · Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆51 · Updated 3 months ago
- NeurIPS'22 Oral: EquiVSet - Learning Neural Set Functions Under the Optimal Subset Oracle ☆18 · Updated last year
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules" ☆27 · Updated 6 months ago