IDEA-XL / RAPM
Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"
☆18Updated 3 months ago
Alternatives and similar repositories for RAPM
Users that are interested in RAPM are comparing it to the libraries listed below
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆44Updated 3 months ago
- 【Nature Computational Science 2025🔥】Deep peak property learning for efficient chiral molecules ECD spectra prediction☆50Updated last year
- ☆37Updated 7 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆38Updated last year
- [ACMMM 2025 - Dataset Track] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Updated 7 months ago
- ☆37Updated 8 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆182Updated 2 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Updated last year
- The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"☆74Updated last year
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin".☆35Updated 6 months ago
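- A learning roadmap for newcomers to the multimodal field, covering the field's classic papers, projects, and courses. It aims to help learners gain a reasonably deep understanding of the field within a set period of time and become able to carry out independent research.☆45Updated last year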
- ☆14Updated 9 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Updated last year
- iFSQ & LlamaGen-REPA☆71Updated this week
- [AAAI'25 Oral] "RFL: Simplifying Chemical Structure Recognition with Ring-Free Language".☆20Updated 7 months ago
- Papers and codes collection for customized, personalized and editable generative models☆29Updated last year
- The code repository of Adv-GRPO☆67Updated last month
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆59Updated 2 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆235Updated 5 months ago
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Updated 11 months ago
- Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation☆23Updated 4 months ago
- [CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…☆104Updated 9 months ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆83Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆347Updated 3 weeks ago
- [EMNLP 2025 Main] Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆57Updated this week
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆49Updated 10 months ago
- Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024☆92Updated last year
- A collection of awesome autoregressive visual generation models☆79Updated 9 months ago
- ☆41Updated 10 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆46Updated 3 months ago