☆19Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for multimodal_alignment
Users that are interested in multimodal_alignment are comparing it to the libraries listed below
Sorting:
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- ☆72Jul 30, 2025Updated 7 months ago
- Official Implementation of wd1☆24Sep 25, 2025Updated 5 months ago
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- Official repository for Robust Multimodal Large Language Models Against Modality Conflict☆20Jul 9, 2025Updated 8 months ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆18Jun 4, 2025Updated 9 months ago
- ☆31Aug 21, 2023Updated 2 years ago
- ☆15Mar 20, 2025Updated last year
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆21Jul 21, 2025Updated 8 months ago
- official code for unigame☆19Nov 26, 2025Updated 3 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Oct 15, 2025Updated 5 months ago
- Submission Under Review☆17May 15, 2025Updated 10 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆101May 20, 2025Updated 10 months ago
- Large Language Models Powered Context-aware Motion Prediction☆14Jan 12, 2026Updated 2 months ago
- ☆19Jun 26, 2025Updated 8 months ago
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆19Oct 17, 2025Updated 5 months ago
- [NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think☆254Oct 4, 2025Updated 5 months ago
- ☆22May 3, 2025Updated 10 months ago
- ☆62Jan 20, 2026Updated 2 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 8 months ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- ☆23Sep 19, 2024Updated last year
- ☆17Aug 1, 2025Updated 7 months ago
- ☆16Updated this week
- ☆12Jun 22, 2020Updated 5 years ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models☆99Aug 22, 2024Updated last year
- ☆12Jun 10, 2024Updated last year
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 6 months ago
- The Official PyTorch Implementation of "Brain-like Variational Inference" (NeurIPS 2025 Paper)☆71Feb 9, 2026Updated last month
- Code, models and dataset for ICCV 2023 (oral) paper on differentiable volumetric rasterisation of point clouds for 3D registration☆18May 20, 2024Updated last year
- Computation of binomial confidence intervals that achieve exact coverage.☆14Apr 23, 2025Updated 10 months ago
- [IJCAI 2023 workshop]Expanding dataset for 2D medical image segmentation using diffusion models☆15Feb 28, 2023Updated 3 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated last month
- [ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"☆20Oct 19, 2023Updated 2 years ago
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆22Jan 24, 2026Updated last month
- ☆12Nov 21, 2023Updated 2 years ago
- ☆28Sep 22, 2025Updated 6 months ago