UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation
☆37Nov 24, 2025Updated 3 months ago
Alternatives and similar repositories for ml-unigen
Users that are interested in ml-unigen are comparing it to the libraries listed below
Sorting:
- ☆21Jan 17, 2025Updated last year
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆23Mar 2, 2026Updated 2 weeks ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- ☆14Jan 4, 2023Updated 3 years ago
- ☆30Jan 15, 2026Updated 2 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆34Oct 15, 2025Updated 5 months ago
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Sep 2, 2024Updated last year
- Efficient Scaling laws and collaborative pretraining.☆21Sep 18, 2025Updated 6 months ago
- ☆32Updated this week
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆27Nov 16, 2025Updated 4 months ago
- ☆10Feb 6, 2025Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- ☆16Sep 4, 2025Updated 6 months ago
- Official implementation of "Describing Sets of Images with Textual-PCA".☆16Feb 13, 2023Updated 3 years ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆45Updated this week
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients☆13Jan 22, 2025Updated last year
- Model for processing text sequences with coreference annotations☆14Nov 29, 2018Updated 7 years ago
- [ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos☆12Mar 29, 2024Updated last year
- ☆14Apr 18, 2020Updated 5 years ago
- The official repository of the first version of ACE-Brain foundation model.☆62Updated this week
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 11 months ago
- ☆12Nov 16, 2020Updated 5 years ago
- ICML2025☆65Aug 28, 2025Updated 6 months ago
- ☆16Sep 6, 2024Updated last year
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated last month
- ☆32Oct 23, 2025Updated 4 months ago
- SAVL: Scene-Adaptive UAV Visual Localization Using Sparse Feature Extraction and Incremental Descriptor Mapping☆14Aug 6, 2025Updated 7 months ago
- ☆19Jun 13, 2024Updated last year
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Jun 10, 2021Updated 4 years ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆20Mar 5, 2026Updated 2 weeks ago
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆75Oct 22, 2025Updated 4 months ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆35Oct 13, 2025Updated 5 months ago
- TREE-G: Decision Trees Contesting Graph Neural Networks, specialized for graph data.☆13Feb 28, 2024Updated 2 years ago
- Official implementation of Categorical Flow Maps on text.☆47Feb 16, 2026Updated last month
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆54Jan 7, 2026Updated 2 months ago
- ☆121Nov 7, 2025Updated 4 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- ☆18Mar 11, 2022Updated 4 years ago
- ☆16Dec 22, 2017Updated 8 years ago