UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation
☆39Nov 24, 2025Updated 5 months ago
Alternatives and similar repositories for ml-unigen
Users who are interested in ml-unigen are comparing it to the libraries listed below.
- ☆21Jan 17, 2025Updated last year
- ☆26Dec 26, 2024Updated last year
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆26Apr 4, 2026Updated 3 weeks ago
- ICCV 2021: Deep Co-Training with Task Decomposition for Semi-supervised Domain Adaptation☆17Dec 8, 2022Updated 3 years ago
- Official repository Flash Local Linear Attention☆23Updated this week
- ☆22Dec 8, 2022Updated 3 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 6 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆37Oct 15, 2025Updated 6 months ago
- Official implementation of "Describing Sets of Images with Textual-PCA".☆16Feb 13, 2023Updated 3 years ago
- ☆16Sep 4, 2025Updated 7 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 6 months ago
- ☆14Oct 7, 2023Updated 2 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning☆52Mar 2, 2026Updated last month
- Model for processing text sequences with coreference annotations☆14Nov 29, 2018Updated 7 years ago
- ☆14Apr 18, 2020Updated 6 years ago
- ☆35Mar 13, 2026Updated last month
- [ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos☆12Mar 29, 2024Updated 2 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- ☆18Aug 23, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆12Apr 18, 2025Updated last year
- ICML2025☆64Aug 28, 2025Updated 8 months ago
- ☆16Sep 6, 2024Updated last year
- Better coding experience for Flask☆16Oct 21, 2025Updated 6 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated 2 months ago
- ☆33Oct 23, 2025Updated 6 months ago
- ☆11Feb 14, 2025Updated last year
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆37Feb 20, 2026Updated 2 months ago
- SAVL: Scene-Adaptive UAV Visual Localization Using Sparse Feature Extraction and Incremental Descriptor Mapping☆14Updated this week
- DSTC8-AVSD: Sentence generation task for Audio Visual Scene-aware Dialog☆14Jun 10, 2021Updated 4 years ago
- arXiv translation fixer!☆22Nov 13, 2024Updated last year
- ☆29Sep 2, 2025Updated 7 months ago
- Training Transformers with knowledge localization (SGTM)☆51Jan 11, 2026Updated 3 months ago
- [CVPR 2026 (Findings) 🔥🔥] Self Evolving Large Multimodal Models with Continuous Rewards☆22Mar 5, 2026Updated last month
- [ACM MM 2024] ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack☆14Dec 20, 2024Updated last year
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 3 years ago
- TREE-G: Decision Trees Contesting Graph Neural Networks, specialized for graph data.☆13Feb 28, 2024Updated 2 years ago
- ☆18Mar 11, 2022Updated 4 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning☆22Jun 26, 2025Updated 10 months ago