duoan / mega-data-factory
Mega Scale Multimodal Data Pipeline for SOTA models
☆38 Updated this week
Alternatives and similar repositories for mega-data-factory
Users that are interested in mega-data-factory are comparing it to the libraries listed below
- [NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy ☆73 Updated last year
- Official repository of DARE: dLLM Alignment and Reinforcement Executor ☆159 Updated this week
- Explain Before You Answer: A Survey on Compositional Visual Reasoning ☆306 Updated 3 months ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul… ☆101 Updated 6 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better ☆186 Updated this week
- 🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat… ☆75 Updated last year
- Official repository of MMGenBench ☆120 Updated 10 months ago
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale. ☆706 Updated 2 weeks ago
- ☆321 Updated 3 months ago
- **Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos. ☆346 Updated 3 months ago
- [AAAI 2024] Official code for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation