EtaYang10th / Open-M3-BenchLinks
☆68Updated last week
Alternatives and similar repositories for Open-M3-Bench
Users that are interested in Open-M3-Bench are comparing it to the libraries listed below
Sorting:
- Efficient controlnet for DiTs☆382Updated 8 months ago
- MTLA: Multi-head Temporal Latent Attention☆760Updated 3 months ago
- [NeurIPS2025 spotlight★] Official implementation for "RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Eff…☆219Updated last month
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆574Updated 5 months ago
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,104Updated 2 months ago
- We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFM…☆311Updated 2 months ago
- a multiscale multimodal large language models for radiology report generation (RRG) tasks☆273Updated 2 weeks ago
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy☆186Updated last week
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆356Updated last month
- ☆73Updated 2 months ago
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆566Updated 4 months ago
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆167Updated last week
- This is the project for the paper of "Boosting Image Restoration via Priors from Pre-trained Models" in CVPR2024☆95Updated 7 months ago
- ☆386Updated 6 months ago
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆559Updated this week
- 3D generation made easy!☆436Updated 2 months ago
- Fat-Cat: A document-centric context management Agent. Making context as simple as reading chat history.☆515Updated 2 weeks ago
- ☆839Updated 6 months ago
- ☆517Updated 11 months ago
- This is the project for the paper of "Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition" in IJCAI2025☆86Updated 6 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆860Updated last month
- ☆41Updated 5 months ago
- ☆61Updated 5 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆534Updated last month
- ☆333Updated 3 months ago
- ☆138Updated 7 months ago
- 超能文献|AI驱动的文档翻译与学术搜索服务。支持PDF、DOCX、PPTX等多格式文档的高质量翻译(支持11种语言),特别优化了数学公式翻译。同时提供PubMed学术文献智能搜索功能。更多访问:https://suppr.wilddata.cn☆246Updated 3 months ago
- The Collapse of Patches☆58Updated 2 months ago
- Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection☆1,004Updated 10 months ago
- The Python implementation of some deep text hashing (also called deep semantic hashing) Models☆80Updated 2 months ago