Video dataset dedicated to portrait-mode video recognition.
☆58Oct 13, 2025Updated 6 months ago
Alternatives and similar repositories for Portrait-Mode-Video
Users that are interested in Portrait-Mode-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆40Jun 4, 2025Updated 10 months ago
- Official implementation of MTM☆21Aug 30, 2023Updated 2 years ago
- ☆12Sep 11, 2021Updated 4 years ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 7 months ago
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆57Feb 15, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆39Feb 1, 2026Updated 2 months ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- Offical Code for Paper "Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation"☆17Jan 19, 2022Updated 4 years ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆140Jul 28, 2025Updated 8 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- [ICLR 2023] Towards Smooth Video Composition☆85Jun 12, 2023Updated 2 years ago
- [IJCAI 2024] Official implementation of the paper "Integrating View Conditions for Image Synthesis"☆25Aug 27, 2024Updated last year
- code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"☆23Mar 23, 2021Updated 5 years ago
- The official implementation for the paper 'mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval'.☆11Aug 23, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2024 Highlight] ImageNet-D☆47Oct 15, 2024Updated last year
- NSAS code for CVPR review☆27Jun 2, 2021Updated 4 years ago
- 探索智能零售领域的图像识别方案,从而让机器更精准地识别商品,通过更快捷地购物带来全新的用户体验。☆12Jun 15, 2021Updated 4 years ago
- Accepted by AAAI2022☆21Apr 10, 2022Updated 4 years ago
- Official code of *Towards Event-oriented Long Video Understanding*☆12Jul 26, 2024Updated last year
- 本项目基于Wechaty开源微信SDK,融合PaddleClas、PaddleGan、PaddleHub等多个飞桨开发工具,集成【Mural_Gan】、垃圾分类等,打造微信个人专属生活小助手。☆13Aug 23, 2022Updated 3 years ago
- 🔥🔥MLVU: Multi-task Long Video Understanding Benchmark☆246Aug 21, 2025Updated 7 months ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆29May 2, 2025Updated 11 months ago
- ☆11Jul 26, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Feb 2, 2025Updated last year
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆512Sep 2, 2024Updated last year
- ☆112Jan 8, 2025Updated last year
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Apr 5, 2017Updated 9 years ago
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆11Jun 10, 2025Updated 10 months ago
- Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training☆11Jan 23, 2024Updated 2 years ago
- Scalable group inference for generating high quality and diverse images with diffusion models.☆42Aug 31, 2025Updated 7 months ago
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆22Mar 23, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICCV 2023] One-Shot Generative Domain Adaptation☆56Dec 23, 2021Updated 4 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 8 months ago
- ☆28Feb 11, 2025Updated last year
- ☆16Jul 29, 2025Updated 8 months ago
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 3 months ago
- ☆38Jan 25, 2024Updated 2 years ago
- ☆17Feb 19, 2024Updated 2 years ago