Video dataset dedicated to portrait-mode video recognition.
☆58Oct 13, 2025Updated 8 months ago
Alternatives and similar repositories for Portrait-Mode-Video
Users that are interested in Portrait-Mode-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆40Jun 4, 2025Updated last year
- ☆12Sep 11, 2021Updated 4 years ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 9 months ago
- ☆14Nov 22, 2022Updated 3 years ago
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆56Feb 15, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25]☆40Feb 1, 2026Updated 4 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 7 months ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- Offical Code for Paper "Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation"☆17Jan 19, 2022Updated 4 years ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆140Jul 28, 2025Updated 10 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆42Feb 12, 2025Updated last year
- [IJCAI 2024] Official implementation of the paper "Integrating View Conditions for Image Synthesis"☆25Aug 27, 2024Updated last year
- code of the paper "Vision-Language Navigation with Multi-granularity Observation and Auxiliary Reasoning Tasks"☆23Mar 23, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation for the paper 'mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval'.☆11Aug 23, 2022Updated 3 years ago
- [CVPR 2024 Highlight] ImageNet-D☆47Oct 15, 2024Updated last year
- 探索智能零售领域的图像识别方案,从而让机器更精准地识别商品,通过更快捷地购物带来全新的用户体验。☆12Jun 15, 2021Updated 5 years ago
- Official code of *Towards Event-oriented Long Video Understanding*☆12Jul 26, 2024Updated last year
- retouching ptoto,remove moles/buffing/face-lift☆15Aug 12, 2021Updated 4 years ago
- 🔥🔥MLVU: Multi-task Long Video Understanding Benchmark☆262Apr 13, 2026Updated 2 months ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆29May 2, 2025Updated last year
- ☆11Jul 26, 2024Updated last year
- ☆13Feb 2, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Edge-Aware Mirror Network for Camouflaged Object Detection (EAMNet, IEEE ICME 2023).☆13Jul 8, 2023Updated 2 years ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆16Apr 23, 2025Updated last year
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- gradio bbox labeling tools☆11May 12, 2023Updated 3 years ago
- ☆115Jan 8, 2025Updated last year
- PyTorch re-implementation of Hierarchical Normalization for Robust Monocular Depth Estimation☆23Dec 8, 2022Updated 3 years ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆524Sep 2, 2024Updated last year
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Apr 5, 2017Updated 9 years ago
- ☆16Apr 7, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Unofficial implementation of ResNet3D and CSN (Channel-Separated Convolutional Networks) from Video Classification with Channel-Separated…☆18Apr 25, 2020Updated 6 years ago
- [AAAI'25 Oral] NightReID: A Large-Scale Nighttime Person Re-Identification Benchmark☆11Jun 10, 2025Updated last year
- Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training☆11Jan 23, 2024Updated 2 years ago
- Generate Camouflage Images by Pytorch☆14Jun 29, 2023Updated 2 years ago
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆23Mar 23, 2025Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆27Feb 11, 2025Updated last year