High-resolution models for human tasks.
☆5,296 · Nov 18, 2024 · Updated last year
Alternatives and similar repositories for sapiens
Users who are interested in sapiens are comparing it to the libraries listed below.
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆18,560 · Dec 25, 2024 · Updated last year
- Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024. ☆645 · May 7, 2025 · Updated 9 months ago
- Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling" ☆1,072 · Nov 16, 2024 · Updated last year
- Depth Pro: Sharp Monocular Metric Depth in Less Than a Second. ☆5,313 · Apr 21, 2025 · Updated 10 months ago
- [NeurIPS 2023] Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation" ☆1,178 · Feb 12, 2026 · Updated 3 weeks ago
- [NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation ☆7,641 · Jan 22, 2025 · Updated last year
- 4DHumans: Reconstructing and Tracking Humans with Transformers ☆1,547 · Feb 7, 2026 · Updated 3 weeks ago
- Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Ful… ☆343 · Dec 4, 2024 · Updated last year
- More relighting! ☆8,375 · Feb 20, 2025 · Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA ☆1,633 · Sep 25, 2024 · Updated last year
- [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation ☆8,011 · Jul 17, 2024 · Updated last year
- [SIGGRAPH Asia 2024] PuzzleAvatar: Assembling 3D Avatars from Personal Albums ☆317 · Jan 29, 2026 · Updated last month
- [CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration ☆1,198 · Sep 17, 2024 · Updated last year
- Bring portraits to life! ☆17,873 · Nov 16, 2025 · Updated 3 months ago
- CoTracker is a model for tracking any point (pixel) on a video. ☆4,840 · Jan 21, 2025 · Updated last year
- SMPL-X ☆2,493 · Aug 12, 2024 · Updated last year
- Official inference repo for FLUX.1 models ☆25,246 · Jul 31, 2025 · Updated 7 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆12,427 · Feb 24, 2026 · Updated last week
- "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop) ☆2,671 · Dec 12, 2023 · Updated 2 years ago
- Pippo: High-Resolution Multi-View Humans from a Single Image ☆632 · Apr 4, 2025 · Updated 11 months ago
- [3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans" ☆418 · Mar 7, 2024 · Updated last year
- [CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians" ☆951 · Feb 11, 2026 · Updated 3 weeks ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆12,477 · Nov 4, 2025 · Updated 4 months ago
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory" ☆7,043 · Mar 18, 2025 · Updated 11 months ago
- DUSt3R: Geometric 3D Vision Made Easy ☆6,975 · Sep 24, 2025 · Updated 5 months ago
- [CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer" ☆777 · Aug 26, 2024 · Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆3,162 · Dec 21, 2024 · Updated last year
- Expressive Body Capture: 3D Hands, Face, and Body from a Single Image ☆2,086 · Feb 23, 2024 · Updated 2 years ago
- [CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussian… ☆576 · Mar 26, 2024 · Updated last year
- [TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis ☆1,518 · Dec 13, 2025 · Updated 2 months ago
- [NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation ☆422 · May 22, 2025 · Updated 9 months ago
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos ☆8,082 · Jan 6, 2026 · Updated 2 months ago
- ☆1,003 · Apr 18, 2024 · Updated last year
- Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", SIGGRAPH Asia 2024 ☆1,407 · Jul 14, 2025 · Updated 7 months ago
- Official implementation of AnimateDiff. ☆12,038 · Jul 31, 2024 · Updated last year
- [ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds ☆2,556 · Jul 15, 2025 · Updated 7 months ago
- Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering" ☆20,827 · Oct 17, 2025 · Updated 4 months ago
- A unified framework for 3D content generation. ☆6,979 · Dec 16, 2024 · Updated last year
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … ☆17,431 · Sep 5, 2024 · Updated last year