facebookresearch / sapiensView external linksLinks
High-resolution models for human tasks.
☆5,287Nov 18, 2024Updated last year
Alternatives and similar repositories for sapiens
Users that are interested in sapiens are comparing it to the libraries listed below
Sorting:
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆18,503Dec 25, 2024Updated last year
- Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.☆646May 7, 2025Updated 9 months ago
- Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"☆1,071Nov 16, 2024Updated last year
- Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.☆5,281Apr 21, 2025Updated 9 months ago
- [NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation☆7,597Jan 22, 2025Updated last year
- [NeurIPS 2023] Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"☆1,175Updated this week
- 4DHumans: Reconstructing and Tracking Humans with Transformers☆1,545Feb 7, 2026Updated last week
- Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Ful…☆341Dec 4, 2024Updated last year
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,632Sep 25, 2024Updated last year
- More relighting!☆8,367Feb 20, 2025Updated 11 months ago
- [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation☆8,000Jul 17, 2024Updated last year
- [SIGGRAPH Asia 2024] PuzzleAvatar: Assembling 3D Avatars from Personal Albums☆316Jan 29, 2026Updated 2 weeks ago
- [CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration☆1,197Sep 17, 2024Updated last year
- Bring portraits to life!☆17,817Nov 16, 2025Updated 3 months ago
- CoTracker is a model for tracking any point (pixel) on a video.☆4,827Jan 21, 2025Updated last year
- SMPL-X☆2,474Aug 12, 2024Updated last year
- PyTorch code and models for the DINOv2 self-supervised learning method.☆12,393Dec 22, 2025Updated last month
- Pippo: High-Resolution Multi-View Humans from a Single Image☆631Apr 4, 2025Updated 10 months ago
- "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)☆2,666Dec 12, 2023Updated 2 years ago
- [3DV 2024] Official repo of "TeCH: Text-guided Reconstruction of Lifelike Clothed Humans"☆418Mar 7, 2024Updated last year
- [CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"☆947Feb 11, 2026Updated last week
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,426Nov 4, 2025Updated 3 months ago
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆7,042Mar 18, 2025Updated 10 months ago
- DUSt3R: Geometric 3D Vision Made Easy☆6,959Sep 24, 2025Updated 4 months ago
- [CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"☆776Aug 26, 2024Updated last year
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆3,159Dec 21, 2024Updated last year
- Expressive Body Capture: 3D Hands, Face, and Body from a Single Image☆2,069Feb 23, 2024Updated last year
- [CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussian…☆576Mar 26, 2024Updated last year
- [TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis☆1,509Dec 13, 2025Updated 2 months ago
- [NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation☆416May 22, 2025Updated 8 months ago
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos☆8,086Jan 6, 2026Updated last month
- ☆991Apr 18, 2024Updated last year
- Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024☆1,379Jul 14, 2025Updated 7 months ago
- Official implementation of AnimateDiff.☆12,018Jul 31, 2024Updated last year
- Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"☆20,663Oct 17, 2025Updated 4 months ago
- A unified framework for 3D content generation.☆6,979Dec 16, 2024Updated last year
- [ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds☆2,551Jul 15, 2025Updated 7 months ago
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆17,397Sep 5, 2024Updated last year
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation☆3,084Dec 10, 2025Updated 2 months ago