PingchuanMa / PingchuanMa.github.io
Source code for my homepage.
☆12 Updated last month
Alternatives and similar repositories for PingchuanMa.github.io:
Users who are interested in PingchuanMa.github.io are comparing it to the repositories listed below.
- Program synthesis for 3D spatial reasoning ☆24 Updated last month
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)" ☆15 Updated 2 years ago
- Compositional Object Light Fields code ☆26 Updated 2 years ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration". ☆44 Updated 3 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models ☆84 Updated last year
- Official PyTorch implementation of NeuralDiff: Segmenting 3D objects that move in egocentric videos. ☆31 Updated 2 years ago
- Code for paper Background Prompting for Improved Object Depth ☆29 Updated last year
- A paper list that includes world models or generative video models for embodied agents. ☆19 Updated 2 months ago
- HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers ☆38 Updated 10 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆98 Updated 4 months ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024 ☆20 Updated 3 months ago
- [3DV 2022] Articulated 3D Human-Object Interactions from RGB Videos: An Empirical Analysis of Approaches and Challenges ☆17 Updated 2 years ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 Updated 10 months ago
- TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer ☆47 Updated last year
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos ☆27 Updated last month
- Official Reimplementation of Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips (DiffHOI, ICCV23) https://judyye.g… ☆36 Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 Updated 6 months ago
- Web page for "🍅HumanTOMATO: Text-aligned Whole-body Motion Generation". ☆14 Updated 10 months ago
- Unifying Specialized Visual Encoders for Video Language Models ☆16 Updated 2 months ago
- https://coshand.cs.columbia.edu/ ☆16 Updated 5 months ago
- ☆14 Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space" ☆20 Updated last year
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation ☆15 Updated 11 months ago
- [ICCV 2023] Rendering Humans from Object-Occluded Monocular Videos ☆42 Updated last year
- VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ☆14 Updated last week
- ☆32 Updated last week
- [arXiv 2024] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields" ☆17 Updated last month
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556) ☆36 Updated last month
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 Updated 2 years ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View" ☆78 Updated 11 months ago