HaixinShi / fmov_pose
This is the official repo for the implementation of Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera.
☆10Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for fmov_pose
- EdgeSAM model for use with Autodistill.☆25Updated 5 months ago
- ☆22Updated 3 weeks ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆32Updated last month
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆90Updated 3 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated last week
- ☆34Updated 9 months ago
- ☆29Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆59Updated 2 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- ☆28Updated 9 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆65Updated 11 months ago
- Detect corn stalks for micro-sensor insertion☆13Updated 8 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation☆44Updated last month
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆77Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆65Updated 5 months ago
- ☆44Updated 6 months ago
- ☆29Updated 11 months ago
- ☆48Updated 2 months ago
- ☆14Updated 8 months ago
- ☆32Updated 3 months ago
- Repo for event-based binary image reconstruction.☆30Updated 7 months ago
- ☆30Updated 10 months ago
- an optimized, production-ready implementation of active speaker detection☆52Updated 5 months ago
- ☆14Updated 6 months ago
- robotic arm hardware beta release of RX2 humanoid☆15Updated last month
- ☆30Updated 9 months ago
- Implementation of Zero-Shot Video Semantic Segmentation☆35Updated 3 months ago
- GPT-4V(ision) module for use with Autodistill.☆25Updated 3 months ago
- Visual RAG using less than 300 lines of code.☆23Updated 8 months ago