jina-ai / executor-3d-encoder
An executor that wraps 3D mesh models and encodes 3D content documents into d-dimensional vectors.
☆19 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for executor-3d-encoder
- CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM ☆26 · Updated last week
- Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs ☆30 · Updated 5 months ago
- This repository is a collection of research papers on World Models. ☆36 · Updated last year
- Improving 3D Large Language Model via Robust Instruction Tuning ☆42 · Updated last month
- Using Segment-Anything and CLIP to generate pixel-aligned semantic features. ☆35 · Updated last year
- [CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images" ☆75 · Updated 10 months ago
- This repo contains the code and data for "MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks" ☆43 · Updated last week
- A project for computing high-quality ground truth training examples for RGB-D data. ☆43 · Updated last year
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation ☆13 · Updated 9 months ago
- Download scripts and tools for Replay dataset. ☆30 · Updated last year
- ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model ☆92 · Updated 11 months ago
- Codebase for the Recognize Anything Model (RAM) ☆64 · Updated 11 months ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction ☆24 · Updated 6 months ago
- A benchmark dataset for evaluating LLMs' SVG editing capabilities ☆17 · Updated last month
- Code release for the CVPR'23 paper titled "PartDistillation: Learning Parts from Instance Segmentation" ☆59 · Updated 11 months ago
- ☆34 · Updated last year
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆57 · Updated last year
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes" ☆51 · Updated 7 months ago
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work… ☆23 · Updated 4 months ago
- [ACL2023 Area Chair Award] Official repo for the paper "Tell2Design: A Dataset for Language-Guided Floor Plan Generation". ☆55 · Updated last year
- Implementation of ECCV'2022: Pose2Room: Understanding 3D Scenes from Human Activities ☆86 · Updated 11 months ago
- Utilizing segment-anything to help the region selection of 3D point cloud or mesh. ☆43 · Updated last year
- [MM 2024] [Need only a 3090] MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors ☆73 · Updated 2 months ago
- ☆22 · Updated 6 months ago
- ☆25 · Updated last year
- ☆92 · Updated last year
- ☆27 · Updated last year
- Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN ☆83 · Updated last year
- ☆15 · Updated last month
- ☆19 · Updated 5 months ago