jina-ai / executor-3d-encoderLinks
An executor that wraps 3D mesh models and encodes 3D content documents to d-dimension vector.
☆19Updated 3 years ago
Alternatives and similar repositories for executor-3d-encoder
Users that are interested in executor-3d-encoder are comparing it to the libraries listed below
Sorting:
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆61Updated 2 years ago
- [CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs☆52Updated last year
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆56Updated last year
- [ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning☆66Updated 3 months ago
- Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"☆217Updated 2 years ago
- [CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"☆84Updated 2 years ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction☆37Updated last year
- ☆33Updated 2 years ago
- This repository is a collection of research papers on World Models.☆43Updated 2 years ago
- a thin wrapper of chatgpt for improving paper writing.☆254Updated 2 years ago
- A project for computing high-quality ground truth training examples for RGB-D data.☆48Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆60Updated 2 years ago
- [ICML 2023] Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining☆151Updated last year
- Vision-oriented multimodal AI☆49Updated last year
- [ICRA 2024] Chat with NeRF enables users to interact with a NeRF model by typing in natural language.☆320Updated 3 months ago
- Download scripts and tools for Replay dataset.☆36Updated 2 years ago
- A paper list of world model☆28Updated 9 months ago
- Official code release of "CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition"☆240Updated 2 years ago
- [ACL2023 Area Chair Award] Official repo for the paper "Tell2Design: A Dataset for Language-Guided Floor Plan Generation".☆78Updated 10 months ago
- ☆15Updated 3 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Updated 2 years ago
- Code for the Ask4Help project☆22Updated 3 years ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Updated last year
- Quick check of compatible versions of PyTorch, Python, CUDA, cuDNN, NVIDIA driver! 实现 PyTorch, Python, CUDA, cuDNN, NVIDIA driver 兼容版本速查!☆35Updated last year
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆19Updated last year
- A high-fidelity, general-purpose platform for embodied agent training and testing.☆159Updated last week
- Code for recreating the HoS benchmark of VISOR☆22Updated 2 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Updated 2 years ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆154Updated 2 years ago