VisionDreamer: High-Fidelity Text-to-3D Generation via Mesh-Guided 3D Gaussian Splatting
☆16Jul 7, 2025Updated 8 months ago
Alternatives and similar repositories for VisionDreamer
Users that are interested in VisionDreamer are comparing it to the libraries listed below.
- ☆12Oct 10, 2024Updated last year
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆21Nov 2, 2023Updated 2 years ago
- This repo is about multi-label medical image classification based on CNN and Transformer.☆33Sep 21, 2025Updated 6 months ago
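- A second-hand trading website built with Bootstrap+SpringMVC+Spring+Mybatis+MySQL☆21Mar 13, 2018Updated 8 years ago
- A cloud music website implemented with the Spring MVC framework☆29Jan 23, 2019Updated 7 years ago
- While reviewing an expert's PDFs recently, I found that some key pages needed to be saved separately. I searched online for a long time for a PDF split/merge tool and found nothing that worked well, so I wrote a small tool myself with Python's PyPDF2 and tkinter. The repository contains the code and a packaged exe executable.☆48Apr 4, 2018Updated 7 years ago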
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆48Aug 12, 2025Updated 7 months ago
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆37Dec 6, 2024Updated last year
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆62Apr 11, 2024Updated last year
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆77Jul 5, 2024Updated last year
- Medical imaging coursework: paper reading on image registration; fundus vessel segmentation experiments☆68Oct 19, 2021Updated 4 years ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025)☆132Jun 4, 2025Updated 9 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆157Oct 13, 2023Updated 2 years ago
- Leveraging Large Language Models for Visual Target Navigation☆159Oct 24, 2023Updated 2 years ago
- Java version of a 12306 ticket-grabbing program☆122Dec 24, 2019Updated 6 years ago
- Interview question bank☆187Sep 1, 2018Updated 7 years ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆218Mar 26, 2025Updated 11 months ago
- [arXiv 2023] Embodied Task Planning with Large Language Models☆193Aug 22, 2023Updated 2 years ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆237Sep 20, 2024Updated last year
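- A fairly comprehensive library of classical Chinese poetry, ci, and prose, including 210,000 poems with annotations and appreciation notes, more than 10,000 poets with their introductions and biographies, introductions to more than 1,600 cipai (ci tune patterns), analyses of more than 70 Chinese dynasties, and nearly 200 classification tags for classical poetry and prose☆403Sep 11, 2023Updated 2 years ago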
- ☆264Jan 14, 2025Updated last year
- repository for HapticLLaMA: A Multimodal Sensory Language Model for Haptic Captioning☆200Sep 3, 2025Updated 6 months ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆326Nov 7, 2023Updated 2 years ago
- OpenEQA Embodied Question Answering in the Era of Foundation Models☆343Sep 20, 2024Updated last year
- Train embodied agents that can answer questions in environments☆316Jul 25, 2023Updated 2 years ago
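- My graduation project, built with a separated front end and back end and divided into a public portal system and a back-office management system. The front end uses Vue.js, Element UI, Axios, and others to build pages and send requests; the back end is built on SpringBoot, MyBatis, Redis, and Nginx. The system implements publishing, replying to, and commenting on stray-animal rescue posts…☆478Apr 20, 2024Updated last year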
- TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification☆470May 3, 2024Updated last year
- ThreeDWorld simulation environment☆583Jun 3, 2024Updated last year
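- A collection of various website front-end templates☆752Mar 6, 2025Updated last year
- "Were heaven sentient, it too would grow old; the true way of the human world is ceaseless change"☆1,091Jan 31, 2025Updated last year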
- An open-source implementation of Google's PaLM models☆819Jun 21, 2024Updated last year
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On☆1,255Oct 12, 2025Updated 5 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆828Nov 9, 2022Updated 3 years ago
- A flexible and efficient codebase for training visually-conditioned language models (VLMs)☆956Jul 4, 2024Updated last year
- A system for building 3D Scene Graphs from sensor data in real-time☆940Updated this week
- Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"☆1,528Apr 3, 2024Updated last year
- [Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI☆1,948Mar 11, 2026Updated last week
- LPIPS metric. pip install lpips☆4,190Jul 2, 2024Updated last year
- [ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild☆4,925Mar 7, 2025Updated last year