VisionDreamer: High-Fidelity Text-to-3D Generation via Mesh-Guided 3D Gaussian Splatting
☆18Jul 7, 2025Updated 11 months ago
Alternatives and similar repositories for VisionDreamer
Users that are interested in VisionDreamer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Oct 10, 2024Updated last year
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆21Nov 2, 2023Updated 2 years ago
- This repo is about multi-label medical image classification base on CNN and Transformer.☆35Sep 21, 2025Updated 8 months ago
- Bootstrap+SpringMVC+Spring+Mybatis+MySQL搭建的二手交易网站☆21Mar 13, 2018Updated 8 years ago
- 使用Spring MVC框架 实现的云音乐网站☆29Jan 23, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 最近复习大神的pdf,发现有些重点的页面需要单独保存,在网上找了半天pdf拆分合并的工具,并没有好用的,所以自己用Python的PyPDF2和tkinter写了一个小工具,里面是代码以及打包好的exe可执行文件。☆49Apr 4, 2018Updated 8 years ago
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆55Aug 12, 2025Updated 10 months ago
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆36Dec 6, 2024Updated last year
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆62Apr 11, 2024Updated 2 years ago
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆82Jul 5, 2024Updated last year
- 医学图像作业:图像配准论文阅读;眼底血管分割实验☆70Oct 19, 2021Updated 4 years ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025)☆139Jun 4, 2025Updated last year
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆161Oct 13, 2023Updated 2 years ago
- Leveraging Large Language Models for Visual Target Navigation☆163Oct 24, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 12306抢票程序JAVA版☆122Dec 24, 2019Updated 6 years ago
- 面试题库☆187Sep 1, 2018Updated 7 years ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆226Mar 26, 2025Updated last year
- [arXiv 2023] Embodied Task Planning with Large Language Models☆194Aug 22, 2023Updated 2 years ago
- [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models☆243Apr 3, 2026Updated 2 months ago
- 比较全的中华古诗古词古文库,包括21万首古诗词,以及注释、赏析等信息,包含10000多名诗人以及诗人的介绍、生平等,同时包含,1600多个词牌介绍,中国70多个朝代解析,和古诗文的近200个分类标签☆421Sep 11, 2023Updated 2 years ago
- ☆269Jan 14, 2025Updated last year
- repository for UniMSE: Towards Unified Multimodal Sentiment Analysis and Emotion Recognition☆203Sep 3, 2025Updated 9 months ago
- OpenEQA Embodied Question Answering in the Era of Foundation Models☆363Sep 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Train embodied agents that can answer questions in environments☆315Jul 25, 2023Updated 2 years ago
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆344Nov 7, 2023Updated 2 years ago
- 自己的毕设作品,是一个前后端分离的项目,分为前台门户系统和后台管理系统。前端基于Vue.js、Element UI、Axios等实现页面的构建和请求的发送,后端基于SpringBoot、MyBatis、Redis、Nginx实现。系统实现了流浪动物救助帖子的发布、回复、评论…☆483Apr 20, 2024Updated 2 years ago
- TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification☆486May 3, 2024Updated 2 years ago
- ThreeDWorld simulation environment☆595Jun 3, 2024Updated 2 years ago
- 收集各种网站前端模板☆759Mar 6, 2025Updated last year
- An open-source implementation of Google's PaLM models☆820Jun 21, 2024Updated last year
- [CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On☆1,261Oct 12, 2025Updated 8 months ago
- 天若有情天亦老,人间正道是沧桑☆1,085Jan 31, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆824Nov 9, 2022Updated 3 years ago
- A flexible and efficient codebase for training visually-conditioned language models (VLMs)☆991Jul 4, 2024Updated last year
- A system for building 3D Scene Graphs from sensor data in real-time☆1,040Jun 4, 2026Updated last week
- Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"☆1,534Apr 3, 2024Updated 2 years ago
- [Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI☆2,077Updated this week
- LPIPS metric. pip install lpips☆4,237Jul 2, 2024Updated last year
- [ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild☆5,048Mar 7, 2025Updated last year