Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
☆18Dec 1, 2024Updated last year
Alternatives and similar repositories for Finestyler
Users that are interested in Finestyler are comparing it to the libraries listed below
Sorting:
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enh…☆16Dec 31, 2024Updated last year
- Code of AAAI2025 Paper 《VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things》☆15Jan 16, 2025Updated last year
- Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail☆16Jul 5, 2024Updated last year
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆18Jun 3, 2024Updated last year
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- Official codes and datasets for ACM MM23 paper "3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Mod…☆26Sep 13, 2024Updated last year
- Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025)☆36Feb 14, 2025Updated last year
- A PyTorch implementation of LDAST☆26Dec 17, 2023Updated 2 years ago
- Code for "HumanGif: Single-View Human Diffusion with Generative Prior"☆31Jun 29, 2025Updated 8 months ago
- 稚晖君电子Esp32脱机版☆11Jan 15, 2025Updated last year
- 这是一次学校大作业,希望和大家分享,一起进步。此项目分驱动部分,遥控部分,视觉部分以及Web控制部分。是基于ESP32与Jetson Nano做的一个小项目。其中运用到了蓝牙串口片与片之间的通信,IP私域下的多机通信,以及ESP32中便携的Web功能进行通信。具体各部分内容…☆12Nov 5, 2024Updated last year
- 🕹 Pikachu-volleyball game-based multi-agent RL environment using PettingZoo☆11Sep 29, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Web interface for building and managing your own agentic record label.☆10Updated this week
- 智慧园区☆10Aug 3, 2017Updated 8 years ago
- ☆89Feb 15, 2025Updated last year
- Official code for VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation.☆87Jun 25, 2024Updated last year
- ☆39Oct 19, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- ChatPaperPlus☆11Mar 13, 2023Updated 2 years ago
- Official Code for "CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images (ECCV 2024)"☆40Jul 25, 2025Updated 7 months ago
- An interactive demo based on Segment-Anything for style transfer which enables different content regions apply different styles.☆101Apr 24, 2023Updated 2 years ago
- TexPainter: Generative Mesh Texturing with Multi-view Consistency☆95Nov 13, 2024Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- 🤖 A list of latest AGI-related repos, resources and courses including LLMs and AI Agents.☆13Sep 24, 2024Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆13Apr 19, 2024Updated last year
- 一个小智控制电脑的接口集合,可以自行扩展AI能力☆33Dec 16, 2025Updated 2 months ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆12Jan 22, 2025Updated last year
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…☆10Oct 1, 2024Updated last year
- Official Implementation for "TEXTure: Semantic Texture Transfer using Text Tokens"☆11Apr 19, 2023Updated 2 years ago
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆15Mar 26, 2025Updated 11 months ago
- This report presents a Neural Style Transfer project that focuses on performing style transfer on both images and videos. The process inv…☆12Jun 23, 2023Updated 2 years ago
- 一款开源的企业级智能体,专为运维而生☆52Updated this week
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.☆10Jan 27, 2025Updated last year
- A production-ready FastAPI template for building AI agent applications with LangGraph integration. This template provides a robust founda…☆26Updated this week
- Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson Automatic Speech Recognition (ASR) deep learning interface li…☆13Feb 6, 2026Updated last month