Hoantrbl / SeeTrek
See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model
☆42Updated this week
Alternatives and similar repositories for SeeTrek
Users who are interested in SeeTrek are comparing it to the libraries listed below
- ☆67Updated 4 months ago
- ☆251Updated 10 months ago
- ☆206Updated 6 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆60Updated 9 months ago
- Butter is a novel 2D object detection framework designed to enhance hierarchical feature representations for improved detection robustnes…☆85Updated 3 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆214Updated last month
- This is the code for Visual Reasoning Sequential Attack, which is a method to jailbreak Multimodal Large Language Models Based on their v…☆48Updated last week
- [Software] Sketch-based tree modeling software, implemented in C++ and OpenGL.☆115Updated 4 months ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System☆101Updated 4 months ago
- The code for TPAMI paper "Text-Guided Human Image Manipulation via Image-Text Shared Space"☆86Updated 3 years ago
- ☆342Updated 5 months ago
- Text-to-3D Generation by 2D Editing☆112Updated 4 months ago
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆113Updated 6 months ago
- ☆36Updated last year
- ☆54Updated 6 months ago
- Code for paper "CycleDiff: Cycle Diffusion Models for Unpaired Image-to-image Translation"☆57Updated last month
- A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…☆342Updated 2 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆200Updated 3 years ago
- 采集管家 (Collection Manager)☆313Updated 6 months ago
- Official implementation of "Enhancing Close-up Novel View Synthesis via Pseudo-labeling" [AAAI 2025]☆15Updated 8 months ago
- NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation☆218Updated last month
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆65Updated 4 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆252Updated last week
- Flexible RAG tools, Features semantic search, document indexing, and intelligent reranking with minimal intrusion design.☆89Updated 3 months ago
- ☆200Updated this week
- ☆143Updated last year
- ☆135Updated last year
- The Collapse of Patches☆32Updated last week
- mini-webui delivers a streamlined AI chat console for teams that need rapid iteration, reliable integrations, and production-ready guardr…☆44Updated 3 weeks ago
- The simplest try-on pipeline! The simplest AIGC outfit-swapping algorithm!☆54Updated 10 months ago