AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)
☆24Dec 11, 2024Updated last year
Alternatives and similar repositories for AnyTrans
Users that are interested in AnyTrans are comparing it to the libraries listed below
Sorting:
- This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…☆26Oct 10, 2024Updated last year
- ☆16Dec 25, 2025Updated 2 months ago
- ☆10Mar 31, 2025Updated 11 months ago
- Visualize attention maps in Diffusion Models☆22Mar 10, 2025Updated 11 months ago
- tutorials☆22Aug 12, 2022Updated 3 years ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆33Dec 9, 2025Updated 2 months ago
- [ICML2024]The official implementation of SemiRES in PyTorch.☆33Jun 20, 2024Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 5 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- [MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation☆42Dec 15, 2024Updated last year
- GAIIC2024无人机视角下的双光目标检测 - Rank6 解决方案☆11Jun 17, 2024Updated last year
- ☆14Aug 28, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆17Sep 15, 2025Updated 5 months ago
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆14Jan 5, 2024Updated 2 years ago
- ☆19Jul 8, 2025Updated 7 months ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 8 months ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- 收集量子机器学习的基础、算法、学习、项目等资料的收集。Here you can get all the Quantum Machine learning Basics, Algorithms ,Study Materials ,Projects and the descri…☆11Jan 4, 2018Updated 8 years ago
- [CVPR 2025] Official implementation of SSP: High Temporal Consistency through Semantic Similarity Propagation in Semi-Supervised Video Se…☆15Jun 26, 2025Updated 8 months ago
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆24Jan 27, 2026Updated last month
- This project is a demonstration of a content-based recommendation system for Spotify that leverages user's preferences and audio features…☆17Apr 4, 2023Updated 2 years ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Aug 5, 2025Updated 6 months ago
- ☆25Nov 22, 2024Updated last year
- RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆17Nov 28, 2025Updated 3 months ago
- ☆11Oct 16, 2023Updated 2 years ago
- The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI"☆16Mar 25, 2025Updated 11 months ago
- Complete Head and Foot visualization from DICOM files using PyQT5☆11Jun 24, 2021Updated 4 years ago
- ☆12Oct 17, 2024Updated last year
- The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"☆17Mar 24, 2025Updated 11 months ago
- ☆12Jul 21, 2022Updated 3 years ago
- Calibrating LLM Confidence by Probing Perturbed Representation Stability☆17Jul 5, 2025Updated 7 months ago
- The official version of the paper "MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection"☆19Oct 24, 2024Updated last year
- Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation[TNNLS2024]☆13May 6, 2025Updated 9 months ago
- Motion-conditional image animation for video editing☆20Dec 2, 2023Updated 2 years ago
- ☆11Jun 12, 2024Updated last year
- ☆12Jul 12, 2024Updated last year
- ☆11Jun 3, 2023Updated 2 years ago