828Tina / textvqa_grounding_task_qwen2.5-vl-ftView external linksLinks
☆83May 20, 2025Updated 8 months ago
Alternatives and similar repositories for textvqa_grounding_task_qwen2.5-vl-ft
Users that are interested in textvqa_grounding_task_qwen2.5-vl-ft are comparing it to the libraries listed below
Sorting:
- ☆13Jan 3, 2024Updated 2 years ago
- 通用数字人系统是一个基于深度学习和WebRTC技术的智能交互平台,集成了Azure Avatar数字人渲染、语音识别合成、自然语言处理等技术。系统支持实时对话、知识问答和情感交互,可实现30FPS以上的流畅渲染和200ms以内的低延迟响应。核心功能包括基于GPT的智能对话、…☆27Dec 17, 2025Updated 2 months ago
- This repositary contains an implemetation of the two stage networks CVNet and SuperGlobal, for Image Retrieval.☆24Feb 20, 2024Updated last year
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Mar 24, 2025Updated 10 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆32Mar 29, 2024Updated last year
- Automatic defect recognition in X-ray testing using computer vision☆12Dec 8, 2018Updated 7 years ago
- Adaptive and Robust Multi-Task Learning☆10May 19, 2024Updated last year
- [NeurIPS 2025 Spotlight] Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning☆15Nov 14, 2025Updated 3 months ago
- RGBT Tracking via All-layer Multimodal Interactions with Mamba☆15May 7, 2025Updated 9 months ago
- YouTubeGPT is an LLM-based web-app that can be run locally and allows you to summarize and chat (Q&A) with YouTube videos.☆12Updated this week
- Find strongest response of convolutional layers on an image dataset. Automatically compute receptive field for any CNN layer.☆14Feb 19, 2021Updated 4 years ago
- ☆14Jul 1, 2025Updated 7 months ago
- 红外和可见光融合☆10Apr 17, 2019Updated 6 years ago
- The Codes and Data of A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection [ICLR'25]☆229Jan 14, 2026Updated last month
- ☆16Oct 9, 2024Updated last year
- ☆12Sep 23, 2025Updated 4 months ago
- Chest Xray Classifier using CNNs and Transfer Learning. The jupyter notebook of interest is titled 'Xrays_alt.ipynb'☆11May 18, 2018Updated 7 years ago
- [AAAI2026] CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking☆23Jan 31, 2026Updated 2 weeks ago
- mobileNet SSD 基于caffe的前向检测☆10Nov 30, 2018Updated 7 years ago
- ☆12Jun 1, 2024Updated last year
- [AAAI 2025 Oral] ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks https://arxiv.org/…☆10Jun 25, 2025Updated 7 months ago
- Repository for "Training Audio Captioning Models without Audio"☆10Sep 26, 2023Updated 2 years ago
- No reference blur metric based on Just Noticeable Blur (JNB)☆11Aug 30, 2019Updated 6 years ago
- ☆11Sep 25, 2024Updated last year
- ☆11Aug 31, 2025Updated 5 months ago
- Implementing Face Beautification: Beyond Makeup Transfer (https://www.catalyzex.com/paper/arxiv:1912.03630)☆13May 6, 2020Updated 5 years ago
- This is the official implementation of paper: Landmark Localization from Medical Images with Generative Distribution Prior☆12Mar 4, 2024Updated last year
- [ICCV25] MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions☆19Oct 14, 2025Updated 4 months ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 3 months ago
- scikit-procrustes, a collection of solvers for the (Weighted) Orthogonal Procrustes Problem☆11Apr 3, 2019Updated 6 years ago
- ios app signature server,Using the Apple Undisclosed Web Interface Signing Application.☆10Apr 21, 2019Updated 6 years ago
- ppt转数字人后台☆17Apr 9, 2025Updated 10 months ago
- 🏆 SUCCESS-GS: Survey of Compactness and Compression for Efficient Static and Dynamic Gaussian Splatting☆18Feb 4, 2026Updated last week
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 10 months ago
- A custom watcher plugin for Elasticsearch that feeds Apache Kafka☆11Mar 9, 2018Updated 7 years ago
- BiLSTM+CRF☆10Jan 15, 2019Updated 7 years ago
- ☆11Apr 23, 2022Updated 3 years ago