☆84May 20, 2025Updated 10 months ago
Alternatives and similar repositories for textvqa_grounding_task_qwen2.5-vl-ft
Users that are interested in textvqa_grounding_task_qwen2.5-vl-ft are comparing it to the libraries listed below
- Comprehensive benchmark for video text understanding☆28Jun 4, 2025Updated 9 months ago
- Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023)☆11Aug 1, 2023Updated 2 years ago
- [NeurIPS 24 Spotlight] Training in Pairs + Inference on Single Image with Anchors☆48Feb 20, 2025Updated last year
- This demo demonstrates the AI capabilities of the mcxn947. It displays the image captured by the camera on the LCD screen and performs fa…☆12Jul 21, 2025Updated 8 months ago
- This is the official repository of Physics-AD☆20Feb 24, 2026Updated 3 weeks ago
- Second place in the DanceTrack competition☆13Jan 29, 2023Updated 3 years ago
- Multiple-Person Multi-Camera Tracker☆13Feb 17, 2017Updated 9 years ago
- Towards Training-free Open-world Segmentation via Image Prompt Foundation Models☆18Nov 22, 2024Updated last year
- ☆11Nov 8, 2022Updated 3 years ago
- Add YOLOv3_tiny and data augmentation (clip, brighten, change saturation)☆14Jan 14, 2021Updated 5 years ago
- This repository contains all the source code needed to reproduce the experiments or review the results obtained in the research paper "…☆13Dec 9, 2023Updated 2 years ago
- [AAAI'26] SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction☆26Nov 19, 2025Updated 4 months ago
- [ICCV25] MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions☆21Feb 19, 2026Updated last month
- [ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…☆15Dec 30, 2025Updated 2 months ago
- ☆22Mar 6, 2026Updated 2 weeks ago
- Halcon using and programming all in one.☆21May 25, 2025Updated 9 months ago
- Non-maximum suppression, with implementations in MATLAB, C, and C++ that all run correctly. Includes C++ and MATLAB test demos. All programs are commented in detail. GOOD LUCK!☆31Apr 5, 2018Updated 7 years ago
- 3D LUTs for Real Time sRGB White-Balance Correction☆13Dec 14, 2023Updated 2 years ago
- ☆19Jun 10, 2025Updated 9 months ago
- Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation with Task Prompt and Anatomical Prior☆16Sep 30, 2024Updated last year
- EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning [🔥The Exploration of R1 for General Audio-Vi…☆76May 18, 2025Updated 10 months ago
- shopee integration for n8n☆15Sep 20, 2024Updated last year
- A multimodal large model for X-ray recognition, fine-tuned from LLaVA1.6☆10Oct 22, 2024Updated last year
- This repo contains implementation of deep learning-based steel surface defect segmentation models. Extensive experiments on several deep …☆21May 19, 2025Updated 10 months ago
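- This project builds and optimizes, from scratch, a large-scale pretrained language model with tens of millions of parameters, covering three stages: pretraining, supervised fine-tuning (SFT), and R1 reasoning distillation. It uses a custom Transformer architecture (including RMSNorm, grouped attention, a multi-query mechanism, SwiGLU activation, and RoPE positional encoding) to achieve efficient long-text processing and…☆21Mar 10, 2025Updated last year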
- Conditional EEG diffusion model☆16Apr 5, 2024Updated last year
- [ECCV2024] ModTr: Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge☆19Nov 28, 2024Updated last year
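- An image preprocessing toolkit for computer-vision classification of hyperspectral images, including removal of irrelevant image backgrounds, data augmentation, and label-file generation☆18Nov 4, 2023Updated 2 years ago
- This is an implementation that integrates a simple but efficient attention block into a CNN + bidirectional LSTM for video classification.☆26Aug 2, 2024Updated last year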
- [CVPR 2026] VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving☆71Mar 10, 2026Updated last week
- Unofficial implementation of https://arxiv.org/pdf/2301.08871v1.pdf in PyTorch☆15Apr 20, 2023Updated 2 years ago
- This is a simple toolkit to view and crop image patches for image/video super-resolution tasks.☆11Jan 6, 2023Updated 3 years ago
- The Codes and Data of A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection [ICLR'25]☆238Jan 14, 2026Updated 2 months ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- Co-DETR (Detection Transformer) compiled from PyTorch to NVIDIA TensorRT☆20Apr 19, 2025Updated 11 months ago
- Implementation of popular deep learning networks with TensorRT network definition APIs☆10Mar 25, 2021Updated 4 years ago
- Official implementation of Instance-wise and Center-of-Instance (ICI) segmentation loss☆12Oct 6, 2023Updated 2 years ago
- The most basic, beginner-friendly introduction to natural language processing, based on deepseek-r1, covering traditional NLP and modern large models☆23Jan 16, 2026Updated 2 months ago
- Official implementation of ICLR 2025 "One-for-All Few-Shot Anomaly Detection via Instance-Induced Prompt Learning"☆38May 8, 2025Updated 10 months ago