☆93May 20, 2025Updated last year
Alternatives and similar repositories for textvqa_grounding_task_qwen2.5-vl-ft
Users that are interested in textvqa_grounding_task_qwen2.5-vl-ft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jan 3, 2024Updated 2 years ago
- Comprehensive benchmark for video text understanding☆29Jun 4, 2025Updated last year
- ☆37Jun 9, 2025Updated last year
- Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023)☆11Aug 1, 2023Updated 2 years ago
- This demo demonstrates the AI capabilities of the mcxn947. It displays the image captured by the camera on the LCD screen and performs fa…☆12May 18, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- dancetrack 比赛第二名☆13Jan 29, 2023Updated 3 years ago
- ☆28Oct 31, 2024Updated last year
- Add YOLOv3_tiny and data augment(clip, brighten, change saturation)☆14Jan 14, 2021Updated 5 years ago
- This is official repository of Physics-AD☆21Feb 24, 2026Updated 3 months ago
- [ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…☆16Dec 30, 2025Updated 5 months ago
- ☆27Mar 6, 2026Updated 3 months ago
- [ICCV25] MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions☆21Feb 19, 2026Updated 4 months ago
- 3D LUTs for Real Time sRGB White-Balance Correction☆13Dec 14, 2023Updated 2 years ago
- ☆17May 18, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆22Apr 22, 2025Updated last year
- 在千问最新的多模态image-text模型Qwen3-VL-4B-Instruct 进行多种lora微调对比效果,通过langchain+RAG+多智能体(Multi-Agent)进行部署☆50Dec 14, 2025Updated 6 months ago
- [AAAI' 26]SparseSurf: Sparse-View 3D Gaussian Splatting for Surface Reconstruction☆32May 11, 2026Updated last month
- 基于LLaVA1.6微调的Xray识别的多模态大模型☆10Oct 22, 2024Updated last year
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆22Mar 10, 2025Updated last year
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…☆24Apr 6, 2025Updated last year
- Some papers about instance segmentation☆20Aug 9, 2022Updated 3 years ago
- [ECCV2024] ModTr: Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge☆19Nov 28, 2024Updated last year
- This is a simple toolkit to view and crop image patches for image/video super-resolution tasks.☆11Jan 6, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Co-DETR (Detection Transformer) compiled from PyTorch to NVIDIA TensorRT☆20Apr 19, 2025Updated last year
- Official implementation of Instance-wise and Center-of-Instance (ICI) segmentation loss☆12Oct 6, 2023Updated 2 years ago
- The Codes and Data of A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection [ICLR'25]☆258Jan 14, 2026Updated 5 months ago
- Official implement of ICLR 2025 "One-for-All Few-Shot Anomaly Detection via Instance-Induced Prompt Learning"☆40May 8, 2025Updated last year
- ☆22Jun 19, 2024Updated 2 years ago
- ☆14Sep 2, 2025Updated 9 months ago
- A Framework for Symbolic MUsic Graph Explanations☆11Jul 30, 2025Updated 10 months ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated 2 years ago
- 通用数字人系统是一个基于深度学习和WebRTC技术的智能交互平台,集成了Azure Avatar数字人渲染、语音识别合成、自然语言处理等技术。系统支持实时对话、知识问答和情感交互,可实现30FPS以上的流畅渲染和200ms以内的低延迟响应。核心功能包括基于GPT的智能对话、…☆33Dec 17, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Jun 1, 2024Updated 2 years ago
- ☆15Aug 3, 2019Updated 6 years ago
- (ICLR 2025) BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models☆25Oct 4, 2024Updated last year
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- ☆11Nov 3, 2021Updated 4 years ago
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)☆32Mar 29, 2024Updated 2 years ago
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆29Jan 16, 2026Updated 5 months ago