hzq-zjm / ppocrv5-tensorrtLinks
ppocrv5, 以TensorRT-v10版本作为推理引擎
☆32Updated 7 months ago
Alternatives and similar repositories for ppocrv5-tensorrt
Users that are interested in ppocrv5-tensorrt are comparing it to the libraries listed below
Sorting:
- [ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration☆447Updated 6 months ago
- 之前在做视频理解相关的工作,用qt写了一个视频动作标注工具, 简单易用。☆21Updated last year
- [AAAI 2026] Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback☆293Updated 2 months ago
- Efficient controlnet for DiTs☆382Updated 8 months ago
- [NeurIPS2025 spotlight★] Official implementation for "RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Eff…☆219Updated 3 weeks ago
- ☆386Updated 6 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆915Updated 2 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆534Updated last month
- ☆115Updated last month
- 🚀 A minimal and lightweight video streaming management platform 一个极简轻量的视频流媒体管理平台☆442Updated this week
- MTLA: Multi-head Temporal Latent Attention☆760Updated 3 months ago
- A powerful baseline for image classification, face recognition and image retrieval with Pytorch☆583Updated 2 months ago
- ☆1,157Updated 2 months ago
- ☆56Updated 6 months ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆356Updated last month
- Dataset approched by A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection☆1,004Updated 10 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆860Updated last month
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy☆186Updated this week
- JetYOLO:Speed through your TensorRT/Deepstream app development.☆118Updated last year
- Res-SAM Framework for GPR Underground Hazard Detection☆1,610Updated 2 months ago
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,104Updated 2 months ago
- A professional tool to safely and efficiently apply LLM-suggested code changes to your local codebase in a controllable, reviewable way.☆336Updated 2 weeks ago
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆558Updated last week
- 基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型☆29Updated 5 months ago
- 3D generation made easy!☆436Updated last month
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection☆352Updated 3 months ago
- A PyTorch implementation of diffusion models built from scratch☆45Updated 9 months ago
- [TPAMI 2025] Implementation of the paper “Heatmap Pooling for Action Recognition from RGB Videos”.☆62Updated last month
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆518Updated 6 months ago
- Fat-Cat: A document-centric context management Agent. Making context as simple as reading chat history.☆432Updated 2 weeks ago