☆13Jun 22, 2022Updated 3 years ago
Alternatives and similar repositories for tianchi-ocr
Users that are interested in tianchi-ocr are comparing it to the libraries listed below
Sorting:
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- Reinforcing Action Policies by Prophesying☆40Nov 26, 2025Updated 3 months ago
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Jul 13, 2025Updated 7 months ago
- ☆18Apr 10, 2025Updated 10 months ago
- ☆17Dec 12, 2023Updated 2 years ago
- YOLO Series☆14Oct 20, 2023Updated 2 years ago
- ☆18Oct 22, 2024Updated last year
- OpenAI GPT-4 assistant, combined with the power of YoloV8 realtime object detection, Whisper speech recognition, text to speech and googl…☆17Jan 18, 2024Updated 2 years ago
- LAR-YOLOv8☆22Feb 28, 2024Updated 2 years ago
- 基于yolov7 加入 depth回归☆19Nov 4, 2022Updated 3 years ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆37Nov 21, 2025Updated 3 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆31Jun 12, 2025Updated 8 months ago
- yolov5实现基于kld的旋转目标检测☆27Nov 25, 2022Updated 3 years ago
- ☆22Jun 20, 2023Updated 2 years ago
- ☆64Nov 2, 2025Updated 4 months ago
- [ECCV2022] Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection☆25Jul 22, 2022Updated 3 years ago
- YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite☆26Mar 5, 2022Updated 4 years ago
- [NeurIPS'22] Projector Ensemble Feature Distillation☆30Jan 4, 2024Updated 2 years ago
- [ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"☆71Oct 25, 2025Updated 4 months ago
- Object Detection and YOLOV7-AC☆29Dec 3, 2023Updated 2 years ago
- ☆79Nov 3, 2025Updated 4 months ago
- Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"☆53Jul 5, 2025Updated 8 months ago
- Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"☆53Jun 17, 2025Updated 8 months ago
- ☆30Mar 4, 2022Updated 4 years ago
- provide some new architecture, channel pruning and quantization methods for yolov5☆31Oct 13, 2025Updated 4 months ago
- 知识蒸馏复现相关☆27Aug 3, 2022Updated 3 years ago
- 基于streamlit的YOLOv8可视化交互界面☆35Sep 14, 2023Updated 2 years ago
- An unofficial code for paper "Learning Frequency-aware Dynamic Network for Efficient Super-Resolution"☆34May 20, 2021Updated 4 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆33Apr 18, 2022Updated 3 years ago
- [NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency☆76Sep 19, 2025Updated 5 months ago
- UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation☆135Nov 19, 2025Updated 3 months ago
- YOLOv8 image segmentation through ONNX in Python☆36Aug 8, 2023Updated 2 years ago
- Official Code of CVPR 2025 paper "SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters"☆52Jul 13, 2025Updated 7 months ago
- YOLOv7+KLD☆39Oct 10, 2023Updated 2 years ago
- 🚀 Simple and efficient use for Ultralytics yolov5🚀☆32Jan 17, 2023Updated 3 years ago
- ☆46Feb 23, 2023Updated 3 years ago
- [arXiv '24] Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels☆47Aug 28, 2024Updated last year
- 基于yoloV5-V6系列,train_palte添加多头检测。train_key添加关键点检测算法。☆45Nov 9, 2022Updated 3 years ago
- GHOST is accepted by TGRS☆50Jul 24, 2024Updated last year