usydnlp/vdoc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/usydnlp/vdoc)

usydnlp / vdoc

☆15

Alternatives and similar repositories for vdoc

Users that are interested in vdoc are comparing it to the libraries listed below

Sorting:

ChenyuGAO-CS / SMA
View on GitHub
The imdb files with SBD-Trans OCR for TextVQA dataset.
☆11Nov 30, 2021Updated 4 years ago
WenjinW / LATIN-Prompt
View on GitHub
☆51May 28, 2024Updated last year
uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆55Oct 30, 2024Updated last year
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
onealwj / MVLT
View on GitHub
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆29Nov 11, 2022Updated 3 years ago
ysig / learnable-typewriter
View on GitHub
The Learnable Typewriter: A Generative Approach to Text Line Analysis
☆34Oct 31, 2024Updated last year
PhoebusSi / SAR
View on GitHub
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
☆31Nov 24, 2021Updated 4 years ago
sachinraja13 / TabStructNet
View on GitHub
☆132Mar 24, 2023Updated 2 years ago
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Feb 20, 2026Updated last week
RanaMostafaAbdElMohsen / LMOT
View on GitHub
[IEEE Access - 2022] LMOT : Efficient Light-Weight Detection and Tracking in Crowds
☆41Dec 18, 2024Updated last year
U-Sharma / NeuralScaleID
View on GitHub
☆12Oct 5, 2020Updated 5 years ago
twelvelabs-io / pegasus-1-eval
View on GitHub
Repository for evaluating Pegasus-1 and video-language foundation models
☆14Nov 12, 2024Updated last year
KleinYuan / tf-segmentation
View on GitHub
Real-time semantic segmentation inference production ready code based on deeplab-resnet/psp-net and tensorflow
☆11May 18, 2018Updated 7 years ago
xadrianzetx / mobileunet-tensorflow
View on GitHub
Lane segmentation model trained with tensorflow implementation MobileNetV2 based U-Net
☆11Mar 24, 2023Updated 2 years ago
husterpzh / PSSR
View on GitHub
Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration （CVPR2023）"
☆10May 15, 2024Updated last year
SJTU-Plus / ManagerBot
View on GitHub
QQ 群验证机器人
☆10Nov 9, 2021Updated 4 years ago
bfolder / UIDevice-HardwareModel
View on GitHub
Determines hardware type of current iOS device.
☆35Oct 12, 2015Updated 10 years ago
dali92002 / OCR-TR
View on GitHub
Optocal Character Recognition (OCR / HTR) using Transformers
☆11Aug 20, 2022Updated 3 years ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
JungUnYun / License-Plate-Recognition
View on GitHub
License Plate Recognition based on semantic segmentation approach using U-Net
☆13Dec 5, 2019Updated 6 years ago
kurtjacobsdev / torch2ios
View on GitHub
Torch7 Library - Convert NN Models To iOS Format
☆11Aug 8, 2016Updated 9 years ago
vis-nlp / OpenCQA
View on GitHub
☆12Jun 20, 2023Updated 2 years ago
kaka-lin / yolov3-tf2
View on GitHub
Implemented YOLOv3 with Tensorflow 2.0
☆14Jan 12, 2023Updated 3 years ago
usydnlp / CONDA
View on GitHub
This repository is for CONDA dataset
☆10Nov 28, 2022Updated 3 years ago
ZechuanLi / GO-N3RDet
View on GitHub
[CVPR 2025] GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
☆16Mar 19, 2025Updated 11 months ago
KimiaLabMayo / kimia_path24
View on GitHub
Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings
☆10Jun 6, 2024Updated last year
dghost / GLVideoFilter
View on GitHub
An open-source prototype to test the feasibility of real-time video filtering as an accessibility tool.
☆19Jan 2, 2021Updated 5 years ago
INK-USC / Reflect
View on GitHub
Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)
☆11Nov 28, 2022Updated 3 years ago
fabianandresgrob / ChartOCR
View on GitHub
ChartOCR, based on original repo.
☆13Mar 22, 2023Updated 2 years ago
zxytim / pdf2images
View on GitHub
Convert pdf to pages of images
☆13Apr 18, 2020Updated 5 years ago
leon2000-ai / TSGaussian
View on GitHub
TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views
☆18Jan 14, 2026Updated last month
huangxuwei / jni-maven-example
View on GitHub
A JNI Example with Maven
☆10Jan 22, 2018Updated 8 years ago
srebuffi / semisup_scarse
View on GitHub
PyTorch implementation of Semi-Supervised Learning with Scarce Annotations https://arxiv.org/pdf/1905.08845.pdf
☆13Jan 6, 2020Updated 6 years ago
n00neimp0rtant / ControlFreak
View on GitHub
hardware gamepad support in any app
☆15May 21, 2012Updated 13 years ago
OpenMask3D / openmask3d.github.io
View on GitHub
☆11May 8, 2024Updated last year
deciduus / deciduus-agent-zero
View on GitHub
Agent Zero AI framework
☆13Dec 15, 2025Updated 2 months ago
aalto-intelligent-robotics / REACT
View on GitHub
Code for REACT: Real-time Efficient Attribute Clustering and Transfer for Updatable 3D Scene Graph
☆16Feb 12, 2026Updated 2 weeks ago
NinaWie / COVID-BLUES
View on GitHub
Dataset of lung ultrasound videos for research on AI-based medical image analysis
☆17Nov 9, 2025Updated 3 months ago
TheNobody-12 / MOT_WITH_YOLOV9_STRONG_SORT
View on GitHub
☆11Jun 18, 2024Updated last year