☆20Dec 18, 2025Updated 2 months ago
Alternatives and similar repositories for HisDoc1B
Users that are interested in HisDoc1B are comparing it to the libraries listed below
Sorting:
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24)☆13Oct 15, 2025Updated 4 months ago
- [EMNLP 2024] TongGu, a classical Chinese language model.☆61Sep 28, 2024Updated last year
- [ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…☆54Dec 22, 2025Updated 2 months ago
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆105Jul 15, 2025Updated 7 months ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆25Jul 8, 2022Updated 3 years ago
- ☆70Jul 6, 2020Updated 5 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 6 months ago
- ☆13Oct 25, 2024Updated last year
- ☆17Jul 24, 2025Updated 7 months ago
- ☆10Dec 18, 2024Updated last year
- ☆11Nov 5, 2025Updated 3 months ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated 11 months ago
- Ruby script to download bulk results from Archive.org's TV News database of closed captions☆14Mar 20, 2013Updated 12 years ago
- python ai for 书法识别 Calligraphy recognition☆11Apr 15, 2019Updated 6 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- ☆12Mar 24, 2024Updated last year
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆10Feb 22, 2022Updated 4 years ago
- ☆13Feb 25, 2025Updated last year
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Jan 31, 2022Updated 4 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆10Feb 18, 2026Updated last week
- ☆14Jun 3, 2024Updated last year
- 基于深度学习的汉字字体补全系统(VQ-VAE+扩散模型)☆22Aug 10, 2025Updated 6 months ago
- Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.☆13Feb 20, 2024Updated 2 years ago
- MutRex - A generator of fault detecting strings for regular expressions☆12Mar 18, 2024Updated last year
- The official implement of CTRNet++.☆14Dec 30, 2024Updated last year
- Peng et al. "RED-Net: A Recurrent Encoder–Decoder Network for Video-Based Face Alignment". IJCV, 2018.☆12Jul 19, 2018Updated 7 years ago
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 7 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- ☆13May 9, 2022Updated 3 years ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆67Jun 6, 2024Updated last year
- Support code for LAEO-Net paper☆13Mar 24, 2021Updated 4 years ago
- ☆12Dec 2, 2017Updated 8 years ago
- ☆11May 31, 2019Updated 6 years ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆28Nov 1, 2025Updated 4 months ago
- This is a primitive Graphic editor written in Qt 5.10☆11Jul 6, 2021Updated 4 years ago
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated last week
- WildVSR☆21Dec 13, 2023Updated 2 years ago
- Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming☆11Jun 25, 2020Updated 5 years ago
- ☆16Dec 10, 2023Updated 2 years ago