Comprehensive Scene Text Recognition Toolkit across 11 Indian Languages
☆44Dec 6, 2025Updated 3 months ago
Alternatives and similar repositories for IndicPhotoOCR
Users that are interested in IndicPhotoOCR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Implementation of Few-shot Visual Relationship Co-localization☆25Aug 25, 2021Updated 4 years ago
- Flow Chart Image-to-Code Generation☆36Aug 13, 2023Updated 2 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated 11 months ago
- ☆25Feb 26, 2026Updated last month
- Implementation of Deep Learning Neural Network (RUnet) for Super-Resolution☆23Feb 9, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ESWA 2025] Official pytorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"☆16Aug 9, 2021Updated 4 years ago
- ☆45Dec 15, 2022Updated 3 years ago
- Text to Speech for Indic languages☆52Mar 23, 2022Updated 4 years ago
- (CVPR 2023) Official implemention of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos…☆31Apr 2, 2024Updated last year
- FloVD official pytorch codes☆47May 13, 2025Updated 10 months ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated 2 months ago
- This repository contain my 75Day Hard Generative AI and LLM Learning Challenge.☆49May 18, 2024Updated last year
- A simple GPT-3 interface to automate core legal writing tasks☆13Mar 8, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆40Dec 5, 2022Updated 3 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 3 years ago
- Colab notebook to finetune GLIDE.☆12Mar 22, 2022Updated 4 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- A pathway and collection of resources to learning Jax from beginning to advance.☆11Jan 2, 2021Updated 5 years ago
- ☆11Sep 7, 2020Updated 5 years ago
- Non-local Modeling for Image Quality Assessment☆13Dec 20, 2023Updated 2 years ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Mar 12, 2026Updated 2 weeks ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 11 months ago
- ☆14Aug 30, 2022Updated 3 years ago
- Scraping LegiFrance naturalisation decrees for fun and OSINT profit☆12May 27, 2023Updated 2 years ago
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines☆12Aug 16, 2022Updated 3 years ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆19May 27, 2023Updated 2 years ago
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- AutoTag-YOLOv8 is an object detection project that uses the YOLOv8 model and leverages the power of SAM and DINGO models for automatic la…☆13May 3, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A reddit scraping and analysis bot to visualize linguistic and content trends☆12Oct 5, 2021Updated 4 years ago
- The GitHub open source software repository on interpreting super-resolution CNNs for sub-pixel motion compensation in video coding☆11May 20, 2022Updated 3 years ago
- ☆11May 9, 2022Updated 3 years ago
- A metric learning method to learn a provably robust Mahalanobis distance☆10Jan 29, 2022Updated 4 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆11Apr 20, 2025Updated 11 months ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago