Bhashini-IITJ / IndicPhotoOCR
Comprehensive Scene Text Recognition Toolkit across 11 Indian Languages
☆40 Updated last month
Alternatives and similar repositories for IndicPhotoOCR
Users who are interested in IndicPhotoOCR are comparing it to the libraries listed below.
- Indic-Conformer models for ASR ☆20 Updated last year
- An open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of dom… ☆23 Updated last year
- Implementation of Baseline for Scene Text-to-Scene Text Translation ☆18 Updated 9 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆70 Updated 7 months ago
- Simple, Unified Repository for Retrieval-based Voice Conversion ☆17 Updated last year
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga… ☆37 Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages ☆282 Updated last year
- Shoonya - Platform to Annotate and label data at scale. ☆60 Updated 2 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H… ☆84 Updated 5 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 Updated 5 years ago
- A curated list of resources in audio visual question answering and related area. :-) ☆17 Updated 6 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit) ☆37 Updated 5 months ago
- Document Summarization App using large language model (LLM) and Langchain framework. Used a pre-trained T5 model and its tokenizer from H… ☆13 Updated 2 years ago
- CalBERT - Code-mixed Adaptive Language representations using BERT, published at AAAI-MAKE 2022 ☆13 Updated 2 years ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll… ☆27 Updated last year
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings. ☆10 Updated 2 years ago
- This repository is a comprehensive project that leverages the XLM-Roberta model for intent detection. This repository is a valuable resou… ☆16 Updated last year
- Sample and Computation Redistribution for Efficient Face Detection ☆15 Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages. ☆17 Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME ☆105 Updated 9 months ago
- A Machine Learning tool to create the training dataset very quickly & easily by using a smart chrome extension ☆14 Updated 2 years ago
- Arxflix turns your boring Arxiv research paper into a captivating video. ☆57 Updated 3 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet ☆38 Updated last year
- ☆125 Updated last year
- IndicGenBench is a high-quality, multilingual, multi-way parallel benchmark for evaluating Large Language Models (LLMs) on 4 user-facing … ☆56 Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆63 Updated last year
- A desktop compatible version of the Defog app ☆14 Updated last year
- Pretraining and finetuning for visual instruction following with Mixture of Experts ☆16 Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ☆33 Updated 2 years ago