Better models for Indic Scripts
☆54Sep 12, 2018Updated 7 years ago
Alternatives and similar repositories for tessdata
Users that are interested in tessdata are comparing it to the libraries listed below
- Synthetically generate random text document images with ground-truth☆12Jul 20, 2021Updated 4 years ago
- Generate large textual corpora for almost any language by crawling the web☆13Feb 17, 2024Updated 2 years ago
- Parse Searchable Electoral Rolls☆11Apr 20, 2025Updated 11 months ago
- Gaze-based Source Code Navigation for Brackets.io☆11Feb 22, 2017Updated 9 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Working on a new version of Brainspell (an open-source platform for neuroimaging literature) to make a public JSON API that collaborators…☆17Dec 7, 2022Updated 3 years ago
- Kompakkt - the web based 3D viewer and 3D annotation system.☆17Jan 27, 2026Updated last month
- Research project on the state of the field of Multilingual Digital Humanities, with an initial focus on Arabic☆13Mar 1, 2026Updated 2 weeks ago
- Rooted in calligraphy, Gotu is a modulated display typeface in Devanagari and Latin, with large loops and voluminous counters.☆11Jan 10, 2020Updated 6 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance☆21Nov 18, 2024Updated last year
- Code for "A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification" (IJCAI 2018)☆23Jul 14, 2018Updated 7 years ago
- Unicode case mapping and character class data for use by TeX☆19Nov 24, 2025Updated 3 months ago
- Text to Speech for Indic languages☆52Mar 23, 2022Updated 3 years ago
- guides and test data for OCR4all☆32Oct 4, 2022Updated 3 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Feb 15, 2023Updated 3 years ago
- Code for the SIGIR 2020 paper "A Unified Dual-view Model for Review Summarization and Sentiment Classification with Inconsistency Loss"☆21Feb 3, 2021Updated 5 years ago
- Template and steps to build your personal blog using Jekyll and Minimal Mistakes☆10Feb 24, 2020Updated 6 years ago
- Benchmark Python and Cython code☆13Jun 13, 2014Updated 11 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- ☆12May 22, 2022Updated 3 years ago
- Some demos about opencv.js☆34Oct 9, 2020Updated 5 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Open Source Speech Inferencing Library for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- ☆26Feb 20, 2026Updated last month
- ☆52Feb 13, 2024Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- ☆10Apr 3, 2024Updated last year
- Homepage of Software Engineering for Machine Learning☆17Feb 4, 2026Updated last month
- This repo has been migrated to https://code.larus.se/lmas/Damerau-Levenshtein☆11Jul 21, 2023Updated 2 years ago
- ☆10Sep 14, 2016Updated 9 years ago
- My solutions for Advanced Python Mastery (course by @dabeaz)☆11Jan 29, 2024Updated 2 years ago
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 2 years ago
- ☆12May 19, 2021Updated 4 years ago
- Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020).☆11May 1, 2025Updated 10 months ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 3 years ago