mathigatti / img2txt
Easy formatted text extraction from images using Google Vision API
★41, updated 3 years ago
Alternatives and similar repositories for img2txt:
Users who are interested in img2txt are comparing it to the libraries listed below.
- Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼 (★23, updated this week)
- (★20, updated 2 years ago)
- Extract dates from text (★64, updated 3 years ago)
- (★15, updated 3 years ago)
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks. (★18, updated 4 years ago)
- A python library for extracting text from PDFs without losing the formatting of the PDF content. (★75, updated 3 years ago)
- A Python package to get useful information from documents using TopicRank Algorithm. (★16, updated last year)
- Find duplicate text files. (★12, updated this week)
- datasets with text data for use in NLP, Text analysis, information extraction, ML research. (★16, updated 5 years ago)
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text. (★45, updated 4 years ago)
- (★33, updated 5 years ago)
- (★22, updated 3 years ago)
- semantically distinct key phrase extraction using hilbert hashes. (★48, updated 2 years ago)
- Text similarity using BERT sentence embeddings (★20, updated 4 years ago)
- An example of how to use spaCy for extremely large files without running into memory issues (★36, updated 2 years ago)
- Web App Capable of Predicting Next Word Using BERT (★14, updated 2 years ago)
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2 (★10, updated 4 years ago)
- Generate multiple choice fill-in-the-blank questions from any article. (★13, updated 2 years ago)
- Automated paraphrases Generation (★36, updated 2 years ago)
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… (★15, updated last year)
- A PyPI package for easy text annotation in a Jupyter Notebook. (★28, updated 3 years ago)
- CorrectLy - Open Source Spelling & Grammar correction (★39, updated 2 years ago)
- Upload an image of a document and extract text, names, facts and figures (★21, updated 5 months ago)
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras… (★11, updated 6 years ago)
- Machine Learning-assisted correction of OCR errors in historical corpora (★9, updated 2 months ago)
- Document Search Engine project with TF-IDF and Google universal sentence encoder model (★53, updated last year)
- This repository aims to implement an Image Search engine powered by the CLIP model. (★40, updated 2 years ago)
- (★11, updated 4 years ago)
- Keyword extraction with spaCy (★31, updated 3 years ago)
- Pre-built Scrapy spiders for AutoExtract (★19, updated 8 months ago)