mathigatti / img2txt
Easy formatted text extraction from images using Google Vision API
☆41Updated 4 years ago
Alternatives and similar repositories for img2txt
Users who are interested in img2txt are comparing it to the libraries listed below.
- Extract dates from text☆65Updated 4 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆78Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 8 years ago
- ☆19Updated 3 years ago
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.☆46Updated 5 years ago
- Document processing using transformers☆22Updated 2 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 6 months ago
- Find duplicate text files.☆15Updated 9 months ago
- Python OCR using Tesseract with the EAST OpenCV detector☆42Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆119Updated last year
- Apply different text recognition services to images of handwritten documents.☆187Updated 2 years ago
- A system for reading scanned documents and grouping them into high level topics☆14Updated 5 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 5 months ago
- Python library to extract tabular data from images and scanned PDFs☆283Updated last year
- Tutorial for Topic Modelling using PySpark and Spark NLP☆17Updated 5 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 4 years ago
- ☆32Updated 7 years ago
- A simple text reuse detection CLI tool.☆136Updated last year
- Python 3 library for processing historical English☆67Updated last year
- German sentiment scores with SentiWS as extension for spaCy☆38Updated 2 years ago
- find any kind of occupation or job title in a text or file☆84Updated last year
- sentiment analysis using spacy☆11Updated 3 years ago
- Named entity recognition for the legal domain☆42Updated 4 years ago
- CorrectLy - Open Source Spelling & Grammar correction☆42Updated 2 years ago
- Word2Vec encodings based search engine for Stackoverflow questions☆26Updated 2 years ago
- Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼☆22Updated 9 months ago
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use. For examp…☆41Updated 4 years ago