mathigatti / img2txt
Easy formatted text extraction from images using Google Vision API
☆41Updated 3 years ago
Alternatives and similar repositories for img2txt:
Users that are interested in img2txt are comparing it to the libraries listed below
- Extract dates from text☆64Updated 4 years ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen☆27Updated 4 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- ☆34Updated 5 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆77Updated 3 years ago
- CorrectLy - Open Source Spelling & Grammar correction☆40Updated 2 years ago
- Named entity recognition for the legal domain☆42Updated 3 years ago
- Labeled segmentation for the document structure of printed books☆13Updated 7 years ago
- Scripts for building a geo-located web corpus using Common Crawl data☆11Updated 2 weeks ago
- Python 3 library for processing historical English☆67Updated 8 months ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- Metaphor detection using NLP techniques, made in Python using NLTK☆18Updated 11 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.☆45Updated 4 years ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆58Updated 2 years ago
- Python wrapper for xpdf☆19Updated 5 years ago
- ☆14Updated 2 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated this week
- Statistical text analysis and semantic networks with Python☆14Updated 7 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆53Updated last year
- SpacyV3 Text Categorizer Tutorial☆17Updated 4 years ago
- BERT semantic search engine for searching literature research papers for coronavirus covid-19 in google colab☆31Updated 5 years ago
- Find duplicate text files.☆14Updated 3 months ago
- Embedding Visualizer (EmbedViz) data app made with Streamlit library☆22Updated 4 years ago
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)☆20Updated 7 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago