ShairozS / Scan2TopicLinks
A system for reading scanned documents and grouping them into high level topics
☆14Updated 5 years ago
Alternatives and similar repositories for Scan2Topic
Users that are interested in Scan2Topic are comparing it to the libraries listed below
Sorting:
- ☆21Updated 4 years ago
- Easy formatted text extraction from images using Google Vision API☆41Updated 4 years ago
- Apply different text recognition services to images of handwritten documents.☆184Updated 2 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 4 months ago
- A curated list of my GitHub stars!☆36Updated this week
- A work automation tool that includes an email parser and report writer☆24Updated 4 years ago
- Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization☆41Updated 2 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆72Updated last week
- BFSI sectors deal with lots of unstructured scanned documents which are archived in document management systems for further use.For examp…☆41Updated 4 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆25Updated 4 years ago
- A Named Entity Recognition system that extracts soft skills from text☆27Updated last year
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆13Updated 4 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Custom recipe and utilities for document processing☆199Updated 3 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Updated 3 years ago
- Applying BERT for named entity recognition on resumes.☆68Updated 2 years ago
- Classification of scientific papers☆24Updated 2 years ago
- This repo is about the classification of rhetorical roles in Legal Documents such as: Citation, Findings of Fact, Evidence, Legal Rule, R…☆15Updated 3 years ago
- Document Search Engine Tool☆74Updated 2 years ago
- 😎 A community-curated list of awesome lawtech software and learning resources for legal technology and design.☆28Updated 5 years ago
- Extract text from your DOCX documents.☆11Updated last year
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semantic…☆14Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 6 years ago
- Dataset and pre-trained model for Skill2vec☆82Updated last year
- Summarize text provided in a PDF file☆26Updated 6 years ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆27Updated 6 months ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- Use Google's state-of-the-art T5 pre-train model to create human-like summarization☆24Updated 4 years ago
- Transcribe Voice File to Text☆21Updated 4 years ago