caltechlibrary / documentarist
Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
☆12Updated 3 years ago
Alternatives and similar repositories for documentarist
Users who are interested in documentarist compare it to the repositories listed below.
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated 2 years ago
- Apply different text recognition services to images of handwritten documents.☆188Updated 3 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Updated last month
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆34Updated 2 years ago
- Transcribes and summarizes speech or audio☆36Updated 4 years ago
- Python tools for Tesseract OCR training☆26Updated 3 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 7 years ago
- Visual search interface☆11Updated 4 years ago
- Deploy DL/ML inference pipelines with minimal extra code.☆102Updated last year
- Text classification AutoML☆21Updated 4 years ago
- Rhyme with AI☆45Updated 5 years ago
- Deep learning-based reverse image search using the Annoy library☆15Updated 6 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- Collect the Best Papers from the Top Conferences, also including statistics and visualization keywords of accepted papers from Top Confer…☆16Updated 5 years ago
- Experiments with Hugging Face 🔬 🤗☆46Updated last year
- A machine learning tool for creating a training dataset quickly and easily using a smart Chrome extension☆14Updated 2 years ago
- Take any phone-taken picture and turn it into a document scan.☆96Updated last year
- Extract information from XBRL files in the ESEF format☆13Updated last month
- Utilities for working with videos☆13Updated 7 months ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 3 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Updated this week
- Collection of tools to extract features from film material.☆41Updated 7 years ago
- Matplotlib Image labeller for classifying images☆11Updated last month
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 10 months ago
- Creating a simple recommendation system on the basis of similarity☆11Updated 7 years ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago
- Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼☆22Updated last week