caltechlibrary / documentaristLinks
Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
☆12Updated 3 years ago
Alternatives and similar repositories for documentarist
Users that are interested in documentarist are comparing it to the libraries listed below
Sorting:
- ☆12Updated last year
- Post-processing OCR errors with seq2seq models☆28Updated 5 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Updated 3 weeks ago
- ☆15Updated last year
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.☆102Updated last year
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 5 years ago
- Web-based tool for straight-forward class annotation of audio files☆11Updated 5 years ago
- Deeplearing based Reverse Image Search using Annoy library☆15Updated 6 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆80Updated last week
- A tidy and complete archive of metadata for papers on arxiv.org, 1993-2019☆28Updated 6 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- Visualize large text collections with WebGL☆27Updated last year
- Tools for using OpenAI Codex to do various useful things☆48Updated 4 years ago
- A toolset for handwriting recognition☆71Updated 2 years ago
- An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.☆20Updated 2 years ago
- Automatically install missing Python modules using pip at import time.☆19Updated 2 years ago
- Transcribes and summarizes speech or audio☆36Updated 4 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 7 years ago
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- A text generation Transformer model trained on Reddit posts.☆16Updated 3 years ago
- ☆20Updated 4 years ago
- Text classification automl☆21Updated 4 years ago
- Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.☆27Updated 4 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Python package for converting xml and epubs to text files☆33Updated 5 years ago
- Using Conditional Random Fields for segmenting Latin words written in scriptio continua☆10Updated 7 years ago
- Instagram-like filters with deep learning☆57Updated last year