readbeyond / lachesis
lachesis automates the segmentation of a transcript into closed captions
☆33 Updated 8 years ago
Alternatives and similar repositories for lachesis:
Users who are interested in lachesis are comparing it to the libraries listed below:
- A simple OS X desktop app built in Electron, using Aeneas under the hood to generate caption files from media (audio or video) and plain … ☆30 Updated 2 years ago
- C-SPAN opened captions Node buffer server + Google Docs Apps Script ☆8 Updated 5 years ago
- A library that helps you to convert from one subtitle format to another ☆19 Updated 6 years ago
- ☆14 Updated 2 years ago
- This is a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th… ☆44 Updated 2 years ago
- NW.js-based OS X desktop application that, given a video/audio file, returns a transcription using the IBM Watson Speech to Text API ☆41 Updated 8 years ago
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player ☆145 Updated 4 months ago
- A Raspberry Pi 64-bit image with spaCy and neuralcoref pre-installed ☆21 Updated 5 years ago
- Convert a corpus of PDFs to clean text files on a distributed architecture ☆38 Updated last year
- Convert Kindle Clippings to Objects that conform with W3C Web Annotation Vocabulary ☆14 Updated 7 years ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio… ☆68 Updated last year
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆45 Updated 7 years ago
- A client-side transcription text editor to proofread and correct the text before realignment back on the server. ☆19 Updated 7 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization ☆13 Updated 7 years ago
- Recognition Models for Kraken and CLSTM ☆14 Updated 5 years ago
- Python port for IWNLP.Lemmatizer ☆17 Updated last year
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆48 Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- Ergonomic line-by-line transcription of scanned text.