benetech / VideoDeduplication
☆35Updated last year
Alternatives and similar repositories for VideoDeduplication:
Users that are interested in VideoDeduplication are comparing it to the libraries listed below
- A collection of code, data and information related to our audit of TikTok.☆21Updated last week
- A large-scale curated dataset of Visual.ly infographics with metadata and additional crowdsourced annotations for research applications i…☆30Updated 6 years ago
- An ICIJ app to conduct data validation and cleaning.☆20Updated 3 weeks ago
- Parse a video directory to create one image every n seconds, then identify duplicate images and show possible video duplicates for manual…☆10Updated 4 years ago
- Command line utility to manipulate faces in videos and images☆52Updated 4 years ago
- Codec is a collaborative tool for managing video evidence.☆64Updated 10 months ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated last year
- Attempt to use perceptual hash (pHash) to segment a video into "scenes" very quickly (Normally under a minute for hour long HD videos).☆46Updated 2 years ago
- Data model and processing tools for investigative entity data☆224Updated this week
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Template repository and README for submissions to Bellingcat's Global Hackathon☆16Updated 2 years ago
- Pre-built Scrapy spiders for AutoExtract☆19Updated 9 months ago
- Grabbing all news.☆62Updated 5 years ago
- Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs☆15Updated 5 years ago
- Backend for the search engine service in Liquid Investigations.☆20Updated 4 months ago
- Deeplearing based Reverse Image Search using Annoy library☆17Updated 5 years ago
- Computer Vision Segmentation for Document Layout Analysis☆10Updated 2 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆15Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆39Updated last month
- 🧐 Scrape your friends' Facebook photos☆25Updated 4 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 4 months ago
- ImagePlot is a free software tool that visualizes collections of images and video of any size. It is implemented as a macro which works w…☆67Updated 7 years ago
- Scrape VK media☆57Updated last year
- Find duplicate files☆23Updated 2 years ago
- Deep Neural Network - Automatic selection of Thumbnails for Videos☆34Updated 6 years ago
- A verification “Swiss army knife” helping journalists, fact-checkers, and human rights defenders to save time and be more efficient in th…☆30Updated this week
- Docker Container for a Make-based, PDF extraction using OCR☆12Updated 6 months ago
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆67Updated 3 weeks ago
- CLI utility to find near duplicate images and remove all but the best copy.☆160Updated this week
- ☆10Updated 5 years ago