Training/test data for Dragnet
☆42Jan 29, 2015Updated 11 years ago
Alternatives and similar repositories for dragnet_data
Users that are interested in dragnet_data are comparing it to the libraries listed below
Sorting:
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago
- Just the facts -- web page content extraction☆1,280Jul 8, 2025Updated 7 months ago
- Data science tools from Moz☆23Jan 11, 2017Updated 9 years ago
- Web content extraction using machine learning☆34Mar 3, 2021Updated 4 years ago
- simple implementation of pix2pix by pytorch☆11Jun 2, 2017Updated 8 years ago
- Word embeddings for job postings☆13Dec 8, 2022Updated 3 years ago
- Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18☆170Oct 28, 2021Updated 4 years ago
- Content Extraction via Text Density (SIGIR11)☆25Sep 21, 2015Updated 10 years ago
- A classifier for detecting soft 404 pages☆58Feb 10, 2026Updated 2 weeks ago
- A real pal when you want to add VGG16 to your Keras model.☆27May 17, 2016Updated 9 years ago
- An exercise in unsupervised machine learning: Extract Article's Text in HTml documents.☆431Jan 16, 2026Updated last month
- ☆25Jan 19, 2017Updated 9 years ago
- Web Content Extraction Through Machine Learning☆185Apr 4, 2014Updated 11 years ago
- A python backend to predict prices of candlesticks.☆23Feb 28, 2019Updated 7 years ago
- A dataset that includes photos downloaded from Flickr and annotations that indicates a local window representing a good composition.☆78Feb 10, 2025Updated last year
- Repository for the mijn.amsterdam.nl portal☆11Updated this week
- Implementation of Vision Based Page Segmentation algorithm in Java☆105Oct 25, 2019Updated 6 years ago
- ☆13Sep 13, 2015Updated 10 years ago
- Bootyman deploys and manages large-scale Laravel SaaS applications in self-contained VMs in cloud☆11Jan 3, 2023Updated 3 years ago
- Flask app for monitoring OEE☆11Sep 25, 2023Updated 2 years ago
- Flex 3/4 sample applications to demonstrate usages of the BabelFx (l10nInjection) framework☆20Sep 10, 2016Updated 9 years ago
- A Time Window library for Python.☆13Apr 17, 2020Updated 5 years ago
- An @angular/cli based starter containing common components and services as well as a reference site.☆14Mar 3, 2025Updated 11 months ago
- ☆10Jul 6, 2018Updated 7 years ago
- ZFP compression and decompression compiled to WebAssembly☆11Jan 22, 2024Updated 2 years ago
- A Nigerian online store price comparison website☆12Dec 9, 2022Updated 3 years ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narhwal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago
- Work in progress transmit from Google Code☆1,128Jan 3, 2018Updated 8 years ago
- ☆91Jun 2, 2016Updated 9 years ago
- ☆19Sep 5, 2013Updated 12 years ago
- Html article content extractor in Golang.☆12Oct 31, 2022Updated 3 years ago
- Fun with Firefox 3D view.☆11Nov 13, 2017Updated 8 years ago
- Skype in the terminal, because Skype is too slow☆14Feb 25, 2014Updated 12 years ago
- Reinforcement learning algorithms for path finding☆12Jul 30, 2017Updated 8 years ago
- HlsKit provides strong HLS video conversion features backed by ffmpeg. Prepare your mp4 files for streaming.☆13Aug 8, 2025Updated 6 months ago
- A pipeline framework for data science projects☆10Aug 9, 2022Updated 3 years ago
- Benchmarks of approximate nearest neighbor libraries in Python☆11Jul 31, 2020Updated 5 years ago
- Simple cross-process mutex based on file locks☆10Sep 14, 2017Updated 8 years ago
- Code for the paper "Hone as You Read: A Practical Type of Interactive Summarization"☆12May 6, 2021Updated 4 years ago