code and data used to build a training dataset for dragnet models
☆10Nov 29, 2020Updated 5 years ago
Alternatives and similar repositories for dragnet_data
Users that are interested in dragnet_data are comparing it to the libraries listed below
Sorting:
- Training/test data for Dragnet☆42Jan 29, 2015Updated 11 years ago
- Web content extraction using machine learning☆34Mar 3, 2021Updated 5 years ago
- 3d Bin Packing - Currently focusing primarily on 3D-Knapsack problem in packing☆10Jul 20, 2020Updated 5 years ago
- Software mention extraction and linking from scientific articles☆14Sep 2, 2022Updated 3 years ago
- Pretrained parameters for CT deep learning models.☆13Sep 24, 2019Updated 6 years ago
- 📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!☆19Dec 8, 2022Updated 3 years ago
- Support Next.js redirects in Cloudflare Pages☆18Nov 27, 2022Updated 3 years ago
- Handwritten Digit Recognition using Softmax Regression in Python☆13Sep 5, 2018Updated 7 years ago
- Segment a HTML document into structural data☆12Jan 15, 2019Updated 7 years ago
- Translation of query languages to serialized KoralQuery protocol☆14Mar 9, 2026Updated last week
- ✂️⚽🖼️ Object Removal without Machine Learning☆21Nov 26, 2019Updated 6 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Web API proposal for receiving shared data☆18Aug 6, 2018Updated 7 years ago
- 🛒 A scraping tool for Finn.no.☆12May 31, 2022Updated 3 years ago
- Dynamixel Interface board for communicating a Dynamixel motor with your preferred MCU.☆16Jan 15, 2025Updated last year
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 8 months ago
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 5 months ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated 3 weeks ago
- Active Speaker Detection☆19Jun 19, 2020Updated 5 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog!☆16Oct 30, 2024Updated last year
- Fingerprint Authentication using BiometricPrompt Compat☆12Jun 6, 2019Updated 6 years ago
- Jupytext talk at PyParis 2018☆11Dec 10, 2018Updated 7 years ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- Programmatically instantiate and modify Firebase instances.☆19Feb 14, 2017Updated 9 years ago
- Specification for a query language to request Verifiable Presentations from wallets etc.☆10Jan 13, 2026Updated 2 months ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- 智能制造工业AI Top2解决方案☆20Aug 22, 2018Updated 7 years ago
- ☆15Feb 19, 2016Updated 10 years ago
- IPLD Schema Implementation: parser and utilities☆16Mar 6, 2026Updated 2 weeks ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- An IETF specification for cryptographic hyperlinking☆15May 2, 2021Updated 4 years ago
- You've made the list, we'll help you check it twice. Given a domain-like string, verifies inclusion in a list you provide.☆19Nov 13, 2020Updated 5 years ago
- External link tracking tool for Wikimedia partnerships☆11Oct 3, 2025Updated 5 months ago
- SQL over RPC, specifically for SQLite☆10Jul 17, 2018Updated 7 years ago
- Code for "Web Page Segmentation Revisited: Evaluation Framework and Dataset", accepted as resources paper to CIKM 2020☆14Jan 13, 2023Updated 3 years ago
- Modular tool that extracts images and labels from multiple datasets and parses them to Darknet format.☆32Jun 20, 2019Updated 6 years ago
- Test whether W3C spec repos match a set of best practices☆21Updated this week
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆20Jul 5, 2024Updated last year