scrapinghub / product-extraction-benchmark
☆16 · Apr 24, 2024 · Updated last year
Alternatives and similar repositories for product-extraction-benchmark
Users who are interested in product-extraction-benchmark are comparing it to the libraries listed below.
- A component that tries to avoid downloading duplicate content ☆27 · Feb 8, 2026 · Updated last week
- Show summary of a large number of URLs in a Jupyter Notebook ☆17 · Updated this week
- Scrapy middleware for the autologin ☆36 · Updated this week
- Python client for Zyte API ☆28 · Updated this week
- Page Object pattern for Scrapy ☆126 · Jan 28, 2026 · Updated 2 weeks ago
- Python implementation of WHATWG URL Living Standard ☆21 · Jun 20, 2024 · Updated last year
- Web scraping Page Objects core library ☆104 · Jan 27, 2026 · Updated 2 weeks ago
- Price and currency parsing utility ☆27 · Mar 6, 2023 · Updated 2 years ago
- Extract the difference between two HTML pages ☆32 · Updated this week
- The missing datasets manager. Like Homebrew but for datasets. CLI tool for searching and discovering datasets! ☆41 · May 29, 2017 · Updated 8 years ago
- 🌩️ The Deep Learning framework based on Lightning ☆11 · Dec 11, 2025 · Updated 2 months ago
- Django application for Loginza service ☆39 · Oct 2, 2014 · Updated 11 years ago
- DeepAlign: Alignment-based Process Anomaly Correction Using Recurrent Neural Networks ☆10 · Mar 25, 2023 · Updated 2 years ago
- Content classification/clustering through language processing ☆25 · Mar 10, 2012 · Updated 13 years ago
- A collection of personal ZAP scripts ☆13 · Apr 10, 2023 · Updated 2 years ago
- A generic crawler ☆78 · Updated this week
- Faster replacement for Python's urlparse module ☆45 · Sep 30, 2018 · Updated 7 years ago
- Default Twisted does not ship with a CONNECT-enabled HTTP(s) proxy. This code provides one. ☆51 · Feb 21, 2017 · Updated 8 years ago
- Expected edit distance implementation using OpenFst tools ☆11 · May 13, 2015 · Updated 10 years ago
- Python version of Cuki IR generator ☆13 · Feb 19, 2023 · Updated 2 years ago
- Provides syntax highlighting for Apptainer/Singularity definition files. ☆10 · Dec 24, 2025 · Updated last month
- Python wrapper of axel, a light command line download accelerator ☆10 · Mar 26, 2017 · Updated 8 years ago
- A collection of examples, tests and documentation for building a real-time web app with Python Tornado. ☆18 · Feb 26, 2014 · Updated 11 years ago
- Benchmark of common hash functions ☆10 · Sep 15, 2019 · Updated 6 years ago
- A semantic food search web application built with Django, Solr, SBERT, and Docker ☆10 · Apr 14, 2025 · Updated 10 months ago
- DEFCON-RUSSIA WEB ☆12 · Mar 30, 2021 · Updated 4 years ago
- Prefect integrations with Microsoft Planetary Computer. ☆11 · Jul 15, 2024 · Updated last year
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome… ☆14 · May 8, 2024 · Updated last year
- HTTP Shell is a CLI tool based on the Kui framework that provides developers a modern alternative to http clients for interacting with AP… ☆12 · Dec 17, 2020 · Updated 5 years ago
- It is just like localStorage - but built on top of Cache API ☆11 · Dec 28, 2015 · Updated 10 years ago
- Fito is a Python library that helps to organize your data so you can access it in a more understandable and easy way ☆10 · Feb 26, 2018 · Updated 7 years ago
- Deploy Dask on Marathon ☆10 · Feb 6, 2017 · Updated 9 years ago
- Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on t… ☆19 · Nov 26, 2025 · Updated 2 months ago
- Discrete event simulation and computational models for engine block manufacturing at a marine propulsion company ☆10 · Jul 20, 2018 · Updated 7 years ago
- A blockchain simulator based on SimPy in Python. ☆14 · Dec 18, 2018 · Updated 7 years ago
- Create your custom Qt + PyQt SDK for multiple platforms ☆10 · Jun 7, 2019 · Updated 6 years ago
- Linux /proc data in a consistent, parsed format. ☆10 · Mar 28, 2016 · Updated 9 years ago
- Extension for the SenTestingKit for asynchronous testing ☆104 · May 20, 2013 · Updated 12 years ago
- Wrapper to run 2to3 automatically at import time ☆13 · Dec 9, 2011 · Updated 14 years ago