A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extract data from them.
☆10Aug 17, 2025Updated 6 months ago
Alternatives and similar repositories for smart-crawler
Users that are interested in smart-crawler are comparing it to the libraries listed below
Sorting:
- Html article content extractor in Golang.☆12Oct 31, 2022Updated 3 years ago
- A python module to process data for Frame Semantic Parsing☆23Nov 3, 2020Updated 5 years ago
- Question Answering via Integer Programming (TableILP)☆28Apr 22, 2016Updated 9 years ago
- Simple automatic reconnecting WebSocket☆12Feb 27, 2023Updated 3 years ago
- An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…☆11Jan 23, 2026Updated last month
- Minimal binary codec for SocketCluster based on pbf☆10Oct 30, 2017Updated 8 years ago
- Simplifies data migration between Apache Ignite clusters by relying on Apache Avro as an intermediate storage format☆13Jun 27, 2023Updated 2 years ago
- 是APEX贡献的一个基于大数据平台能力的数据开发平台,帮助企业以最小成本实现链接数据,构建和沉淀数仓模型,降低数据应用门槛,沉淀数据价值。☆12Oct 31, 2024Updated last year
- RespireNet is an innovative web-based application that harnesses the capabilities of deep learning and Mel-frequency cepstral coefficient…☆10Aug 2, 2023Updated 2 years ago
- Collaborative Discourse Manager☆11Nov 6, 2016Updated 9 years ago
- Time control for simulations☆11Jan 18, 2023Updated 3 years ago
- Flask app for monitoring OEE☆11Sep 25, 2023Updated 2 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Wireless Brother KH-9xx knitting machine connection☆13Sep 3, 2016Updated 9 years ago
- Homebrew tap to install the latest Maven build☆10Updated this week
- node.js app for control of Hanover flipdot display☆10Dec 20, 2025Updated 2 months ago
- 使用vue1.x写的博客(前端部分)☆10Aug 23, 2018Updated 7 years ago
- web crawler☆14Sep 27, 2022Updated 3 years ago
- jquery plugin for soccer field display with players on their positions☆14Jun 2, 2018Updated 7 years ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- PDF table extraction☆10Dec 14, 2021Updated 4 years ago
- Rebalancing a portfolio with optimal buy/sell decisions using Metaheuristics☆12Mar 11, 2021Updated 4 years ago
- Tutorial / template project for a vertx3 REST API that persists in a DB using JDBC☆12Nov 16, 2015Updated 10 years ago
- Simple implementation of a custom parquet reader/writer☆11Aug 12, 2016Updated 9 years ago
- DeepCleaner is a NodeJS module designed to tidy up nasty looking JSON.☆12Mar 2, 2023Updated 3 years ago
- first attempt at description2code from 2016☆10Nov 15, 2018Updated 7 years ago
- 批量监控指定QQ消息窗口并将新消息发送至邮箱☆11Apr 13, 2023Updated 2 years ago
- ☆32Sep 19, 2025Updated 5 months ago
- A css/js coverage tool for websites☆10Nov 25, 2019Updated 6 years ago
- Pirate Trading Platform: Open source automated trading based on algorithmic market evaluation☆13Sep 25, 2017Updated 8 years ago
- This is the code for reproducing the TABBIE baseline in our paper: "Retrieval-Based Transformer for Table Augmentation"☆12Sep 17, 2025Updated 5 months ago
- API to manipulate the states of infrared controlled devices☆10Nov 10, 2023Updated 2 years ago
- ☆19Sep 5, 2013Updated 12 years ago
- init☆13Feb 3, 2021Updated 5 years ago
- Unsupervised Word Discovery☆10Jul 26, 2019Updated 6 years ago
- SQL over RPC, specifically for SQLite☆10Jul 17, 2018Updated 7 years ago
- Widgets JSON for OpenBB Terminal Pro☆15Aug 30, 2024Updated last year
- Automaton & Cognition☆16Apr 14, 2024Updated last year
- A starter application with akka-http and react☆10Dec 12, 2017Updated 8 years ago