fcibecchini/smart-crawler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fcibecchini/smart-crawler)

fcibecchini / smart-crawler

A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extract data from them.

☆10

Alternatives and similar repositories for smart-crawler

Users that are interested in smart-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

crawlerclub / ce
View on GitHub
Html article content extractor in Golang.
☆12Oct 31, 2022Updated 3 years ago
stanfordnlp / miniwob-plusplus-demos
View on GitHub
Demos for the MiniWoB++ benchmark
☆21Feb 23, 2018Updated 8 years ago
worker8 / SimpleCurrency
View on GitHub
A simple application that converts currency
☆18Nov 12, 2021Updated 4 years ago
Dev43 / ethinitium
View on GitHub
Eth-initium (ethereum start) is an open source repository for those who want to understand how the ethereum blockchain functions along wi…
☆13Dec 10, 2022Updated 3 years ago
ManagedKube / kubernetes-cost-agent
View on GitHub
Gathers Kubernetes cost information for a cluster
☆13Dec 18, 2018Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
atkaper / k8s-network-test-daemonset
View on GitHub
K8S (kubernetes) Network Test Daemonset - checks integrity of your cluster's virtual network
☆14Oct 29, 2020Updated 5 years ago
lemonrock / rdma-core
View on GitHub
Rust bindings for rdma-core
☆18Apr 4, 2018Updated 8 years ago
danhhz / cargo-stress
View on GitHub
A utility for catching non-deterministic test failures
☆18Jul 23, 2024Updated last year
ExpanseLLC / lambda_authorizer
View on GitHub
Custom Lambda Authorizer for ApiGateway using Node and Promise Pattern
☆13Feb 10, 2019Updated 7 years ago
pfalcon / pymsasid3
View on GitHub
Pure-Python x86 disassembler, ported to modern Python, with bugfixes
☆26Jan 24, 2018Updated 8 years ago
fengjx / open-vue-blog
View on GitHub
使用vue1.x写的博客（前端部分）
☆10Aug 23, 2018Updated 7 years ago
xAlpharax / NeuralODE-Notes-Projects
View on GitHub
Repository for notes, projects and snippets on NODEs. Includes results after training CNN based networks with different methods on MNIST,…
☆12Mar 25, 2023Updated 3 years ago
grammar-team / whale-grammar
View on GitHub
Korean grammar checker extension for whale browser
☆17Mar 8, 2023Updated 3 years ago
iamvee / onion
View on GitHub
Host a Tor hidden service inside a Docker container without exposing any clearnet ports.
☆12Oct 7, 2020Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
tminions / binocularss
View on GitHub
An Android RSS reader built with Kotlin and Jetpack Compose
☆16Jan 25, 2023Updated 3 years ago
Sam-Martin / terraform-aws-config-module
View on GitHub
A Terraform Module for Controlling AWS Config (via CloudFormation)
☆10Mar 30, 2017Updated 9 years ago
jeroenr / kafka-k8s-monitoring
View on GitHub
Demo kafka kubernetes setup + monitoring
☆12Jul 8, 2019Updated 7 years ago
qw3rtman / jbin
View on GitHub
Java Binary Executables
☆17Aug 21, 2019Updated 6 years ago
jda / go-crowd
View on GitHub
Go library for interacting with Atlassian Crowd
☆13Aug 31, 2018Updated 7 years ago
imduffy15 / token-cli
View on GitHub
Command line utility for interacting with OAuth2 infrastructure to generate tokens
☆19Oct 26, 2022Updated 3 years ago
outware / caveman
View on GitHub
Companion application used for dynamically managing the environment variables for a target application.
☆16Mar 9, 2016Updated 10 years ago
aRestless / keycloak-openid2-steam
View on GitHub
A minimal Keycloak IdentityProvider implementation of OpenID2 for Steam
☆18May 2, 2019Updated 7 years ago
bmatthews68 / ldapunit
View on GitHub
Simplifies the task of creating unit tests that depend on an LDAP directory server.
☆12Jun 13, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Lauriapple1 / kmsclient
View on GitHub
Helper script to encrypt and decrypt data using Amazon KMS
☆14Aug 13, 2015Updated 10 years ago
akb89 / pyfn
View on GitHub
A python module to process data for Frame Semantic Parsing
☆23Nov 3, 2020Updated 5 years ago
Xlythe / FloatingView
View on GitHub
Quick and simple library for creating floating views and activities
☆20Jan 7, 2024Updated 2 years ago
apache / cloudstack-ec2stack
View on GitHub
Apache CloudStack EC2 Stack
☆15Jan 30, 2023Updated 3 years ago
fedorovdima / aws-gameday
View on GitHub
☆10Oct 15, 2019Updated 6 years ago
kobotoolbox / collect
View on GitHub
Kobo's version of ODK Collect, for use with KoboToolbox
☆23May 24, 2026Updated last month
anacrolix / sqlrpc
View on GitHub
SQL over RPC, specifically for SQLite
☆10Jul 17, 2018Updated 8 years ago
thomasdarimont / wjax2018-spring-keycloak
View on GitHub
Code & Slides for my "Securing Spring Apps with Keycloak" talk at WJAX 2018
☆14Dec 6, 2018Updated 7 years ago
yashprakash13 / Gesty
View on GitHub
A basic Android eBook Reader app with hands off reading feature - use gestures to control the page turn and use automatic scrolling for p…
☆13Apr 8, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kafri8889 / Calculator-Compose
View on GitHub
Calculator app made with Jetpack Compose
☆16Jun 3, 2022Updated 4 years ago
gansidui / bktree
View on GitHub
bk-tree for golang
☆11Jul 30, 2022Updated 3 years ago
arunkumar9t2 / trie
View on GitHub
A Java implementation of the Trie data structure
☆23May 13, 2018Updated 8 years ago
improbable-eng / vault-kv-extract
View on GitHub
A script for migrating hidden Vault secrets out of an etcd storage backend
☆19Jan 28, 2021Updated 5 years ago
panchdevs / srs
View on GitHub
Software Requirement Specification for the Twitter Sentiment Analysis project
☆13Feb 11, 2015Updated 11 years ago
wgorder / spring-cloud-examples
View on GitHub
Spring Cloud example application
☆10Oct 7, 2015Updated 10 years ago
vobst / BPFVol3
View on GitHub
Linux BPF plugins for Volatility3
☆23Jan 19, 2024Updated 2 years ago