TeamHG-Memex/url-summary

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TeamHG-Memex/url-summary)

TeamHG-Memex / url-summary

Show summary of a large number of URLs in a Jupyter Notebook

☆19

Alternatives and similar repositories for url-summary

Users that are interested in url-summary are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TeamHG-Memex / tor-proxy
View on GitHub
a tor socks proxy docker image
☆12Apr 8, 2026Updated 3 months ago
TeamHG-Memex / extract-html-diff
View on GitHub
extract difference between two html pages
☆33Apr 8, 2026Updated 3 months ago
rmax / databrewer
View on GitHub
The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!
☆41May 29, 2017Updated 9 years ago
scrapinghub / page_finder
View on GitHub
Find which links on a web page are pagination links
☆29Jan 12, 2017Updated 9 years ago
rmax / scrapydo
View on GitHub
Crochet-based blocking API for Scrapy.
☆47Feb 24, 2017Updated 9 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
deeppavlov / ru_sentence_tokenizer
View on GitHub
A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.
☆52Jul 4, 2018Updated 8 years ago
TeamHG-Memex / sitehound-frontend
View on GitHub
Site Hound (previously THH) is a Domain Discovery Tool
☆24Apr 8, 2026Updated 3 months ago
passivetotal / maltego_machines
View on GitHub
Machines created to speed up analysis inside of Maltego
☆16Mar 17, 2016Updated 10 years ago
TeamHG-Memex / arachnado
View on GitHub
Web Crawling UI and HTTP API, based on Scrapy and Tornado
☆162Apr 8, 2026Updated 3 months ago
scrapinghub / exporters
View on GitHub
Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations
☆39May 21, 2024Updated 2 years ago
xtannier / WebAnnotator
View on GitHub
WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…
☆48Dec 17, 2021Updated 4 years ago
seagatesoft / webdext
View on GitHub
Intelligent Web Data Extractor
☆74Dec 5, 2022Updated 3 years ago
pcbje / pymtgx
View on GitHub
Python API for generating Maltego mtgx files.
☆18Sep 27, 2016Updated 9 years ago
TeamHG-Memex / undercrawler
View on GitHub
A generic crawler
☆81Apr 8, 2026Updated 3 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
scrapinghub / product-extraction-benchmark
View on GitHub
☆16Apr 10, 2026Updated 3 months ago
stummjr / scrapy-fieldstats
View on GitHub
A Scrapy extension to log items coverage when the spider shuts down
☆18Apr 11, 2020Updated 6 years ago
TeamHG-Memex / html-text
View on GitHub
Extract text from HTML
☆135Apr 8, 2026Updated 3 months ago
jayzeng / dirbot
View on GitHub
Scrapy project to scrape public web directories (educational)
☆22Mar 18, 2017Updated 9 years ago
brianwarehime / mcrits
View on GitHub
Visualize your CRITs IOC's in Maltego
☆12Jan 13, 2015Updated 11 years ago
psapezhka / grafana-dashboards
View on GitHub
Set of useful grafana dashboards
☆14Apr 15, 2021Updated 5 years ago
nadiinchi / HSE_minor_DataAnalysis_seminars_iad16
View on GitHub
Repository with materials for HSE minor students (group iad16)
☆15Jan 26, 2017Updated 9 years ago
WlndyMiller / SteamTransforms
View on GitHub
Maltego transforms for the Steam community
☆13Aug 5, 2017Updated 8 years ago
matthewruttley / mozclassify
View on GitHub
Algorithms for URL Classification
☆19Apr 13, 2015Updated 11 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
deathbybandaid / pimotd
View on GitHub
This tweaks the motd do be much cooler
☆12May 15, 2017Updated 9 years ago
brianwarehime / crt.sh-Maltego-Transforms
View on GitHub
Local Maltego Transforms for crt.sh
☆12Sep 8, 2017Updated 8 years ago
thnyheim / misp2bro
View on GitHub
Python script that gets IOC from MISP and converts it into BRO intel files.
☆13Apr 17, 2016Updated 10 years ago
TeamHG-Memex / soft404
View on GitHub
A classifier for detecting soft 404 pages
☆65Apr 8, 2026Updated 3 months ago
scrapinghub / js2xml
View on GitHub
Convert Javascript code to an XML document
☆188Mar 14, 2022Updated 4 years ago
TeamHG-Memex / scrapy-dockerhub
View on GitHub
[UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.
☆12Apr 8, 2026Updated 3 months ago
PaloAltoNetworks / misp-to-autofocus
View on GitHub
Script for pulling events from a MISP database and converting them to Autofocus queries.
☆13Dec 28, 2015Updated 10 years ago
shritesh / brainfuck-rs-wasm
View on GitHub
A Brainfuck interpreter written in Rust and compiled to WebAssembly
☆10Dec 4, 2017Updated 8 years ago
mrsuh / rent-parser
View on GitHub
☆16Sep 3, 2019Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
UWNetworksLab / satellite
View on GitHub
Satellite: Measuring The Internet's Stars
☆40Sep 2, 2020Updated 5 years ago
crits / mcrits
View on GitHub
CRITs IOC Visualization in Maltego
☆28Jan 8, 2015Updated 11 years ago
CiviCERT / suspicious-email-submitter
View on GitHub
The Suspicious Email Submitter is a discontinued browser extension (Chrome, Chromium, Firefox) for the easy submission of suspicious emai…
☆15Mar 6, 2023Updated 3 years ago
lestrrat-go / urlenc
View on GitHub
Marshal/Unmarshal interface for structs that can encode/decode themselves to URL query strings
☆11Jun 6, 2018Updated 8 years ago
TeamHG-Memex / MaybeDont
View on GitHub
A component that tries to avoid downloading duplicate content
☆28Apr 8, 2026Updated 3 months ago
scrapy / pypydispatcher
View on GitHub
A fork of http://pydispatcher.sourceforge.net/ with PyPy support
☆16Jul 3, 2017Updated 9 years ago
bkj / wit
View on GitHub
Algorithms for "schema matching"
☆26Jul 6, 2016Updated 10 years ago