TeamHG-Memex/soft404

TeamHG-Memex / soft404

A classifier for detecting soft 404 pages

☆65

Alternatives and similar repositories for soft404

Users who are interested in soft404 are comparing it to the libraries listed below.

TeamHG-Memex / extract-html-diff
Extract the difference between two HTML pages
☆33 · Apr 8, 2026 · Updated 3 months ago
TeamHG-Memex / MaybeDont
A component that tries to avoid downloading duplicate content
☆28 · Apr 8, 2026 · Updated 3 months ago
benhoyt / soft404
Soft 404 (dead page) detector in Python
☆15 · Oct 1, 2018 · Updated 7 years ago
TeamHG-Memex / undercrawler
A generic crawler
☆81 · Apr 8, 2026 · Updated 3 months ago
nicklaslof / searching
Searching 404
☆24 · Sep 12, 2020 · Updated 5 years ago
TeamHG-Memex / autologin-middleware
Scrapy middleware for the autologin
☆36 · Apr 8, 2026 · Updated 3 months ago
Siddhant-K-code / 404-Error-Page---Astronaut
404 Error Page - Astronaut
☆26 · Jan 7, 2020 · Updated 6 years ago
abramhindle / CMPUT404-project-socialdistribution
CMPUT404-project-socialdistribution
☆25 · Apr 6, 2023 · Updated 3 years ago
domuk / 404Wasteland-Chernarus
404Games Wastelands V2 - Chernarus
☆30 · Jun 25, 2013 · Updated 13 years ago
TeamHG-Memex / autopager
Detect and classify pagination links
☆107 · Apr 8, 2026 · Updated 3 months ago
coconauts / 404-games
404-themed canvas games
☆24 · Nov 24, 2022 · Updated 3 years ago
xuanfeng / 404
404 page
☆21 · Sep 29, 2013 · Updated 12 years ago
jckantor / CBE40455
Process Operations
☆43 · May 14, 2023 · Updated 3 years ago
TeamHG-Memex / sitehound-frontend
Site Hound (previously THH) is a Domain Discovery Tool
☆24 · Apr 8, 2026 · Updated 3 months ago
uofa-cmput404 / nodejs-ws-lab
NodeJS & WebSockets lab. University of Alberta CMPUT 404
☆34 · Nov 28, 2023 · Updated 2 years ago
TeamHG-Memex / Formasaurus
Formasaurus tells you the type of an HTML form and its fields using machine learning
☆121 · Apr 8, 2026 · Updated 3 months ago
pjnovas / invaders404
Another custom HTML5 CANVAS 404 error page with the classic game Space Invaders made in JavaScript.
☆54 · Aug 31, 2016 · Updated 9 years ago
M4DM0e / Door404
Web application backdoor builder
☆82 · Jun 9, 2021 · Updated 5 years ago
TeamHG-Memex / url-summary
Show a summary of a large number of URLs in a Jupyter Notebook
☆19 · Apr 8, 2026 · Updated 3 months ago
TeamHG-Memex / tor-proxy
A Tor SOCKS proxy Docker image
☆12 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / webpager
Paginating the web
☆37 · Feb 11, 2014 · Updated 12 years ago
Geta / 404handler
The popular 404 handler for EPiServer, enabling better control over your 404 page in addition to allowing redirects for old urls that no …
☆76 · Jul 7, 2026 · Updated 2 weeks ago
rmax / databrewer
The missing datasets manager. Like Homebrew but for datasets. A CLI tool to search and discover datasets!
☆41 · May 29, 2017 · Updated 9 years ago
TeamHG-Memex / deep-deep
Adaptive crawler which uses Reinforcement Learning methods
☆167 · Apr 8, 2026 · Updated 3 months ago
seagatesoft / webdext
Intelligent Web Data Extractor
☆74 · Dec 5, 2022 · Updated 3 years ago
TeamHG-Memex / aquarium
Splash + HAProxy + Docker Compose
☆195 · Apr 8, 2026 · Updated 3 months ago
FireflyLogic / pewpew
Firefly Logic's 404 game
☆47 · Dec 1, 2015 · Updated 10 years ago
TeamHG-Memex / autologin
A project to attempt to automatically log in to a website given a single seed
☆129 · Apr 8, 2026 · Updated 3 months ago
Edubr2020 / CVE-2021-40444--CABless
Modified code so that we don't need to rely on CAB archives
☆104 · Sep 22, 2021 · Updated 4 years ago
52linglong / 404
404 templates
☆64 · May 11, 2020 · Updated 6 years ago
astorm / MagentoBetter404
A programmer's 404 page for the Magento Ecommerce system.
☆47 · May 23, 2014 · Updated 12 years ago
scrapy / scurl
Performance-focused replacement for Python urllib
☆21 · Apr 13, 2026 · Updated 3 months ago
seomoz / dragnet_data
Training/test data for Dragnet
☆42 · Jan 29, 2015 · Updated 11 years ago
nik0spapp / sdalg
Web page segmentation and noise removal
☆55 · Feb 4, 2024 · Updated 2 years ago
yansheng836 / 404pages
A collection of several custom 404 page templates.
☆53 · Oct 6, 2022 · Updated 3 years ago
HunterXing / 404blog
Personal blog project with a decoupled front end and back end, built with vue.js and node.js
☆83 · Dec 10, 2022 · Updated 3 years ago
danieldulaney / xkcdfs
A FUSE filesystem for browsing the xkcd webcomic
☆14 · Jun 14, 2023 · Updated 3 years ago
redapple / parslepy
Python implementation of the Parsley language for extracting structured data from web pages
☆92 · Oct 26, 2017 · Updated 8 years ago
kootenpv / deep_eye2mouse
Move the mouse with your webcam + eyes
☆21 · Oct 24, 2017 · Updated 8 years ago