VIDA-NYU/ache

ACHE is a web crawler for domain-specific search.

☆484

Alternatives and similar repositories for ache

Users who are interested in ache are comparing it to the repositories listed below.

VIDA-NYU / memex
☆13 · Nov 30, 2015 · Updated 10 years ago

chrismattmann / imagecat
ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image…
☆96 · Aug 26, 2018 · Updated 7 years ago

mitll / topic-clustering
☆44 · Jan 15, 2016 · Updated 10 years ago

mitll / MITIE
MITIE: library and tools for information extraction
☆29 · Jan 22, 2015 · Updated 11 years ago

darpa-i2o / memex-program-index
A list of memex-related tools and their repository URLs
☆158 · Feb 17, 2018 · Updated 8 years ago

nasa-jpl-memex / memex-explorer
Viewers for statistics and dashboarding of Domain Search Engine data
☆128 · Jan 19, 2016 · Updated 10 years ago

Sotera / Datawake
Browser add-on and web server to support collection and analysis of web browsing data.
☆14 · Mar 9, 2016 · Updated 10 years ago

NextCenturyCorporation / dig
Faceted search engine for domain-specific exploration of the Web
☆45 · Feb 10, 2017 · Updated 9 years ago

VIDA-NYU / domain_discovery_tool_deprecated
Seed acquisition tool to bootstrap focused crawlers
☆23 · Apr 24, 2017 · Updated 9 years ago

mille856 / CMU_memex
☆20 · Nov 1, 2017 · Updated 8 years ago

pymonger / facetview-memex
Facet Search interface for MEMEX.
☆13 · Feb 26, 2015 · Updated 11 years ago

IlyasHabeeb / Machine_Learning_Focused_Crawler
A focused web crawler that uses Machine Learning to fetch better relevant results.
☆13 · Jan 12, 2019 · Updated 7 years ago

wKovacs64 / pwl
Password Lense: reveal character types in a password
☆23 · Oct 18, 2025 · Updated 9 months ago

TeamHG-Memex / deep-deep
Adaptive crawler which uses Reinforcement Learning methods
☆167 · Apr 8, 2026 · Updated 3 months ago

nasa-jpl-memex / memex-gate
General Architecture for Text Engineering
☆50 · Mar 23, 2016 · Updated 10 years ago

VIDA-NYU / auctus
Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus
☆44 · May 12, 2025 · Updated last year

nasa-jpl-memex / image_space
Interactive Image similarity and Visual Search and Retrieval application
☆95 · Apr 16, 2024 · Updated 2 years ago

TeamHG-Memex / autologin
A project to attempt to automatically login to a website given a single seed
☆129 · Apr 8, 2026 · Updated 3 months ago

nasa-jpl-memex / topic_space
Topic modeling web application
☆40 · Jul 23, 2015 · Updated 10 years ago

usc-isi-i2 / etk
Extraction Toolkit
☆83 · Nov 18, 2021 · Updated 4 years ago

VIDA-NYU / urban-data-study
☆12 · Apr 7, 2015 · Updated 11 years ago

TeamHG-Memex / undercrawler
A generic crawler
☆81 · Apr 8, 2026 · Updated 3 months ago

nixwizard / kube-alien
☆25 · May 9, 2021 · Updated 5 years ago

Andromeda1957 / LinPwn
Interactive Post Exploitation Tool
☆37 · Oct 1, 2019 · Updated 6 years ago

TeamHG-Memex / Formasaurus
Formasaurus tells you the type of an HTML form and its fields using machine learning
☆121 · Apr 8, 2026 · Updated 3 months ago

mandiant / flashmingo
Automatic analysis of SWF files based on some heuristics. Extensible via plugins.
☆120 · Jun 19, 2019 · Updated 7 years ago

TeamHG-Memex / extract-html-diff
extract difference between two html pages
☆33 · Apr 8, 2026 · Updated 3 months ago

Sotera / DatawakeDepot
Loopback web application for administration of Datawake networks
☆10 · May 2, 2017 · Updated 9 years ago

ElevenPaths / PESTO
☆26 · Apr 5, 2020 · Updated 6 years ago

corumir / Practical-Tradecraft
Resources, articles, thoughts, datasets, papers on TI tradecraft
☆10 · Aug 24, 2018 · Updated 7 years ago

SabreOSS / conf4j
Conf4j Type-safe Configuration Library for Java
☆13 · Dec 2, 2024 · Updated last year

smirnovvad / rbuster
yet another dirbuster
☆18 · Jan 14, 2021 · Updated 5 years ago

Stahlz / JQShell
A weaponized version of CVE-2018-9206
☆62 · Oct 30, 2018 · Updated 7 years ago

Kitware / SMQTK
Python toolkit for pluggable algorithms and data structures for multimedia-based machine learning.
☆79 · Jul 28, 2025 · Updated 11 months ago

chrisallenlane / novahot
A webshell framework for penetration testers.
☆301 · Aug 10, 2025 · Updated 11 months ago

apache / stormcrawler
A scalable, mature and versatile web crawler based on Apache Storm
☆986 · Updated this week

aglahe / dsra-dcos
Data Science Research Architecture, Data Center OS
☆21 · May 12, 2016 · Updated 10 years ago

ThunderGunExpress / ThunderQuery
☆12 · Apr 21, 2019 · Updated 7 years ago

scrapinghub / autologin
A project to attempt to automatically login to a website given a single seed
☆11 · Jun 17, 2024 · Updated 2 years ago