YahooArchive / anthelion
Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.
☆2,843 Updated 9 years ago
Alternatives and similar repositories for anthelion:
Users who are interested in anthelion are comparing it to the repositories listed below:
- A proxy-less censorship resistance tool ☆983 Updated 7 years ago
- [DEPRECATED] Douban CODE ☆1,811 Updated 4 years ago
- Kids Is Data Stream ☆1,223 Updated 4 years ago
- MySQL performance monitoring and analysis. ☆1,439 Updated 2 years ago
- A high performance replicated log service. (The development is moved to Apache Incubator) ☆2,218 Updated 5 years ago
- Quicksheet for Algorithms ☆826 Updated 5 months ago
- Cotton (formerly known as Mysos) ☆588 Updated 9 years ago
- Multi-path Tunnel ☆1,249 Updated 9 years ago
- Free Web Scraping Tool with Java ☆582 Updated last year
- A high-level distributed crawling framework. ☆1,505 Updated 2 years ago
- (Deprecated) Lossless h.264 recoder/recompressor. For newest version see: ☆1,067 Updated 8 years ago
- A secure socket tunnel that works on getqujing.com ☆1,654 Updated 8 years ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter ☆3,625 Updated 2 years ago
- Actor Messaging platform ☆3,278 Updated 3 years ago
- Experiments and proposals for gRPC features. ☆1,068 Updated 3 years ago
- The Baidu File System. ☆2,861 Updated 6 years ago
- Python clone of Spark, a MapReduce-like framework in Python ☆2,682 Updated 4 years ago
- A TCP performance profiling tool. ☆1,848 Updated 7 years ago
- Open source version of jianliao.com ☆2,717 Updated 7 years ago
- "XcodeGhost" Source ☆1,922 Updated 9 years ago
- A machine learning package built for humans. ☆4,793 Updated 7 months ago
- A high availability MySQL cluster that guarantees data consistency between a master and slaves. ☆2,465 Updated 6 years ago
- SSDB - A fast NoSQL database, an alternative to Redis ☆8,214 Updated 2 years ago
- Mass Service Engine in Cluster (MSEC) is open-sourced by the QQ team at Tencent. It is a backend DEV & OPS engine, including RPC, name findin… ☆2,745 Updated 5 years ago
- Seesaw v2 is a Linux Virtual Server (LVS) based load balancing platform. ☆5,661 Updated this week
- Check which WeChat friends have deleted you ☆4,787 Updated 4 years ago
- Instructions for setting up the software on your deep learning machine ☆1,976 Updated 6 years ago
- Crawler for the website "看知乎" (Kanzhihu) ☆881 Updated 7 years ago
- Augmented Traffic Control: A tool to simulate network conditions ☆4,323 Updated 7 years ago
- "Hello, I am your personal health companion" ☆341 Updated 2 years ago