YahooArchive / anthelion
Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.
☆2,842Updated 9 years ago
Alternatives and similar repositories for anthelion:
Users that are interested in anthelion are comparing it to the libraries listed below
- A proxy-less censorship resistance tool☆983Updated 6 years ago
- The Baidu File System.☆2,856Updated 6 years ago
- [DEPRECATED]Douban CODE☆1,812Updated 4 years ago
- Kids Is Data Stream☆1,224Updated 4 years ago
- A high performance replicated log service. (The development is moved to Apache Incubator)☆2,219Updated 5 years ago
- An Internet-Scale Database.☆1,900Updated 9 months ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,630Updated 2 years ago
- Disque is a distributed message broker☆8,023Updated 3 years ago
- Mass Service Engine in Cluster(MSEC) is opened source by QQ team from Tencent. It is a backend DEV &OPS engine, including RPC,name findin…☆2,743Updated 5 years ago
- Open source version of jianliao.com☆2,717Updated 7 years ago
- Cotton (formerly known as Mysos)☆588Updated 9 years ago
- MySQL performance monitoring and analysis.☆1,439Updated last year
- (Deprecated) Lossless h.264 recoder/recompressor. For newest version see:☆1,067Updated 8 years ago
- Find potential bugs in your services with Diffy☆3,830Updated 4 years ago
- Python clone of Spark, a MapReduce alike framework in Python☆2,684Updated 4 years ago
- A TCP performance profiling tool.☆1,848Updated 7 years ago
- A high availability MySQL cluster that guarantees data consistency between a master and slaves.☆2,464Updated 6 years ago
- SSDB - A fast NoSQL database, an alternative to Redis☆8,205Updated 2 years ago
- Hackpad is a web-based realtime wiki.☆3,551Updated last year
- Augmented Traffic Control: A tool to simulate network conditions☆4,325Updated 6 years ago
- Fastsocket is a highly scalable socket and its underlying networking implementation of Linux kernel. With the straight linear scalability…☆3,754Updated 6 years ago
- Open Machine Intelligence Framework for Hackers. (GPU/CPU)☆5,553Updated 11 months ago
- A Distributed and High-Performance Monitoring System☆3,028Updated 6 years ago
- Enterprise Stream Process Engine☆3,902Updated last year
- Placing labels on a timeline without overlap.☆3,884Updated last year
- Free Web Scraping Tool with Java☆583Updated last year
- Neural network OCR.☆1,128Updated 8 years ago
- Seesaw v2 is a Linux Virtual Server (LVS) based load balancing platform.☆5,656Updated this week
- Microsoft Distributed Machine Learning Toolkit☆2,748Updated 6 years ago
- Build a distributed SQL database from the ground up☆2,149Updated 2 years ago