jourlin / WebCrawlerLinks
An academic open source and open data web crawler
☆27Updated 8 years ago
Alternatives and similar repositories for WebCrawler
Users that are interested in WebCrawler are comparing it to the libraries listed below
Sorting:
- Rich browser-based frontend for elasticsearch☆102Updated 10 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- pythonic filesystem library☆35Updated 13 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- An HTTP dashboard for Godot2.☆16Updated 9 years ago
- Generate Microsoft Word resumes from JSON Resume data☆32Updated 9 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆55Updated 4 years ago
- Personal finance management for Node.js hackers and Google Docs users☆50Updated 9 years ago
- A full-text search engine in the browser☆22Updated 8 years ago
- detect and publish postgres events on a zeromq PUB socket☆34Updated 10 years ago
- Deprecated, use https://github.com/mozilla-services/iprepd☆15Updated 7 years ago
- Hourly Data Dump of Hacker News (since 2006-10-09)☆50Updated 6 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Updated 2 years ago
- Swift middleware for Zerocloud☆53Updated 7 years ago
- This is the facade for installation and access to the individual components☆15Updated last week
- SurveyMan programming language.☆46Updated 9 years ago
- The User Activity Logging Engine, or User-ALE, is a logging mechanism used to quantitatively assess the behavioural and cognitive state o…☆13Updated 9 years ago
- [Deprecated] JavaScript SDK for Voucherify - coupons, vouchers, promo codes☆59Updated 2 years ago
- ☆31Updated 10 years ago
- Lexical categorization engine for large datasets. Good for NLP and Data Mining.☆107Updated 9 years ago
- Pirate Trading Platform: Open source automated trading based on algorithmic market evaluation☆13Updated 8 years ago
- Ruby SDK for the Kevel Management API☆43Updated 2 years ago
- A module that processes new Edgar filings and sends out notifications☆14Updated 10 years ago
- REST API for Text Summarization and Keywords Extraction☆16Updated 3 years ago
- An NX Hacker News clone with real-time updates and animations.☆58Updated 8 years ago
- Traptor -- A distributed Twitter feed☆26Updated 3 years ago
- Natural Language Generator for Python☆27Updated 8 years ago
- A streaming real-time event processor based on Riemann written in Node.js -- Streams2/3 edition☆19Updated 9 years ago
- The Social Harvest server that exposes an API and harvests data from the web to be analyzed.☆114Updated 10 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated last week