jourlin / WebCrawlerLinks
An academic open source and open data web crawler
☆27Updated 7 years ago
Alternatives and similar repositories for WebCrawler
Users that are interested in WebCrawler are comparing it to the libraries listed below
Sorting:
- Performance dashboard☆19Updated this week
- run multiple shell commands in parallel and coordinate their output☆31Updated 13 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Updated 9 years ago
- lua state machine taking advantage of first class functions. Made this while reading a book on oop in lua.☆17Updated 7 years ago
- Hash-based password manager☆19Updated 6 years ago
- Stackbuilder builds stacks of virtual machines☆21Updated 8 months ago
- Nutch 2.3.1 plugin for whitelisting/blacklisting specific HTML elements☆14Updated 3 years ago
- Omni scheduler/core engine for Megam Vertice☆13Updated 7 years ago
- Compiler for writing DeepDive applications in a Datalog-like language — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿☆19Updated 8 years ago
- Masques is a distributed social network.☆36Updated 9 years ago
- A Node.js full text search library☆32Updated 14 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- awesome-unikernels☆15Updated 10 years ago
- Interactive MySQL query editor for the terminal with syntax highlighting☆22Updated 6 years ago
- Block median value perceptual hash RFC for URN namespace☆27Updated 5 years ago
- Write tables in the command line.☆17Updated 11 years ago
- Treat curl configuration files as curlrc subcommands.☆11Updated 4 years ago
- Swift middleware for Zerocloud☆53Updated 6 years ago
- Lexical categorization engine for large datasets. Good for NLP and Data Mining.☆104Updated 8 years ago
- SurveyMan programming language.☆46Updated 8 years ago
- A collecton of generic reference counted data structures, tools to create compatible C style classes, and demo applications☆81Updated 2 months ago
- Embeddable Hacker News button + vote counter for your site☆414Updated 6 years ago
- Highly performant version of open-text-summarizer☆38Updated 11 years ago
- Greylock is an embedded search engine which is aimed at index size and performace☆12Updated 8 years ago
- A statically typed binary tree in Go without casts or reflection☆19Updated 12 years ago
- Terminus DB Schemas - Formal descriptions and documentation of all the internal data structures used by Terminus DB☆10Updated 5 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 8 years ago
- A LevelDB-backed RDFLib Store for RDFLib=>6.0☆18Updated last year
- Reference implementation of a Tent server in Ruby☆499Updated 8 years ago
- TimerMetrics captures timings and enables periodic metrics every n events☆15Updated 5 years ago