Fureteur is a simple, configurable, fault-tolerant web crawler written is Scala
☆29Oct 14, 2014Updated 11 years ago
Alternatives and similar repositories for fureteur
Users that are interested in fureteur are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆66Aug 5, 2016Updated 9 years ago
- sbt APIs targeted for eventual inclusion in sbt core☆12Feb 21, 2015Updated 11 years ago
- Blog crawler for the blogforever project.☆23Jan 31, 2014Updated 12 years ago
- Next-generation Cassandra Conference, September 26, 2017☆12Aug 23, 2018Updated 7 years ago
- Crawl-Anywhere - Web Crawler and document processing pipeline with Solr integration.☆99Jul 1, 2017Updated 9 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Autoproxy automatically detects proxies and stores them in the respective environment variables (e.g. http_proxy).☆14Oct 2, 2016Updated 9 years ago
- Making sense of online conversations as networks☆36May 6, 2021Updated 5 years ago
- Sketch adaptors for Pig.☆10May 15, 2026Updated last month
- ☆10Feb 26, 2019Updated 7 years ago
- An unofficial dashboard of which online services are available or blocked by federal government departments in Canada.☆16Mar 6, 2024Updated 2 years ago
- Scala DSL for web crawling☆148Aug 2, 2016Updated 9 years ago
- Scala helpers for Dropwizard.☆85Aug 16, 2016Updated 9 years ago
- Bandit algorithms and test framework in Java☆41Jun 21, 2015Updated 11 years ago
- A free multithreaded proxy checking program written in Java. Load a proxy list and check each proxy to verify it's alive to create a new …☆11Nov 5, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Universal Forensic Indexer and Analyzer☆10Jan 8, 2017Updated 9 years ago
- ☆36Nov 7, 2023Updated 2 years ago
- Models of finite automata (DFA, NFA) with support of common operations and easily readable creation of objects☆14Feb 4, 2019Updated 7 years ago
- Web page content extractor☆32Feb 26, 2013Updated 13 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆43Dec 29, 2012Updated 13 years ago
- sbt-web plugin for checksum files☆32Jun 20, 2026Updated 2 weeks ago
- Standalone JavaScript client for websocket-rails.☆10Apr 7, 2015Updated 11 years ago
- it's a simple LKM rootkit.☆12Aug 2, 2016Updated 9 years ago
- Real-time, collaborative, threat modeling tool. / Un outil collaboratif de modélisation des menaces en temps réel.☆16Jun 22, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Kairos, combines a focused crawler and an information extraction engine, to convert a list of conference websites into a index filled wit…☆19Feb 20, 2011Updated 15 years ago
- Automatic CAPTCHA decoding☆11Apr 17, 2012Updated 14 years ago
- Spring Boot Web with Hessian☆11Jul 2, 2014Updated 12 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Jun 9, 2016Updated 10 years ago
- Windows Live API binding and connect support.☆18Dec 1, 2024Updated last year
- 基于搜索引擎实现网盘搜索☆12Nov 15, 2018Updated 7 years ago
- MOVED TO: github.com/akka/akka-persistence-dynamodb☆24Jan 30, 2016Updated 10 years ago
- CAROS yocto meta layer☆11Jun 23, 2017Updated 9 years ago
- My home directory, versioned. This includes my dotfiles, vim configuration, and a few other utilities.☆26Apr 9, 2026Updated 2 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A CoffeeScript plugin for SBT☆14Jun 20, 2026Updated 2 weeks ago
- Benchmarks for circe and other JSON libraries☆28Jun 30, 2025Updated last year
- Implementing java based text extractors as web APIs (currently only Boilerpipe & Goose)☆16Apr 1, 2012Updated 14 years ago
- Aperture-Tiles uses familiar web-based map interactions to allow exploration of arbitrary huge data sets.☆75May 23, 2023Updated 3 years ago
- port.js is an expanded version of Michael Gundlach’s Chrome-(and-Opera!)–to–Safari porting library for extensions <https://adblockforchro…☆62Aug 27, 2013Updated 12 years ago
- java分布式爬虫,主机和从机控制的机制☆14May 21, 2015Updated 11 years ago
- Rate limiting (throttling) implementation for Promises on Node.js☆17Oct 15, 2016Updated 9 years ago