Apache Nutch fork tuned for web services and data discovery.
☆10May 18, 2015Updated 10 years ago
Alternatives and similar repositories for nutch-crawler
Users that are interested in nutch-crawler are comparing it to the libraries listed below
- ☆25Apr 6, 2015Updated 10 years ago
- Nutch with Cassandra and Elasticsearch on Docker☆17Oct 26, 2021Updated 4 years ago
- A python wrapper to the NASA Common Metadata Repository API☆20Oct 14, 2021Updated 4 years ago
- ☆28Jun 9, 2016Updated 9 years ago
- Zotero styles page☆14Feb 6, 2025Updated last year
- Teaching data visualization at Columbia University.☆10Oct 2, 2015Updated 10 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- JAWS is "Just A Web Shell" framework for delivering Force.com web applications to iOS (iPhone/iPad) devices.☆14Mar 27, 2011Updated 14 years ago
- Stac-fastapi implementation with DuckDB backend.☆15Sep 14, 2025Updated 5 months ago
- My dotfiles☆12Feb 9, 2026Updated 3 weeks ago
- Content will be open and available for use by our user communities, and will be used to help foster open science practices with our exter…☆12Nov 2, 2023Updated 2 years ago
- Mirror of Apache Pony Mail (Incubating) Site☆12Jul 19, 2024Updated last year
- A generic interface wrapping multiple backends to provide a consistent pubsub API☆13Oct 31, 2018Updated 7 years ago
- A toy HTTP server used as a sandbox for learning c++11 features, kqueue & libuv non-blocking IO☆11Jul 5, 2016Updated 9 years ago
- Hyper.sh Website☆12Mar 5, 2019Updated 7 years ago
- Ring middleware that uses tools.namespace to reload changed files☆11Aug 28, 2016Updated 9 years ago
- Little toolkit written in C to extract GPS data from Dash Cam 70mai Pro MP4 files to SRT (subtitles)☆11Jun 10, 2020Updated 5 years ago
- Pluto - A multi-sport betting bot for Discord☆21Feb 9, 2026Updated 3 weeks ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- ☆13Apr 11, 2022Updated 3 years ago
- Basic setup to start coding phel☆10Apr 2, 2023Updated 2 years ago
- ☆70Aug 9, 2021Updated 4 years ago
- A twitter streaming, website-scraping, websocket-transporting news delivery webapp written in Go☆10Jul 17, 2015Updated 10 years ago
- rfc3986 compliant url parser for janet.☆17Jun 13, 2022Updated 3 years ago
- A library for representing HTML in Janet☆12Apr 29, 2022Updated 3 years ago
- ☆14Jul 27, 2024Updated last year
- Simple red5 demo application. Broadcast your webcam and mic and play them on the client side. It has both client and server side code☆13Sep 25, 2014Updated 11 years ago
- A data management system for electronic tags on marine animals☆13Mar 31, 2025Updated 11 months ago
- Code for Max-Margin Deep Generative Models☆12Jan 1, 2015Updated 11 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- Sample (proof of concept) for data fetching with Amazon Lambda & SQS☆10Jan 21, 2015Updated 11 years ago
- Pangeo for the European Open Science cloud☆13Feb 6, 2026Updated 3 weeks ago
- Highly flexible and efficient computation of n-dimensional binned statistic(s) for n-variable(s)☆11Mar 31, 2025Updated 11 months ago
- Yet Another KDD Cup 2015 Solution.☆11Sep 11, 2015Updated 10 years ago
- The UberKit is a Rails plugin with a set of UI tools to ease common development.☆102Jan 14, 2010Updated 16 years ago
- Xapian full text search plugin for Ruby on Rails☆129Aug 29, 2018Updated 7 years ago
- Python based data warehouse solution for the Lambda Architecture.☆14Jun 24, 2015Updated 10 years ago
- ☆21Jul 6, 2015Updated 10 years ago
- ☆12Sep 19, 2022Updated 3 years ago