For interacting with Nutch via Python
☆29 · Apr 5, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for nutchpy
Users that are interested in nutchpy are comparing it to the libraries listed below.
- Nutch-Python is a Python binding to the Apache Nutch™ REST services allowing Nutch to be called natively in the Python community. ☆39 · Apr 15, 2016 · Updated 10 years ago
- Stream Processing ToolKit ☆17 · Aug 14, 2015 · Updated 10 years ago
- tool for validating conda recipes and conda packages ☆13 · Aug 15, 2024 · Updated last year
- Big GeoSpatial Data Points Visualization Tool ☆19 · May 6, 2016 · Updated 9 years ago
- Browser-based annotation tool for Framenet ☆16 · Jan 27, 2015 · Updated 11 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them. ☆38 · Apr 9, 2024 · Updated 2 years ago
- open source, distributed, restful crawler engine ☆14 · Feb 3, 2015 · Updated 11 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamedEntityParser ☆13 · Feb 26, 2022 · Updated 4 years ago
- a simple crawler framework ☆56 · May 28, 2015 · Updated 10 years ago
- A Python/GeoJS bridge utilizing the Jupyter widget infrastructure ☆14 · Dec 30, 2022 · Updated 3 years ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narwhal does not use NLP it avoids complexity… ☆12 · Oct 16, 2018 · Updated 7 years ago
- An extended version of Scala's scaladoc command ☆21 · Jul 2, 2011 · Updated 14 years ago
- Translation quality evaluation for Firefox Translations models ☆12 · Oct 23, 2023 · Updated 2 years ago
- Use tensorflow and machine learning algorithms to predict the emotion depicted by an image of a face. ☆15 · Dec 8, 2015 · Updated 10 years ago
- Fast filtering and animation of large dynamic networks ☆39 · May 24, 2016 · Updated 9 years ago
- A stylish alternative for caching your map tiles. ☆15 · Jul 31, 2017 · Updated 8 years ago
- ☆18 · Apr 3, 2018 · Updated 8 years ago
- OBSOLETE | Simpler and centralized CI configuration for Python extensions. ☆16 · Mar 7, 2023 · Updated 3 years ago
- ISI tutorials ☆12 · Oct 28, 2016 · Updated 9 years ago
- Minimal web-based client for NewsBlur. ☆19 · Dec 7, 2014 · Updated 11 years ago
- A JupyterLab extension for GeoJS ☆17 · Jan 13, 2023 · Updated 3 years ago
- Gaia is a geospatial analysis library jointly developed by Kitware and Epidemico. ☆33 · Apr 8, 2019 · Updated 7 years ago
- Repo for the April 10-12 workshop to be held in Berkeley, CA ☆14 · Jul 19, 2018 · Updated 7 years ago
- Seed acquisition tool to bootstrap focused crawlers ☆23 · Apr 24, 2017 · Updated 9 years ago
- Code for Unsupervised Learning of Morphological Forest ☆14 · Aug 12, 2019 · Updated 6 years ago
- WPS API for ESGF Compute Working Team ☆13 · Mar 11, 2021 · Updated 5 years ago
- JSON-LD representation of EML ☆14 · Jan 15, 2026 · Updated 3 months ago
- The HDF5 Cloud Optimized Read Only Python Package ☆32 · Apr 6, 2026 · Updated 3 weeks ago
- ☆15 · Feb 28, 2019 · Updated 7 years ago
- Tensorflow data structures generated from protobuf definitions ☆19 · Oct 31, 2017 · Updated 8 years ago
- Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark. ☆423 · Mar 30, 2023 · Updated 3 years ago
- ☆32 · Jul 6, 2015 · Updated 10 years ago
- Buddies of Budgie documentation, built with Docusaurus. ☆12 · Mar 29, 2026 · Updated last month
- Sort-friendly URI Reordering Transform (SURT) python module ☆45 · Sep 11, 2025 · Updated 7 months ago
- An implementation of Mikolov's word2vec in Python using Theano and Lasagne. ☆37 · Jul 17, 2017 · Updated 8 years ago
- Run a Linux Desktop on a JupyterHub ☆13 · Mar 25, 2021 · Updated 5 years ago
- A collection of implementations of fair ML algorithms ☆12 · Jan 7, 2018 · Updated 8 years ago
- For Publishing ScalaJS Package to npm ☆14 · Jul 1, 2024 · Updated last year
- Highly flexible and efficient computation of n-dimensional binned statistic(s) for n-variable(s) ☆11 · Mar 31, 2025 · Updated last year